Differences

This shows you the differences between two versions of the page.

sdmia_schedule [2015/10/16 16:32]
matthijs
sdmia_schedule [2015/11/13 12:08] (current)
matthijs
Line 7: Line 7:
 ^ ^ ^ Authors ^ Title ^ ^ ^ ^ Authors ^ Title ^
 |09:​00am| ​   09:​15am| ​       Welcome and introductions ​ ||    |09:​00am| ​   09:​15am| ​       Welcome and introductions ​ ||   
-|09:​15am| ​   10:05am|Milind Tambe, USC |TBA+|09:​15am| ​   10:00am|Milind Tambe, USC |[[sdmia_invited_speakers|Assisting HIV Prevention Amongst Homeless Youth using POMDPs 
-|10:05am|    10:​30am| ​       application discussion ​   ||+]]
 +|10:00am|    10:​30am| ​       application discussion ​   ||
 |10:​30am| ​   11:​00am| ​   //Coffee break// ​ ||        |10:​30am| ​   11:​00am| ​   //Coffee break// ​ ||       
-|11:​00am| ​   11:​45am|Shlomo Zilberstein,​ UMass Amherst |Do We Expect Too Much from DEC-POMDP Algorithms? |+|11:​00am| ​   11:​45am|Shlomo Zilberstein,​ UMass Amherst ​|[[sdmia_invited_speakers|Do We Expect Too Much from DEC-POMDP Algorithms?]] |
 |11:​45am| ​   12:10pm|C. Amato, G. Konidaris, S. Omidshafiei,​ A. Agha-Mohammadi,​ J. How and L. Kaelbling ​ |Probabilistic Planning for Decentralized Multi-Robot Systems| |11:​45am| ​   12:10pm|C. Amato, G. Konidaris, S. Omidshafiei,​ A. Agha-Mohammadi,​ J. How and L. Kaelbling ​ |Probabilistic Planning for Decentralized Multi-Robot Systems|
-|12:​10pm| ​   12:35pm|Siddharth ​Srivastava, ​Stuart ​Russell and Alessandro ​Pinto |Metaphysics of Planning Domain Descriptions |+|12:​10pm| ​   12:35pm|S. Srivastava, ​S. Russell and A. Pinto |Metaphysics of Planning Domain Descriptions |
 |12:​35pm| 02:​00pm| //​Lunch//​ || |12:​35pm| 02:​00pm| //​Lunch//​ ||
-|02:​00pm| 02:​25pm |Erwin Walraven and Matthijs ​T. J. Spaan |Planning under Uncertainty with Weighted State Scenarios | +|02:​00pm| 02:​25pm |E. Walraven and M. T. J. Spaan |Planning under Uncertainty with Weighted State Scenarios | 
-|02:​25pm| 02:​50pm |Fabio-Valerio ​Ferrari and Abdel-Illah ​Mouaddib |Hierarchical factored POMDP for joint tasks : application to escort tasks | +|02:​25pm| 02:​50pm |F. Ferrari and A. Mouaddib |Hierarchical factored POMDP for joint tasks : application to escort tasks | 
-|02:​50pm| 03:​15pm |Luis Pineda, ​Kyle Wray and Shlomo ​Zilberstein |Revisiting Multi-Objective MDPs with Relaxed Lexicographic Preferences|+|02:​50pm| 03:​15pm |L. Pineda, ​K. Wray and S. Zilberstein |Revisiting Multi-Objective MDPs with Relaxed Lexicographic Preferences|
 |03:​15pm| 03:​30pm | ​ open discussion || |03:​15pm| 03:​30pm | ​ open discussion ||
 |03:​30pm| 04:​00pm | ​ //Coffee break// || |03:​30pm| 04:​00pm | ​ //Coffee break// ||
-|04:​00pm| 04:​45pm |Mykel Kochenderfer,​ Stanford |Decision Theoretic Planning for Air Traffic Applications| +|04:​00pm| 04:​45pm |Mykel Kochenderfer,​ Stanford |[[sdmia_invited_speakers|Decision Theoretic Planning for Air Traffic Applications]]
-|04:​45pm| 05:​10pm |A. Reyes, P. H. Ibarguengoytia,​ I. Romero, D. Pech and M. Borunda |Open questions ​for building optimal operation policies for dam management using Factored Markov Decision Processes ​+|04:​45pm| 05:​10pm |B. Lacerda, D. Parker ​and N. Hawes |Nested Value Iteration ​for Partially Satisfiable Co-Safe LTL Specifications (Extended Abstract) ​
-|05:​10pm| 05:​35pm |Kyle Wray and Shlomo ​Zilberstein |A Parallel Point-Based POMDP Algorithm Leveraging GPUs |+|05:​10pm| 05:​35pm |K. Wray and S. Zilberstein |A Parallel Point-Based POMDP Algorithm Leveraging GPUs |
 |06:​00pm| 07:​00pm | ​ //​Reception//​ || |06:​00pm| 07:​00pm | ​ //​Reception//​ ||
  
Line 27: Line 28:
  
 ^ ^ ^ Authors ^ Title ^ ^ ^ ^ Authors ^ Title ^
-|09:​00am| 09:​45am |Jason Williams, Microsoft Research |Decision-theoretic control in dialog systems: recent progress and opportunities for research|+|09:​00am| 09:​45am |Jason Williams, Microsoft Research |[[sdmia_invited_speakers|Decision-theoretic control in dialog systems: recent progress and opportunities for research]]|
 |09:​45am| 10:​05am | ​ application discussion || |09:​45am| 10:​05am | ​ application discussion ||
-|10:​05am| 10:​30am |Daniel ​Urieli and Peter Stone |Autonomous Electricity Trading using Time-Of-Use Tariffs in a Competitive Market |+|10:​05am| 10:​30am |D. Urieli and P. Stone |Autonomous Electricity Trading using Time-Of-Use Tariffs in a Competitive Market |
 |10:​30am| 11:​00am | ​ //Coffee break// ​ ||  |10:​30am| 11:​00am | ​ //Coffee break// ​ ||
-|11:​00am| 11:​25am |Edmund ​Durfee and Satinder ​Singh |Commitment Semantics for Sequential Decision Making Under Reward Uncertainty |+|11:​00am| 11:​25am |E. Durfee and S. Singh |Commitment Semantics for Sequential Decision Making Under Reward Uncertainty |
 |11:​25am| 11:​50am |A. Iwasaki, T. Sekiguchi, S. Yamamoto and M. Yokoo |How is cooperation/​collusion sustained in repeated multimarket contact with observation errors? | |11:​25am| 11:​50am |A. Iwasaki, T. Sekiguchi, S. Yamamoto and M. Yokoo |How is cooperation/​collusion sustained in repeated multimarket contact with observation errors? |
-|11:​50am| 12:​10pm |S. Mcgregor, H. Buckingham, R. Houtman, C. Montgomery, R. Metoyer and T. Dietterich |Facilitating ​Testing ​and Debugging of Markov Decision Processes ​with Interactive Visualization ​+|11:​50am| 12:​10pm |S. Mcgregor, H. Buckingham, R. Houtman, C. Montgomery, R. Metoyer and T. Dietterich |MDPvis: An Interactive Visualization for Testing Markov Decision Processes | 
-|12:​10pm| 12:​30pm |Frans Oliehoek, ​Matthijs ​T. J. Spaan, ​Philipp ​Robbel and Joao Messias |The MADP Toolbox: An Open-Source Library for Planning and Learning in (Multi-)Agent Systems |+|12:​10pm| 12:​30pm |F. Oliehoek, ​M. T. J. Spaan, ​P. Robbel and J. Messias |The MADP Toolbox: An Open-Source Library for Planning and Learning in (Multi-)Agent Systems |
 |12:​30pm| 02:​00pm | ​ //​Lunch// ​ ||  |12:​30pm| 02:​00pm | ​ //​Lunch// ​ ||
-|02:​00pm| 02:​45pm |Craig Boutilier, Google |Large-scale MDPs in Practice: Opportunities and Challenges |+|02:​00pm| 02:​45pm |Craig Boutilier, Google |[[sdmia_invited_speakers|Large-scale MDPs in Practice: Opportunities and Challenges]] |
 |02:​45pm| 03:​00pm | ​ application discussion || |02:​45pm| 03:​00pm | ​ application discussion ||
-|03:​00pm| 03:​25pm |Philipp ​Robbel, ​Frans A. Oliehoek and Mykel J. Kochenderfer |Exploiting Anonymity in Approximate Linear Programming:​ Scaling to Large Multiagent MDPs |+|03:​00pm| 03:​25pm |P. Robbel, ​F. A. Oliehoek and M. J. Kochenderfer |Exploiting Anonymity in Approximate Linear Programming:​ Scaling to Large Multiagent MDPs |
 |03:​30pm| 04:​00pm | ​ //Coffee break// || |03:​30pm| 04:​00pm | ​ //Coffee break// ||
-|04:​00pm| 04:​45pm |Emma Brunskill, CMU |Quickly Learning to Make Good Decisions | +|04:​00pm| 04:​45pm |Emma Brunskill, CMU |[[sdmia_invited_speakers|Quickly Learning to Make Good Decisions]] 
-|04:​45pm| 05:​10pm |Martin ​Allen |Complexity of Self-Preserving,​ Team-Based Competition in Partially Observable Stochastic Games | +|04:​45pm| 05:​10pm |M. Allen |Complexity of Self-Preserving,​ Team-Based Competition in Partially Observable Stochastic Games | 
-|05:​10pm| 05:​35pm |Bruno Lacerda, David Parker ​and Nick Hawes |Nested Value Iteration ​for Partially ​Satisfiable Co-Safe LTL Specifications (Extended Abstract) ​|+|05:​10pm| 05:​35pm |M. Hausknecht ​and P. Stone |Deep Recurrent Q-Learning ​for Partially ​Observable MDPs |
 |06:​00pm| 07:​30pm | ​ //Plenary session// ​ ||  |06:​00pm| 07:​30pm | ​ //Plenary session// ​ ||
  
Line 48: Line 49:
  
 ^ ^ ^ Authors ^ Title ^ ^ ^ ^ Authors ^ Title ^
-|09:​00am| 09:​45am |Alan Fern, Oregon State |Quickly ​Learning ​to Make Good Decisions ​|+|09:​00am| 09:​45am |Alan Fern, Oregon State |[[sdmia_invited_speakers|Kinder and Gentler Teaching Modes for Human-Assisted Policy ​Learning]] |
 |09:​45am| 10:​00am | ​ application discussion || |09:​45am| 10:​00am | ​ application discussion ||
-|10:​00am| 10:​25am |Matthew Hausknecht ​and Peter Stone |Deep Recurrent Q-Learning ​for Partially Observable MDPs |+|10:​00am| 10:​25am |A. Reyes, P. H. Ibarguengoytia,​ I. Romero, D. Pech and M. Borunda |Open questions ​for building optimal operation policies for dam management using Factored Markov Decision Processes  ​|
 |10:​30am| 11:​00am | ​ //Coffee break// || |10:​30am| 11:​00am | ​ //Coffee break// ||
-|11:​00am| 11:​25am |D. ​Ellis Hershkowitz, ​James MacGlashan and Stefanie ​Tellex |Learning Propositional Functions for Planning and Reinforcement Learning | +|11:​00am| 11:​25am |D. ​E. Hershkowitz, ​J. MacGlashan and S. Tellex |Learning Propositional Functions for Planning and Reinforcement Learning | 
-|11:​25am| 11:​50am |Yusen Zhan and Matthew ​Taylor |Online Transfer Learning in Reinforcement Learning Domains |+|11:​25am| 11:​50am |Y. Zhan and M. Taylor |Online Transfer Learning in Reinforcement Learning Domains |
 |11:​50am| 12:​30pm | ​ wrapup discussion || |11:​50am| 12:​30pm | ​ wrapup discussion ||
  
  