Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
sdmia_invited_speakers [2015/09/28 15:43]
matthijs
sdmia_invited_speakers [2015/11/13 12:07] (current)
matthijs
Line 39: Line 39:
 [[http://​web.engr.oregonstate.edu/​~afern/​|Oregon State]] [[http://​web.engr.oregonstate.edu/​~afern/​|Oregon State]]
  
-Title: ​TBD\\+Title: ​**Kinder and Gentler Teaching Modes for Human-Assisted Policy Learning**\\
 Abstract:\\ Abstract:\\
-TBD+This talk considers the problem of teaching action policies to computers for sequential decision making. The vast majority of policy learning algorithms offer human teachers little flexibility in how policies are taught. In particular, one of two learning modes is typically considered: 1) Imitation learning, where the teacher demonstrates explicit action sequences to the learner, and 2) Reinforcement learning, where the teacher designs a reward function for the learner to autonomously optimize via practice. This is in sharp contrast to how humans teach other humans, where many other learning modes are commonly used besides imitation and practice. The talk will highlight some of our recent work on broadening the available learning modes for computer policy learners, with the eventual aim of allowing humans to teach computers more naturally and efficiently. In addition, we will sketch some of the challenges in this research direction for both policy learning and more general planning systems. 
 + 
  
 === Mykel Kochenderfer === === Mykel Kochenderfer ===
Line 53: Line 55:
 [[http://​teamcore.usc.edu/​tambe/​|USC]] [[http://​teamcore.usc.edu/​tambe/​|USC]]
  
-Title: ​TBD\\+Joint work with Eric Rice, Amulya Yadav, and Robin Petering. 
 + 
 +Title: ​**PSINET: Assisting HIV Prevention Amongst Homeless Youth using POMDPs**\\
 Abstract:\\ Abstract:\\
-TBD+Homeless youth are prone to Human Immunodeficiency 
 +Virus (HIV) due to their engagement in high risk behavior 
 +such as unprotected sex, sex under influence of 
 +drugs, etc. Many non-profit agencies conduct interventions 
 +to educate and train a select group of homeless 
 +youth about HIV prevention and treatment practices and 
 +rely on word-of-mouth spread of information through 
 +their social network. Previous work in strategic selection 
 +of intervention participants does not handle uncertainties 
 +in the social network’s structure and evolving 
 +network state, potentially causing significant shortcomings 
 +in spread of information. Thus, we developed 
 +PSINET, a decision support system to aid the agencies 
 +in this task. PSINET includes the following key novelties:​ 
 +(i) it handles uncertainties in network structure 
 +and evolving network state; (ii) it addresses these uncertainties 
 +by using POMDPs in influence maximization;​ 
 +and (iii) it provides algorithmic advances to allow high 
 +quality approximate solutions for such POMDPs. We are about 
 +to conduct a pilot test study with homeless youth in Los Angeles; 
 +we will present a progress report. ​
  
 === Jason Williams === === Jason Williams ===
Recent changes RSS feed Creative Commons License Donate Minima Template by Wikidesign Driven by DokuWiki