Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
optimal_values [2014/11/11 18:20]
chris
optimal_values [2014/11/17 04:26]
chris
Line 97: Line 97:
 | 3 | 9.380000 | [[#​References|(ADZ2009)]] [[#​References|(OWS2009)]] | [[#​note2|2)]] | | 3 | 9.380000 | [[#​References|(ADZ2009)]] [[#​References|(OWS2009)]] | [[#​note2|2)]] |
 | 4 | 10.180800 | [[#​References|(OWS2009)]] | [[#​note2|2)]] | | 4 | 10.180800 | [[#​References|(OWS2009)]] | [[#​note2|2)]] |
 +| 5 | 13.26 | [[#​References|(DABC2013)]] | [[#​note5|5)]] |
 +| 6 | 18.62 | [[#​References|(DABC2013)]] | [[#​note5|5)]] |
 +| 7 | 20.90 | [[#​References|(DABC2013)]] | [[#​note5|5)]] |
 +| 8 | 22.47 | [[#​References|(DABC2013)]] | [[#​note5|5)]] |
 +| 9 | 24.31 | [[#​References|(DABC2013)]] | [[#​note5|5)]] |
 +| 10 | 26.31 | [[#​References|(DABC2013)]] | [[#​note5|5)]] |
 | [[http://​www.st.ewi.tudelft.nl/​~mtjspaan/​decpomdp/​Grid3x3corners.dpomdp|Meeting in a 3x3 grid]] |||| | [[http://​www.st.ewi.tudelft.nl/​~mtjspaan/​decpomdp/​Grid3x3corners.dpomdp|Meeting in a 3x3 grid]] ||||
 ^ Horizon ^ Optimal value ^ First solved by ^ Notes ^ ^ Horizon ^ Optimal value ^ First solved by ^ Notes ^
Line 117: Line 123:
  
 == Notes == == Notes ==
-[[|1)]] In [[#​References|(NTYPM2003)]] an incorrect optimal value for DecTiger h=3 was presented.\\ [[|2)]] The [[#​References|(ADZ2009)]] paper included new results obtained using the algorithm presented in [[#​References|(OWS2009)]].\\ [[|3)]] In these cases the results reported in the referred paper concern a different discount factor (usually as stated in the problem definition).\\ [[|4)]] In [[#​References|(OWS2009)]] a typo in the third decimal was published for Fire Fighting h=2 (-4.3825 instead of the correct -4.3835).+[[|1)]] In [[#​References|(NTYPM2003)]] an incorrect optimal value for DecTiger h=3 was presented.\\ [[|2)]] The [[#​References|(ADZ2009)]] paper included new results obtained using the algorithm presented in [[#​References|(OWS2009)]].\\ [[|3)]] In these cases the results reported in the referred paper concern a different discount factor (usually as stated in the problem definition).\\ [[|4)]] In [[#​References|(OWS2009)]] a typo in the third decimal was published for Fire Fighting h=2 (-4.3825 instead of the correct -4.3835).\\ [[|5)]] In [[#​References|(DABC2013)]] epsilon-optimal solutions were found with an epsilon of 0.01.
  
 == References == == References ==
 [[|NTYPM2003]] (Nair, Tambe, Yokoo, Pynadath & Marsella, IJCAI 2003)\\ [[|HBZ2004]] (Hansen, Bernstein & Zilberstein,​ AAAI 2004)\\ [[|SCZ2005]] (Szer, Charpillet & Zilberstein,​ UAI 2005)\\ [[|SC2006]] (Szer & Charpillet, AAAI 2006)\\ [[|OSV2008]] (Oliehoek, Spaan & Vlassis, JAIR 2008)\\ [[|OWS2009]] (Oliehoek, Whiteson & Spaan, AAMAS 2009)\\ [[|ADZ2009]] (Amato, Dibangoye & Zilberstein,​ ICAPS 2009)\\ [[|SOA2011]] (Spaan, Oliehoek & Amato, IJCAI 2011)\\ [[|NTYPM2003]] (Nair, Tambe, Yokoo, Pynadath & Marsella, IJCAI 2003)\\ [[|HBZ2004]] (Hansen, Bernstein & Zilberstein,​ AAAI 2004)\\ [[|SCZ2005]] (Szer, Charpillet & Zilberstein,​ UAI 2005)\\ [[|SC2006]] (Szer & Charpillet, AAAI 2006)\\ [[|OSV2008]] (Oliehoek, Spaan & Vlassis, JAIR 2008)\\ [[|OWS2009]] (Oliehoek, Whiteson & Spaan, AAMAS 2009)\\ [[|ADZ2009]] (Amato, Dibangoye & Zilberstein,​ ICAPS 2009)\\ [[|SOA2011]] (Spaan, Oliehoek & Amato, IJCAI 2011)\\
 +[[|DABC2013]] (Dibangoye, Amato, Buffet & Charpillet, IJCAI 2013)\\
  
 ===== Highest known values for Infinite-Horizon Dec-POMDPs ===== ===== Highest known values for Infinite-Horizon Dec-POMDPs =====
  
-The table below provides the highest known values for  a range of benchmark problems with the common discount factor of 0.9. We also list the paper that each result first appeared. ​ Note that the results are often an average over a number of runs, so single runs may have higher values than those listed here.+The table below provides the highest known values for  a range of benchmark problems with the common discount factor of 0.9 (except Wireless network which used 0.99). We also list the paper that each result first appeared. ​ Note that the results are often an average over a number of runs, so single runs may have higher values than those listed here.
  
 | [[http://​www.st.ewi.tudelft.nl/​~mtjspaan/​decpomdp/​dectiger.dpomdp|DecTiger]] ||| | [[http://​www.st.ewi.tudelft.nl/​~mtjspaan/​decpomdp/​dectiger.dpomdp|DecTiger]] |||
Recent changes RSS feed Creative Commons License Donate Minima Template by Wikidesign Driven by DokuWiki