Differences
This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
optimal_values [2014/11/11 18:23] chris |
optimal_values [2014/11/17 04:26] chris |
||
---|---|---|---|
Line 123: | Line 123: | ||
== Notes == | == Notes == | ||
- | [[|1)]] In [[#References|(NTYPM2003)]] an incorrect optimal value for DecTiger h=3 was presented.\\ [[|2)]] The [[#References|(ADZ2009)]] paper included new results obtained using the algorithm presented in [[#References|(OWS2009)]].\\ [[|3)]] In these cases the results reported in the referred paper concern a different discount factor (usually as stated in the problem definition).\\ [[|4)]] In [[#References|(OWS2009)]] a typo in the third decimal was published for Fire Fighting h=2 (-4.3825 instead of the correct -4.3835). | + | [[|1)]] In [[#References|(NTYPM2003)]] an incorrect optimal value for DecTiger h=3 was presented.\\ [[|2)]] The [[#References|(ADZ2009)]] paper included new results obtained using the algorithm presented in [[#References|(OWS2009)]].\\ [[|3)]] In these cases the results reported in the referred paper concern a different discount factor (usually as stated in the problem definition).\\ [[|4)]] In [[#References|(OWS2009)]] a typo in the third decimal was published for Fire Fighting h=2 (-4.3825 instead of the correct -4.3835).\\ [[|5)]] In [[#References|(DABC2013)]] epsilon-optimal solutions were found with an epsilon of 0.01. |
== References == | == References == | ||
[[|NTYPM2003]] (Nair, Tambe, Yokoo, Pynadath & Marsella, IJCAI 2003)\\ [[|HBZ2004]] (Hansen, Bernstein & Zilberstein, AAAI 2004)\\ [[|SCZ2005]] (Szer, Charpillet & Zilberstein, UAI 2005)\\ [[|SC2006]] (Szer & Charpillet, AAAI 2006)\\ [[|OSV2008]] (Oliehoek, Spaan & Vlassis, JAIR 2008)\\ [[|OWS2009]] (Oliehoek, Whiteson & Spaan, AAMAS 2009)\\ [[|ADZ2009]] (Amato, Dibangoye & Zilberstein, ICAPS 2009)\\ [[|SOA2011]] (Spaan, Oliehoek & Amato, IJCAI 2011)\\ | [[|NTYPM2003]] (Nair, Tambe, Yokoo, Pynadath & Marsella, IJCAI 2003)\\ [[|HBZ2004]] (Hansen, Bernstein & Zilberstein, AAAI 2004)\\ [[|SCZ2005]] (Szer, Charpillet & Zilberstein, UAI 2005)\\ [[|SC2006]] (Szer & Charpillet, AAAI 2006)\\ [[|OSV2008]] (Oliehoek, Spaan & Vlassis, JAIR 2008)\\ [[|OWS2009]] (Oliehoek, Whiteson & Spaan, AAMAS 2009)\\ [[|ADZ2009]] (Amato, Dibangoye & Zilberstein, ICAPS 2009)\\ [[|SOA2011]] (Spaan, Oliehoek & Amato, IJCAI 2011)\\ | ||
+ | [[|DABC2013]] (Dibangoye, Amato, Buffet & Charpillet, IJCAI 2013)\\ | ||
===== Highest known values for Infinite-Horizon Dec-POMDPs ===== | ===== Highest known values for Infinite-Horizon Dec-POMDPs ===== | ||
- | The table below provides the highest known values for a range of benchmark problems with the common discount factor of 0.9. We also list the paper that each result first appeared. Note that the results are often an average over a number of runs, so single runs may have higher values than those listed here. | + | The table below provides the highest known values for a range of benchmark problems with the common discount factor of 0.9 (except Wireless network which used 0.99). We also list the paper that each result first appeared. Note that the results are often an average over a number of runs, so single runs may have higher values than those listed here. |
| [[http://www.st.ewi.tudelft.nl/~mtjspaan/decpomdp/dectiger.dpomdp|DecTiger]] ||| | | [[http://www.st.ewi.tudelft.nl/~mtjspaan/decpomdp/dectiger.dpomdp|DecTiger]] ||| |