Differences
This shows you the differences between two versions of the page.
Next revision | Previous revision | ||
problem_domains [2013/05/23 21:43] matthijs created |
problem_domains [2019/02/14 12:41] (current) matthijs [General Dec-POMDPs] |
||
---|---|---|---|
Line 1: | Line 1: | ||
====== Dec-POMDP problem domains ====== | ====== Dec-POMDP problem domains ====== | ||
- | On this page we are collecting a number of Dec-POMDP problem descriptions, to create a repository containing several standard problem domains that can be used for benchmarking. Contributions are welcome. The models are provided in the .dpomdp file format, which is an extension of [[http://pomdp.org/pomdp/examples/index.shtml|Tony's POMDP file format]]. The file format is explained in example.dpomdp, and can be used in the [[http://www.science.uva.nl/~faolieho/index.php?fuseaction=software.madp|Multiagent decision process (MADP)]] Toolbox, which contains a parser for it. More information about this software toolbox can be found on its [[http://www.science.uva.nl/~faolieho/index.php?fuseaction=software.madp|homepage]] or in this [[http://users.isr.ist.utl.pt/~mtjspaan/pub/Spaan08msdm.pdf|MSDM 2008 paper]]. | + | On this page we are collecting a number of Dec-POMDP problem descriptions, to create a repository containing several standard problem domains that can be used for benchmarking. Contributions are welcome. The models are provided in the .dpomdp file format, which is an extension of [[http://pomdp.org/pomdp/examples/index.shtml|Tony's POMDP file format]]. The file format is explained in example.dpomdp, and can be used in the [[http://www.fransoliehoek.net/madp|Multiagent decision process (MADP)]] Toolbox, which contains a parser for it. More information about this software toolbox can be found on its [[http://www.fransoliehoek.net/madp|homepage]], on [[https://github.com/MADPToolbox/MADP|GitHub]] or in this [[http://users.isr.ist.utl.pt/~mtjspaan/pub/Spaan08msdm.pdf|MSDM 2008 paper]]. |
===== General Dec-POMDPs ===== | ===== General Dec-POMDPs ===== | ||
- | * dectiger.dpomdp, the Decentralized Tiger problem, introduced in (Nair, Tambe, Yokoo, Pynadath & Marsella, IJCAI 2003). | + | ^ File ^ Name ^ Reference and Notes ^ |
- | * broadcastChannel.dpomdp, the Broadcast Channel problem, introduced in (Hansen, Bernstein & Zilberstein, AAAI 2004). | + | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/dectiger.dpomdp|dectiger.dpomdp]] | Decentralized Tiger | (Nair, Tambe, Yokoo, Pynadath & Marsella, IJCAI 2003) | |
- | * GridSmall.dpomdp, the Meeting in a 2x2 Grid problem, originally from (Bernstein, Hansen & Zilberstein, IJCAI 2005), this is the two observations per agent version of (Amato, Bernstein & Zilberstein, AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM) 2006). | + | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/broadcastChannel.dpomdp|broadcastChannel.dpomdp]] | Broadcast Channel problem | (Hansen, Bernstein & Zilberstein, AAAI 2004) | |
- | * oneDoor_2_7_0.20_0.00_0_2.dpomdp, the One Door problem, introduced in (Oliehoek, Spaan & Vlassis, AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM) 2007). Other versions of this problem are available upon request. | + | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/GridSmall.dpomdp|GridSmall.dpomdp]] | Meeting in a 2x2 Grid | originally from (Bernstein, Hansen & Zilberstein, IJCAI 2005), this is the two observations per agent version of (Amato, Bernstein & Zilberstein, AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM) 2006) | |
- | * boxPushingUAI07.dpomdp, the Cooperative Box Pushing problem, introduced in (Seuken & Zilberstein, UAI 2007). | + | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/oneDoor_2_7_0.20_0.00_0_2.dpomdp|oneDoor_2_7_0.20_0.00_0_2.dpomdp]] | One Door | (Oliehoek, Spaan & Vlassis, AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains (MSDM) 2007). Other versions of this problem are available upon request. | |
- | * recycling.dpomdp, the Recycling Robots problem, introduced in (Amato, Bernstein & Zilberstein, UAI 2007). | + | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/boxPushingUAI07.dpomdp|boxPushingUAI07.dpomdp]] | Cooperative Box Pushing | (Seuken & Zilberstein, UAI 2007) | |
- | * fireFighting_2_3_3.dpomdp.gz, the Fire Fighting problem with 3 houses and 3 fire levels, introduced in (Oliehoek, Spaan & Vlassis, JAIR 2008). | + | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/recycling.dpomdp|recycling.dpomdp]] | Recycling Robots | (Amato, Bernstein & Zilberstein, UAI 2007) | |
- | * fireFighting_2_4_3.dpomdp.gz, the Fire Fighting problem with 4 houses and 3 fire levels, introduced in (Oliehoek, Spaan & Vlassis, JAIR 2008). | + | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/fireFighting_2_3_3.dpomdp.gz|fireFighting_2_3_3.dpomdp.gz]] | Fire Fighting problem with 3 houses and 3 fire levels | (Oliehoek, Spaan & Vlassis, JAIR 2008) | |
- | * Mars.dpomdp, the Mars rover problem, introduced in (Amato & Zilberstein, AAMAS 2009). | + | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/fireFighting_2_4_3.dpomdp.gz|fireFighting_2_4_3.dpomdp.gz]] | Fire Fighting problem with 4 houses and 3 fire levels | (Oliehoek, Spaan & Vlassis, JAIR 2008) | |
- | * Grid3x3corners.dpomdp, the Meeting in a 3x3 grid problem, introduced in (Amato, Dibangoye & Zilberstein, ICAPS 2009). | + | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/Mars.dpomdp.gz|Mars.dpomdp.gz]] | Mars rover | (Amato & Zilberstein, AAMAS 2009) | |
+ | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/Grid3x3corners.dpomdp.gz|Grid3x3corners.dpomdp.gz]]| Meeting in a 3x3 grid | (Amato, Dibangoye & Zilberstein, ICAPS 2009) | | ||
+ | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/alignment_2x4.dpomdp|alignment_2x4.dpomdp]], [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/alignment_noisy.tar.gz| Three larger versions of this problem]] | [[http://orca.st.usm.edu/~banerjee/alignment/|Robot Alignment]] | (Kraemer and Banerjee, AAMAS 2013) | | ||
+ | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/wirelessDelay.dpomdp|wirelessDelay.dpomdp]] | Wireless networking problem | (Pajarinen and Peltonen, NIPS 2011) | | ||
+ | | [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/wirelessWithOverhead.dpomdp|wirelessWithOverhead.dpomdp]], [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/longFireFight.dpomdp|longFireFight.dpomdp]] | Other problems from PP ECML 2013 | (Pajarinen and Peltonen, ECML 2013) | | ||
===== Transition and observation independent Dec-POMDPs ===== | ===== Transition and observation independent Dec-POMDPs ===== | ||
- | * hotel1.tar.gz, hotel2.tar.gz, the two Hotel problems introduced in (Spaan & Melo, AAMAS 2008). | + | * [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/hotel1.tar.gz|hotel1.tar.gz]], [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/hotel2.tar.gz|hotel2.tar.gz]], the two Hotel problems introduced in (Spaan & Melo, AAMAS 2008). |
- | * ISR_2.tar.gz, Pent_2.tar.gz, MIT_2.tar.gz, SUNY_2.tar.gz, CIT_2.tar.gz, CMU_2.tar.gz, the two-robot navigation problems introduced in (Spaan & Melo, AAMAS 2008), inspired by Tony Cassandra's single-agent navigation domains. | + | * [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/ISR_2.tar.gz|ISR_2.tar.gz]], [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/Pent_2.tar.gz|Pent_2.tar.gz]], [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/MIT_2.tar.gz|MIT_2.tar.gz]], [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/SUNY_2.tar.gz|SUNY_2.tar.gz]], [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/CIT_2.tar.gz|CIT_2.tar.gz]], [[http://www.st.ewi.tudelft.nl/mtjspaan/decpomdp/CMU_2.tar.gz|CMU_2.tar.gz]], the two-robot navigation problems introduced in (Spaan & Melo, AAMAS 2008), inspired by Tony Cassandra's single-agent navigation domains. |