skip to main content
10.5555/2034396.2034404acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaamasConference Proceedingsconference-collections
research-article

Toward error-bounded algorithms for infinite-horizon DEC-POMDPs

Published: 02 May 2011 Publication History

Abstract

Over the past few years, attempts to scale up infinite-horizon DECPOMDPs are mainly due to approximate algorithms, but without the theoretical guarantees of their exact counterparts. In contrast, ε-optimal methods have only theoretical significance but are not efficient in practice. In this paper, we introduce an algorithmic frame-work (β-pi) that exploits the scalability of the former while preserving the theoretical properties of the latter. We build upon β-pi a family of approximate algorithms that can find (provably) errorbounded solutions in reasonable time. Among this family, h-pi uses a branch-and-bound search method that computes a near-optimal solution over distributions over histories experienced by the agents. These distributions often lie near structured, low-dimensional subspace embedded in the high-dimensional sufficient statistic. By planning only on this subspace, h-pi successfully solves all tested benchmarks, outperforming standard algorithms, both in solution time and policy quality.

References

[1]
C. Amato, D. S. Bernstein, and S. Zilberstein. Optimizing memory-bounded controllers for decentralized pomdps. In UAI, 2007.
[2]
C. Amato, J. S. Dibangoye, and S. Zilberstein. Incremental policy generation for finite-horizon dec-POMDPs. in ICAPS, 2009.
[3]
R. Aras and A. Dutech. An investigation into Mathematical Programming for Finite Horizon Decentralized POMDPs. in JAIR, 2010. to appear.
[4]
D. S. Bernstein, C. Amato, E. A. Hansen, and S. Zilberstein. Policy iteration for decentralized control of Markov Decision Processes. JAIR, 34:89--132, 2009.
[5]
D. S. Bernstein, R. Givan, N. Immerman, and S. Zilberstein. The complexity of decentralized control of Markov Decision Processes. Math. Oper. Res., 27(4), 2002.
[6]
A. Boularias and B. Chaib-draa. Exact dynamic programming for decentralized POMDPs with lossless policy compression. In ICAPS, pages 20--27, 2008.
[7]
A. Carlin and S. Zilberstein. Value-based observation compression for DEC-POMDPs. In AAMAS, 2008.
[8]
J. S. Dibangoye. Contribution à la résolution des problèmes décisionnels de Markov centralisés et décentralisés: algorithmes et théorie. PhD thesis.
[9]
J. S. Dibangoye, B. Chaib-draa, and A.-I. Mouaddib. Policy iteration algorithms for DEC-POMDPs with discounted rewards. in MSDM, 2009.
[10]
J. S. Dibangoye, A.-I. Mouaddib, and B. Chaib-draa. Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs. in AAMAS, 2009.
[11]
E. A. Hansen, D. S. Bernstein, and S. Zilberstein. Dynamic programming for partially observable stochastic games. In AAAI, pages 709--715, 2004.
[12]
A. Kumar and S. Zilberstein. Point-based backup for decentralized pomdps: complexity and new algorithms. In AAMAS, pages 1315--1322, 2010.
[13]
M. L. Putterman. Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley and Sons, New York, NY, 1994.
[14]
D. V. Pynadath and M. Tambe. Multiagent teamwork: analyzing the optimality and complexity of key theories and models. In AAMAS, pages 873--880, 2002.
[15]
Z. Rabinovich, C. V. Goldman, and J. S. Rosenschein. The complexity of multiagent systems: the price of silence. In AAMAS, pages 1102--1103, 2003.
[16]
S. Seuken and S. Zilberstein. Formal models and algorithms for decentralized decision making under uncertainty. JAAMAS, 17(2):190--250, 2008.
[17]
D. Szer and F. Charpillet. An optimal best-first search algorithm for solving infinite horizon DEC-POMDPs. In ECML, pages 389--399, 2005.
[18]
D. Szer and F. Charpillet. Point-based dynamic programming for DEC-POMDPs. In AAAI, pages 16--20, July 2006.

Cited By

View all
  • (2013)Producing efficient error-bounded solutions for transition independent decentralized mdpsProceedings of the 2013 international conference on Autonomous agents and multi-agent systems10.5555/2484920.2485006(539-546)Online publication date: 6-May-2013

Index Terms

  1. Toward error-bounded algorithms for infinite-horizon DEC-POMDPs

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    AAMAS '11: The 10th International Conference on Autonomous Agents and Multiagent Systems - Volume 3
    May 2011
    493 pages
    ISBN:098265717X

    Sponsors

    • IFAAMAS

    In-Cooperation

    Publisher

    International Foundation for Autonomous Agents and Multiagent Systems

    Richland, SC

    Publication History

    Published: 02 May 2011

    Check for updates

    Author Tags

    1. artificial intelligence
    2. decentralized POMDPs
    3. point-based solvers

    Qualifiers

    • Research-article

    Conference

    AAMAS'11
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)1
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 30 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all
    • (2013)Producing efficient error-bounded solutions for transition independent decentralized mdpsProceedings of the 2013 international conference on Autonomous agents and multi-agent systems10.5555/2484920.2485006(539-546)Online publication date: 6-May-2013

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media