skip to main content
10.5555/1558013.1558092guideproceedingsArticle/Chapter ViewAbstractPublication PagesaamasConference Proceedingsconference-collections
research-article
Free access

Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs

Published: 10 May 2009 Publication History

Abstract

Recent scaling up of decentralized partially observable Markov decision process (DEC-POMDP) solvers towards realistic applications is mainly due to approximate methods. Of this family, Memory Bounded Dynamic Programming (MBDP), which combines in a suitable manner top-down heuristics and bottom-up value function updates, can solve DEC-POMDPs with large horizons. The performances of MBDP, can be, however, drastically improved by avoiding the systematic generation and evaluation of all possible policies which result from the exhaustive backup. To achieve that, we suggest a heuristic search method, namely Point Based Incremental Pruning (PBIP), which is able to distinguish policies with different heuristic estimates. Taking this insight into account, PBIP searches only among the most promising policies, finds those useful, and prunes dominated ones. Doing so permits us to reduce clearly the amount of computation required by the exhaustive backup. The computation experiment shows that PBIP solves DEC-POMDP benchmarks up to 800 times faster than the current best approximate algorithms, while providing solutions with higher values.

References

[1]
R. E. Bellman. Dynamic Programming. Dover Publications, Incorporated, 1957.
[2]
D. S. Bernstein, R. Givan, N. Immerman, and S. Zilberstein. The complexity of decentralized control of markov decision processes. Math. Oper. Res., 27(4), 2002.
[3]
C. Besse and B. Chaib-draa. Parallel rollout for online solution of dec-pomdps. In FLAIRS Conference, pages 619--624, 2008.
[4]
A. Boularias and B. Chaib-draa. Exact dynamic programming for decentralized pomdps with lossless policy compression. In ICAPS, pages 20--27, 2008.
[5]
A. Carlin and S. Zilberstein. Value-based observation compression for DEC-POMDPs. In AAMAS, 2008.
[6]
E. A. Hansen, D. S. Bernstein, and S. Zilberstein. Dynamic programming for partially observable stochastic games. In AAAI, pages 709--715, 2004.
[7]
R. Nair, P. Varakantham, M. Tambe, and M. Yokoo. Networked distributed POMDPs: A synthesis of distributed constraint optimization and POMDPs. In AAAI, pages 133--139, 2005.
[8]
D. S. Nau, V. Kumar, and L. Kanal. General branch and bound, and its relation to a* and ao*. Artif. Intell., 23(1):29--58, 1984.
[9]
J. Pineau, G. Gordon, and S. Thrun. Point-based value iteration: An anytime algorithm for POMDPs. In IJCAI, 2003.
[10]
Z. Rabinovich, C. V. Goldman, and J. S. Rosenschein. The complexity of multiagent systems: the price of silence. In AAMAS, pages 1102--1103, 2003.
[11]
S. Seuken and S. Zilberstein. Improved Memory-Bounded Dynamic Programming for DEC-POMDPs. In UAI, 2007.
[12]
S. Seuken and S. Zilberstein. Memory-bounded dynamic programming for DEC-POMDPs. In IJCAI, pages 2009--2015, 2007.
[13]
D. Szer and F. Charpillet. Point-based dynamic programming for DEC-POMDPs. In AAAI, pages 16--20, July 2006.
[14]
D. Szer, F. Charpillet, and S. Zilberstein. Maa*: A heuristic search algorithm for solving decentralized POMDPs. In UAI, pages 568--576, 2005.
[15]
P. Varakantham, J. Marecki, M. Tambe, and M. Yokoo. Letting loose a spider on a network of POMDPs: Generating quality guaranteed policies. In AAMAS, May 2007.

Cited By

View all

Index Terms

  1. Point-based incremental pruning heuristic for solving finite-horizon DEC-POMDPs

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image Guide Proceedings
    AAMAS '09: Proceedings of The 8th International Conference on Autonomous Agents and Multiagent Systems - Volume 1
    May 2009
    701 pages
    ISBN:9780981738161

    Sponsors

    • Drexel University
    • Wiley-Blackwell
    • Microsoft Research: Microsoft Research
    • Whitestein Technologies
    • European Office of Aerospace Research and Development, Air Force Office of Scientific Research, United States Air Force Research Laboratory
    • The Foundation for Intelligent Physical Agents

    Publisher

    International Foundation for Autonomous Agents and Multiagent Systems

    Richland, SC

    Publication History

    Published: 10 May 2009

    Author Tags

    1. artificial intelligence
    2. branch-and-bound
    3. decentralized pomdps
    4. planning under uncertainty
    5. point-based solver

    Qualifiers

    • Research-article

    Acceptance Rates

    AAMAS '09 Paper Acceptance Rate 132 of 651 submissions, 20%;
    Overall Acceptance Rate 1,155 of 5,036 submissions, 23%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)25
    • Downloads (Last 6 weeks)6
    Reflects downloads up to 30 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Login options

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media