skip to main content
10.1145/3200947.3201010acmotherconferencesArticle/Chapter ViewAbstractPublication PagessetnConference Proceedingsconference-collections
research-article

Multiagent Reinforcement Learning Methods to Resolve Demand Capacity Balance Problems

Published: 09 July 2018 Publication History

Abstract

In this article, we explore the computation of joint policies for autonomous agents to resolve congestions problems in the air traffic management (ATM) domain. Agents, representing flights, have limited information about others' payoffs and preferences, and need to coordinate to achieve their tasks while adhering to operational constraints. We formalize the problem as a multiagent Markov decision process (MDP) towards deciding flight delays to resolve demand and capacity balance (DCB) problems in ATM. To this end, we present multiagent reinforcement learning methods that allow agents to interact and form own policies in coordination with others. Experimental study on real-world cases, confirms the effectiveness of our approach in resolving the demand-capacity balance problem.

References

[1]
2011. Air Traffic Flow and Capacity Management (ATFCM). (2011). Eurocontrol, http://www.eurocontrol.int/articles/air-traffic-flow-and-capacity-management.
[2]
Adrian K Agogino and Kagan Tumer. 2012. A multiagent approach to managing air traffic flow. Autonomous Agents and Multi-Agent Systems 24, 1 (2012), 1--25.
[3]
Baraa Munqith Albaker and Nasrudin Abd Rahim. 2010. Unmanned aircraft collision avoidance system using cooperative agent-based negotiation approach. Int. J. Simulation, Syst. Sci. Technol 11, 4 (2010), 1--8.
[4]
Ana L. C. Bazzan, Joachim Wahle, and Franziska Klügl. 1999. Agents in Traffic Modelling - From Reactive to Social Behaviour. In KI-99: Advances in Artificial Intelligence, 23rd Annual German Conference on Artificial Intelligence, Bonn, Germany, September 13-15, 1999, Proceedings. 303--306.
[5]
Craig Boutilier. 1996. Planning, Learning and Coordination in Multiagent Decision Processes. In Proceedings of the Sixth Conference on Theoretical Aspects of Rationality and Knowledge, De Zeeuwse Stromen, The Netherlands, March 17-20 1996. 195--210.
[6]
Caroline Claus and Craig Boutilier. 1998. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems. In Proceedings of the Fifteenth National/Tenth Conference on Artificial Intelligence/Innovative Applications of Artificial Intelligence (AAAI '98/IAAI '98). American Association for Artificial Intelligence, Menlo Park, CA, USA, 746--752. http://dl.acm.org/citation.cfm?id=295240.295800
[7]
Andrew J Cook and Graham Tanner. 2015. European airline delay cost reference values. (2015). http://www.eurocontrol.int/publications/european-airline-delay-cost-reference-values.
[8]
K. Dresner and P. Stone. 2004. Multiagent traffic management: A reservationbased intersection control mechanism. In Proceedings of the 3rd International Conference on Autonomous Agents and Multiagent Systems (AAMAS '04). 530--537.
[9]
Carlos Guestrin, Michail Lagoudakis, and Ronald Parr. 2002. Coordinated Reinforcement Learning. In In Proceedings of the ICML-2002 The Nineteenth International Conference on Machine Learning. 227--234.
[10]
Carlos Ernesto Guestrin. 2003. Planning Under Uncertainty in Complex Structured Environments. Ph.D. Dissertation. Stanford, CA, USA. Advisor(s) Koller, Daphne. AAI3104233.
[11]
Jayesh K. Gupta, Maxim Egorov, and Mykel Kochenderfer. 2017. Cooperative Multiagent Control Using Deep Reinforcement Learning. In Adaptive Learning Agents at AAMAS-2017 (ALA2017) workshop.
[12]
Jelle R. Kok and Nikos Vlassis. 2006. Collaborative Multiagent Reinforcement Learning by Payoff Propagation. J. Mach. Learn. Res. 7 (Dec. 2006), 1789--1828. http://dl.acm.org/citation.cfm?id=1248547.1248612
[13]
Theocharis Kravaris, George A. Vouros, Christos Spatharis, Konstantinos Blekas, Georgios Chalkiadakis, and Jose Manuel Cordero Garcia. 2017. Learning Policies for Resolving Demand-Capacity Imbalances During Pre-tactical Air Traffic Management. In Multiagent System Technologies - 15th German Conference, MATES 2017, Leipzig, Germany, August 23-26, 2017, Proceedings. 238--255.
[14]
Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2016. Continuous control with deep reinforcement learning. In International Conference on Learning Representations (ICLR).
[15]
Kleanthis Malialis, Sam Devlin, and Daniel Kudenko. 2016. Resource Abstraction for Reinforcement Learning in Multiagent Congestion Problems. In Proceedings of the 15th International Conference on Autonomous Agents and Multiagent Systems (AAMAS '16). 503--511.
[16]
C. Meyers. 2006. Network flow problems and congestion games: complexity and approximation results. Ph.D. Dissertation. Cambridge, MA, USA.
[17]
I. Milchtaich. 2004. Social Optimality and Cooperation in Nonatomic Congestion Games. Journal of Economic Theory 114 (2004), 56--87.
[18]
Judea Pearl. 1988. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA.
[19]
Michal Penn, Maria Polukarov, and Moshe Tennenholtz. 2005. Congestion games with failures. In Proceedings 6th ACM Conference on Electronic Commerce (EC-2005), Vancouver, BC, Canada, June 5-8, 2005. 259--268.
[20]
R. W. Rosenthal. 1973. A Class of Games Possessing Pure-Strategy Nash Equilibria. International Journal of Game Theory 2 (1973), 65--67.
[21]
David Sislak, Přemysl Volf, and Michal Pechoucek. 2011. Agent-based cooperative decentralized airplane-collision avoidance. IEEE Transactions on Intelligent Transportation Systems 12, 1 (2011), 36--46.
[22]
Richard S Sutton and Andrew G Barto. 1998. Reinforcement learning: An introduction. MIT press Cambridge.

Cited By

View all
  1. Multiagent Reinforcement Learning Methods to Resolve Demand Capacity Balance Problems

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    SETN '18: Proceedings of the 10th Hellenic Conference on Artificial Intelligence
    July 2018
    339 pages
    ISBN:9781450364331
    DOI:10.1145/3200947
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    In-Cooperation

    • EETN: Hellenic Artificial Intelligence Society
    • UOP: University of Patras
    • University of Thessaly: University of Thessaly, Volos, Greece

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 09 July 2018

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Air Traffic Management
    2. Demand Capacity Balance
    3. Multi-agent reinforcement learning
    4. congestion problems

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    SETN '18

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)27
    • Downloads (Last 6 weeks)3
    Reflects downloads up to 06 Jan 2025

    Other Metrics

    Citations

    Cited By

    View all

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media