Thomy Phan
2020 – today
- 2024
- [j3]Thomy Phan, Felix Sommer, Fabian Ritz, Philipp Altmann, Jonas Nüßlein, Michael Kölle, Lenz Belzner, Claudia Linnhoff-Popien:
Emergent cooperation from mutual acknowledgment exchange in multi-agent reinforcement learning. Auton. Agents Multi Agent Syst. 38(2): 34 (2024)
- [c42]Thomy Phan, Taoan Huang, Bistra Dilkina, Sven Koenig:
Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search. AAAI 2024: 17514-17522
- [c41]Thomy Phan, Joseph Driscoll, Justin Romberg, Sven Koenig:
Confidence-Based Curriculum Learning for Multi-Agent Path Finding. AAMAS 2024: 1558-1566
- [c40]Philipp Altmann, Adelina Bärligea, Jonas Stein, Michael Kölle, Thomas Gabor, Thomy Phan, Claudia Linnhoff-Popien:
Quantum Circuit Design: A Reinforcement Learning Challenge. AAMAS 2024: 2123-2125
- [c39]Shao-Hung Chan, Zhe Chen, Dian-Lun Lin, Yue Zhang, Daniel Harabor, Sven Koenig, Tsung-Wei Huang, Thomy Phan:
Anytime Multi-Agent Path Finding using Operation Parallelism in Large Neighborhood Search. AAMAS 2024: 2183-2185
- [c38]Michael Kölle, Yannick Erpelding, Fabian Ritz, Thomy Phan, Steffen Illium, Claudia Linnhoff-Popien:
Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics Through Multi-Agent Reinforcement Learning Algorithms. ICAART (1) 2024: 59-70
- [c37]Michael Kölle, Felix Topp, Thomy Phan, Philipp Altmann, Jonas Nüßlein, Claudia Linnhoff-Popien:
Multi-Agent Quantum Reinforcement Learning Using Evolutionary Optimization. ICAART (1) 2024: 71-82
- [c36]Robert Müller, Hasan Turalic, Thomy Phan, Michael Kölle, Jonas Nüßlein, Claudia Linnhoff-Popien:
ClusterComm: Discrete Communication in Decentralized MARL Using Internal Representation Clustering. ICAART (1) 2024: 305-312
- [i28]Robert Müller, Hasan Turalic, Thomy Phan, Michael Kölle, Jonas Nüßlein, Claudia Linnhoff-Popien:
ClusterComm: Discrete Communication in Decentralized MARL using Internal Representation Clustering. CoRR abs/2401.03504 (2024)
- [i27]Thomy Phan, Joseph Driscoll, Justin Romberg, Sven Koenig:
Confidence-Based Curriculum Learning for Multi-Agent Path Finding. CoRR abs/2401.05860 (2024)
- [i26]Michael Kölle, Yannick Erpelding, Fabian Ritz, Thomy Phan, Steffen Illium, Claudia Linnhoff-Popien:
Aquarium: A Comprehensive Framework for Exploring Predator-Prey Dynamics through Multi-Agent Reinforcement Learning Algorithms. CoRR abs/2401.07056 (2024)
- [i25]Shao-Hung Chan, Zhe Chen, Dian-Lun Lin, Yue Zhang, Daniel Harabor, Tsung-Wei Huang, Sven Koenig, Thomy Phan:
Anytime Multi-Agent Path Finding using Operation Parallelism in Large Neighborhood Search. CoRR abs/2402.01961 (2024)
- [i24]Philipp Altmann, Katharina Winter, Michael Kölle, Maximilian Zorn, Thomy Phan, Claudia Linnhoff-Popien:
MEDIATE: Mutually Endorsed Distributed Incentive Acknowledgment Token Exchange. CoRR abs/2404.03431 (2024)
- [i23]Michael Kölle, Karola Schneider, Sabrina Egger, Felix Topp, Thomy Phan, Philipp Altmann, Jonas Nüßlein, Claudia Linnhoff-Popien:
Architectural Influence on Variational Quantum Circuits in Multi-Agent Reinforcement Learning: Evolutionary Strategies for Optimization. CoRR abs/2407.20739 (2024)
- [i22]Thomy Phan, Benran Zhang, Shao-Hung Chan, Sven Koenig:
Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic. CoRR abs/2408.02960 (2024)
- 2023
- [b1]Thomy Phan:
Emergence and resilience in multi-agent reinforcement learning. Ludwig Maximilian University of Munich, Germany, 2023
- [c35]Thomy Phan, Fabian Ritz, Jonas Nüßlein, Michael Kölle, Thomas Gabor, Claudia Linnhoff-Popien:
Attention-Based Recurrency for Multi-Agent Reinforcement Learning under State Uncertainty. AAMAS 2023: 2839-2841
- [c34]Thomy Phan, Fabian Ritz, Philipp Altmann, Maximilian Zorn, Jonas Nüßlein, Michael Kölle, Thomas Gabor, Claudia Linnhoff-Popien:
Attention-Based Recurrence for Multi-Agent Reinforcement Learning under Stochastic Partial Observability. ICML 2023: 27840-27853
- [c33]Philipp Altmann, Fabian Ritz, Leonard Feuchtinger, Jonas Nüßlein, Claudia Linnhoff-Popien, Thomy Phan:
CROP: Towards Distributional-Shift Robust Reinforcement Learning Using Compact Reshaped Observation Processing. IJCAI 2023: 3414-3422
- [c32]Felip Guimerà Cuevas, Thomy Phan, Helmut Schmid:
Adaptive Bi-nonlinear Neural Networks Based on Complex Numbers with Weights Constrained Along the Unit Circle. PAKDD (1) 2023: 355-366
- [p1]Thomy Phan:
Emergenz und Resilienz in lernenden Multiagentensystemen. Ausgezeichnete Informatikdissertationen 2023: 231-240
- [i21]Thomy Phan, Fabian Ritz, Jonas Nüßlein, Michael Kölle, Thomas Gabor, Claudia Linnhoff-Popien:
Attention-Based Recurrency for Multi-Agent Reinforcement Learning under State Uncertainty. CoRR abs/2301.01649 (2023)
- [i20]Philipp Altmann, Thomy Phan, Fabian Ritz, Thomas Gabor, Claudia Linnhoff-Popien:
DIRECT: Learning from Sparse and Shifting Rewards using Discriminative Reward Co-Training. CoRR abs/2301.07421 (2023)
- [i19]Philipp Altmann, Fabian Ritz, Leonard Feuchtinger, Jonas Nüßlein, Claudia Linnhoff-Popien, Thomy Phan:
CROP: Towards Distributional-Shift Robust Reinforcement Learning using Compact Reshaped Observation Processing. CoRR abs/2304.13616 (2023)
- [i18]Michael Kölle, Felix Topp, Thomy Phan, Philipp Altmann, Jonas Nüßlein, Claudia Linnhoff-Popien:
Multi-Agent Quantum Reinforcement Learning using Evolutionary Optimization. CoRR abs/2311.05546 (2023)
- [i17]Philipp Altmann, Adelina Bärligea, Jonas Stein, Michael Kölle, Thomas Gabor, Thomy Phan, Claudia Linnhoff-Popien:
Challenges for Reinforcement Learning in Quantum Computing. CoRR abs/2312.11337 (2023)
- [i16]Thomy Phan, Taoan Huang, Bistra Dilkina, Sven Koenig:
Adaptive Anytime Multi-Agent Path Finding Using Bandit-Based Large Neighborhood Search. CoRR abs/2312.16767 (2023)
- 2022
- [c31]Thomy Phan, Felix Sommer, Philipp Altmann, Fabian Ritz, Lenz Belzner, Claudia Linnhoff-Popien:
Emergent Cooperation from Mutual Acknowledgment Exchange. AAMAS 2022: 1047-1055
- [c30]Robert Müller, Steffen Illium, Thomy Phan, Tom Haider, Claudia Linnhoff-Popien:
Towards Anomaly Detection in Reinforcement Learning. AAMAS 2022: 1799-1803
- [c29]Fabian Ritz, Thomy Phan, Andreas Sedlmeier, Philipp Altmann, Jan Wieghardt, Reiner N. Schmid, Horst Sauer, Cornel Klein, Claudia Linnhoff-Popien, Thomas Gabor:
Capturing Dependencies Within Machine Learning via a Formal Process Model. ISoLA (3) 2022: 249-265
- [i15]Fabian Ritz, Thomy Phan, Andreas Sedlmeier, Philipp Altmann, Jan Wieghardt, Reiner N. Schmid, Horst Sauer, Cornel Klein, Claudia Linnhoff-Popien, Thomas Gabor:
Capturing Dependencies within Machine Learning via a Formal Process Model. CoRR abs/2208.05219 (2022)
- 2021
- [j2]Thomas Gabor, Thomy Phan, Claudia Linnhoff-Popien:
Productive fitness in diversity-aware evolutionary algorithms. Nat. Comput. 20(3): 363-376 (2021)
- [c28]Thomy Phan, Lenz Belzner, Thomas Gabor, Andreas Sedlmeier, Fabian Ritz, Claudia Linnhoff-Popien:
Resilient Multi-Agent Reinforcement Learning with Adversarial Value Decomposition. AAAI 2021: 11308-11316
- [c27]Fabian Ritz, Thomy Phan, Robert Müller, Thomas Gabor, Andreas Sedlmeier, Marc Zeller, Jan Wieghardt, Reiner N. Schmid, Horst Sauer, Cornel Klein, Claudia Linnhoff-Popien:
Specification Aware Multi-Agent Reinforcement Learning. ICAART (Revised Selected Papers) 2021: 3-21
- [c26]Fabian Ritz, Thomy Phan, Robert Müller, Thomas Gabor, Andreas Sedlmeier, Marc Zeller, Jan Wieghardt, Reiner N. Schmid, Horst Sauer, Cornel Klein, Claudia Linnhoff-Popien:
SAT-MARL: Specification Aware Training in Multi-Agent Reinforcement Learning. ICAART (1) 2021: 28-37
- [c25]Fabian Ritz, Daniel Ratke, Thomy Phan, Lenz Belzner, Claudia Linnhoff-Popien:
A Sustainable Ecosystem through Emergent Cooperation in Multi-Agent Reinforcement Learning. ALIFE 2021: 74
- [c24]Thomy Phan, Fabian Ritz, Lenz Belzner, Philipp Altmann, Thomas Gabor, Claudia Linnhoff-Popien:
VAST: Value Function Factorization with Variable Agent Sub-Teams. NeurIPS 2021: 24018-24032
- 2020
- [j1]Thomas Gabor, Andreas Sedlmeier, Thomy Phan, Fabian Ritz, Marie Kiermeier, Lenz Belzner, Bernhard Kempter, Cornel Klein, Horst Sauer, Reiner N. Schmid, Jan Wieghardt, Marc Zeller, Claudia Linnhoff-Popien:
The scenario coevolution paradigm: adaptive quality assurance for adaptive systems. Int. J. Softw. Tools Technol. Transf. 22(4): 457-476 (2020)
- [c23]Thomy Phan, Thomas Gabor, Andreas Sedlmeier, Fabian Ritz, Bernhard Kempter, Cornel Klein, Horst Sauer, Reiner N. Schmid, Jan Wieghardt, Marc Zeller, Claudia Linnhoff-Popien:
Learning and Testing Resilience in Cooperative Multi-Agent Systems. AAMAS 2020: 1055-1063
- [c22]Kyrill Schmid, Lenz Belzner, Thomy Phan, Thomas Gabor, Claudia Linnhoff-Popien:
Multi-agent Reinforcement Learning for Bargaining under Risk and Asymmetric Information. ICAART (1) 2020: 144-151
- [c21]Carsten Hahn, Thomy Phan, Sebastian Feld, Christoph Roch, Fabian Ritz, Andreas Sedlmeier, Thomas Gabor, Claudia Linnhoff-Popien:
Nash Equilibria in Multi-Agent Swarms. ICAART (1) 2020: 234-241
- [c20]Andreas Sedlmeier, Thomas Gabor, Thomy Phan, Lenz Belzner, Claudia Linnhoff-Popien:
Uncertainty-based Out-of-Distribution Classification in Deep Reinforcement Learning. ICAART (2) 2020: 522-529
- [c19]Christoph Roch, Thomy Phan, Sebastian Feld, Robert Müller, Thomas Gabor, Carsten Hahn, Claudia Linnhoff-Popien:
A Quantum Annealing Algorithm for Finding Pure Nash Equilibria in Graphical Games. ICCS (6) 2020: 488-501
- [c18]Christoph Roch, Alexander Impertro, Thomy Phan, Thomas Gabor, Sebastian Feld, Claudia Linnhoff-Popien:
Cross Entropy Hyperparameter Optimization for Constrained Problem Hamiltonians Applied to QAOA. ICRC 2020: 50-57
- [c17]Thomas Gabor, Sebastian Feld, Hila Safi, Thomy Phan, Claudia Linnhoff-Popien:
Insights on Training Neural Networks for QUBO Tasks. ICSE (Workshops) 2020: 436-441
- [c16]Thomas Gabor, Leo Sünkel, Fabian Ritz, Thomy Phan, Lenz Belzner, Christoph Roch, Sebastian Feld, Claudia Linnhoff-Popien:
The Holy Grail of Quantum Artificial Intelligence: Major Challenges in Accelerating the Machine Learning Pipeline. ICSE (Workshops) 2020: 456-461
- [c15]Carsten Hahn, Fabian Ritz, Paula Wikidal, Thomy Phan, Thomas Gabor, Claudia Linnhoff-Popien:
Foraging Swarms using Multi-Agent Reinforcement Learning. ALIFE 2020: 333-340
- [c14]Fabian Ritz, Felix Hohnstein, Robert Müller, Thomy Phan, Thomas Gabor, Carsten Hahn, Claudia Linnhoff-Popien:
Towards Ecosystem Management from Greedy Reinforcement Learning in a Predator-Prey Setting. ALIFE 2020: 518-525
- [i14]Andreas Sedlmeier, Thomas Gabor, Thomy Phan, Lenz Belzner, Claudia Linnhoff-Popien:
Uncertainty-Based Out-of-Distribution Classification in Deep Reinforcement Learning. CoRR abs/2001.00496 (2020)
- [i13]Thomas Gabor, Leo Sünkel, Fabian Ritz, Thomy Phan, Lenz Belzner, Christoph Roch, Sebastian Feld, Claudia Linnhoff-Popien:
The Holy Grail of Quantum Artificial Intelligence: Major Challenges in Accelerating the Machine Learning Pipeline. CoRR abs/2004.14035 (2020)
- [i12]Thomas Gabor, Sebastian Feld, Hila Safi, Thomy Phan, Claudia Linnhoff-Popien:
Insights on Training Neural Networks for QUBO Tasks. CoRR abs/2004.14036 (2020)
- [i11]Markus Friedrich, Sebastian Feld, Thomy Phan, Pierre-Alain Fayolle:
Accelerating Evolutionary Construction Tree Extraction via Graph Partitioning. CoRR abs/2008.03669 (2020)
- [i10]Fabian Ritz, Thomy Phan, Robert Müller, Thomas Gabor, Andreas Sedlmeier, Marc Zeller, Jan Wieghardt, Reiner N. Schmid, Horst Sauer, Cornel Klein, Claudia Linnhoff-Popien:
SAT-MARL: Specification Aware Training in Multi-Agent Reinforcement Learning. CoRR abs/2012.07949 (2020)
2010 – 2019
- 2019
- [c13]Thomy Phan, Lenz Belzner, Marie Kiermeier, Markus Friedrich, Kyrill Schmid, Claudia Linnhoff-Popien:
Memory Bounded Open-Loop Planning in Large POMDPs Using Thompson Sampling. AAAI 2019: 7941-7948
- [c12]Thomy Phan, Kyrill Schmid, Lenz Belzner, Thomas Gabor, Sebastian Feld, Claudia Linnhoff-Popien:
Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent Policies. AAMAS 2019: 2162-2164
- [c11]Thomas Gabor, Andreas Sedlmeier, Marie Kiermeier, Thomy Phan, Marcel Henrich, Monika Pichlmair, Bernhard Kempter, Cornel Klein, Horst Sauer, Reiner N. Schmid, Jan Wieghardt:
Scenario co-evolution for reinforcement learning on a grid world smart factory domain. GECCO 2019: 898-906
- [c10]Thomas Gabor, Jan Peter, Thomy Phan, Christian Meyer, Claudia Linnhoff-Popien:
Subgoal-Based Temporal Abstraction in Monte-Carlo Tree Search. IJCAI 2019: 5562-5568
- [c9]Thomy Phan, Thomas Gabor, Robert Müller, Christoph Roch, Claudia Linnhoff-Popien:
Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning. IJCAI 2019: 5607-5613
- [c8]Carsten Hahn, Thomy Phan, Thomas Gabor, Lenz Belzner, Claudia Linnhoff-Popien:
Emergent Escape-based Flocking behavior using Multi-Agent Reinforcement Learning. ALIFE 2019: 598-605
- [i9]Andreas Sedlmeier, Thomas Gabor, Thomy Phan, Lenz Belzner, Claudia Linnhoff-Popien:
Uncertainty-Based Out-of-Distribution Detection in Deep Reinforcement Learning. CoRR abs/1901.02219 (2019)
- [i8]Thomy Phan, Kyrill Schmid, Lenz Belzner, Thomas Gabor, Sebastian Feld, Claudia Linnhoff-Popien:
Distributed Policy Iteration for Scalable Approximation of Cooperative Multi-Agent Policies. CoRR abs/1901.08761 (2019)
- [i7]Christoph Roch, Thomy Phan, Sebastian Feld, Robert Müller, Thomas Gabor, Claudia Linnhoff-Popien:
A Quantum Annealing Algorithm for Finding Pure Nash Equilibria in Graphical Games. CoRR abs/1903.06454 (2019)
- [i6]Thomy Phan, Lenz Belzner, Marie Kiermeier, Markus Friedrich, Kyrill Schmid, Claudia Linnhoff-Popien:
Memory Bounded Open-Loop Planning in Large POMDPs using Thompson Sampling. CoRR abs/1905.04020 (2019)
- [i5]Carsten Hahn, Thomy Phan, Thomas Gabor, Lenz Belzner, Claudia Linnhoff-Popien:
Emergent Escape-based Flocking Behavior using Multi-Agent Reinforcement Learning. CoRR abs/1905.04077 (2019)
- [i4]Thomy Phan, Thomas Gabor, Robert Müller, Christoph Roch, Claudia Linnhoff-Popien:
Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning. CoRR abs/1907.05861 (2019)
- 2018
- [c7]Thomy Phan, Lenz Belzner, Thomas Gabor, Kyrill Schmid:
Leveraging Statistical Multi-Agent Online Planning with Emergent Value Function Approximation. AAMAS 2018: 730-738
- [c6]Thomas Gabor, Lenz Belzner, Thomy Phan, Kyrill Schmid:
Preparing for the Unexpected: Diversity Improves Planning Resilience in Evolutionary Algorithms. ICAC 2018: 131-140
- [c5]Kyrill Schmid, Lenz Belzner, Thomas Gabor, Thomy Phan:
Action Markets in Deep Multi-Agent Reinforcement Learning. ICANN (2) 2018: 240-249
- [c4]Marie Kiermeier, Sebastian Feld, Thomy Phan, Claudia Linnhoff-Popien:
Anomaly Detection in Spatial Layer Models of Autonomous Agents. IDEAL (1) 2018: 156-163
- [c3]Marie Kiermeier, Thomy Phan, Horst Sauer, Jan Wieghardt:
Monitoring Autonomous Agents in Self-Organizing Industrial Systems. INDIN 2018: 653-658
- [c2]Lenz Belzner, Kyrill Schmid, Thomy Phan, Thomas Gabor, Martin Wirsing:
The Sharer's Dilemma in Collective Adaptive Systems of Self-interested Agents. ISoLA (3) 2018: 241-256
- [c1]Kyrill Schmid, Lenz Belzner, Marie Kiermeier, Alexander Neitz, Thomy Phan, Thomas Gabor, Claudia Linnhoff:
Risk-Sensitivity in Simulation Based Online Planning. KI 2018: 229-240
- [i3]Thomy Phan, Lenz Belzner, Thomas Gabor, Kyrill Schmid:
Leveraging Statistical Multi-Agent Online Planning with Emergent Value Function Approximation. CoRR abs/1804.06311 (2018)
- [i2]Lenz Belzner, Kyrill Schmid, Thomy Phan, Thomas Gabor, Martin Wirsing:
The Sharer's Dilemma in Collective Adaptive Systems of Self-Interested Agents. CoRR abs/1804.10781 (2018)
- [i1]Thomas Gabor, Lenz Belzner, Thomy Phan, Kyrill Schmid:
Preparing for the Unexpected: Diversity Improves Planning Resilience in Evolutionary Algorithms. CoRR abs/1810.12483 (2018)
last updated on 2024-10-07 01:28 CEST by the dblp team
all metadata released as open data under CC0 1.0 license