skip to main content
10.1145/3430984.3431045acmotherconferencesArticle/Chapter ViewAbstractPublication PagescodsConference Proceedingsconference-collections
extended-abstract

A Multi-Agent Reinforcement Learning Approach for Stock Portfolio Allocation

Published: 02 January 2021 Publication History

Abstract

Stock portfolio allocation is one of the most challenging and interesting problems of modern finance. Recently, deep reinforcement learning applications have shown promising results in automating portfolio allocation. However, most current approaches use a single agent learning model which could inadequately capture the complex dynamics arising from the interactions of many traders in today’s stock market. In this paper, we explore the applicability of multi-agent deep reinforcement learning to this problem by implementing single-agent, 2-agent, 3-agent, and 4-agent deep deterministic policy gradients (DDPG) algorithms in a competitive setting. Upon analyzing the results obtained using standardized metrics, we observe that there is a significant improvement in the performance of our learning models with the introduction of multiple agents.

References

[1]
[1]Timothy P. Lillicrap et al. 2015. Continuous control with deep reinforcement learning. arXiv:1509.02971v6. Retrieved from https://arxiv.org/abs/1509.02971v6
[2]
[2]Zhuoran Xiong et al. 2018. Practical deep reinforcement learning approach for stock trading. arXiv:1811.07522v2. Retrieved from https://arxiv.org/abs/1811.07522v2
[3]
[3]Bruce I. Jacobs and Kenneth N. Levy. 1989. The complexity of the stock market. J. Portf. Manag. 16, 1 (Fall 1989), 19-27.
[4]
[4]Ryan Lowe et al. 2017. Multi-agent actor-critic for mixed cooperative-competitive environments. arXiv:1706.02275. Retrieved from https://arxiv.org/abs/1706.02275v4
[5]
[5]Wenhang Bao and Xiao-yang Liu. 2019. Multi-agent deep reinforcement learning for liquidation strategy analysis. arXiv:1906.11046v1. Retrieved from https://arxiv.org/abs/1906.11046v1
[6]
[6]William F. Sharpe. 1994. The Sharpe ratio. J. Portf. Manag. 21, 1 (Fall 1994), 49-58.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
CODS-COMAD '21: Proceedings of the 3rd ACM India Joint International Conference on Data Science & Management of Data (8th ACM IKDD CODS & 26th COMAD)
January 2021
453 pages
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 02 January 2021

Check for updates

Author Tags

  1. deep learning
  2. multi-agent systems
  3. portfolio allocation
  4. reinforcement learning
  5. stock trading

Qualifiers

  • Extended-abstract
  • Research
  • Refereed limited

Conference

CODS COMAD 2021
CODS COMAD 2021: 8th ACM IKDD CODS and 26th COMAD
January 2 - 4, 2021
Bangalore, India

Acceptance Rates

Overall Acceptance Rate 197 of 680 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)44
  • Downloads (Last 6 weeks)6
Reflects downloads up to 14 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media