skip to main content
10.1145/3647782.3647801acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccmbConference Proceedingsconference-collections
research-article

A General Framework on Enhancing Portfolio Management with Reinforcement Learning

Published: 06 May 2024 Publication History

Abstract

Portfolio management is the art and science in fiance that concerns continuous reallocation of funds and assets across financial instruments to meet the desired returns to risk profile. Deep reinforcement learning (RL) has gained increasing interest in portfolio management, where RL agents are trained base on financial data to optimize the asset reallocation process. Though there are prior efforts in trying to combine RL and portfolio management, previous works did not consider practical aspects such as transaction costs or short selling restrictions, limiting their applicability. To address these limitations, we propose a general RL framework for asset management that enables continuous asset weights, short selling and making decisions with relevant features. We compare the performance of three different RL algorithms: Policy Gradient with Actor-Critic (PGAC), Proximal Policy Optimization (PPO), and Evolution Strategies (ES) and demonstrate their advantages in a simulated environment with transaction costs. Our work aims to provide more options for utilizing RL frameworks in real-life asset management scenarios and can benefit further research in financial applications.

References

[1]
Yue Deng, Feng Bao, Youyong Kong, Zhiquan Ren, and Qionghai Dai. 2017. Deep Direct Reinforcement Learning for Financial Signal Representation and Trading. IEEE Transactions on Neural Networks and Learning Systems 28 (2017), 653–664.
[2]
José E. Figueroa-López. 2005. A selected survey of portfolio optimization problems.
[3]
Thomas G. Fischer. 2018. Reinforcement learning in financial markets - a survey.
[4]
YiFeng Guo, XingYu Fu, Yuyan Shi, and MingWen Liu. 2018. Robust Log-Optimal Strategy with Reinforcement Learning. arXiv: Portfolio Management (2018).
[5]
Zhengyao Jiang, Dixing Xu, and Jinjun Liang. 2017. A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem. https://doi.org/10.48550/ARXIV.1706.10059
[6]
Jerzy J. Korczak, Piotr Lipiński, and Patrick Roger. 2001. Evolution Strategy in Portfolio Optimization. In Artificial Evolution.
[7]
Bin Li and Steven C. H. Hoi. 2012. Online Portfolio Selection: A Survey. https://doi.org/10.48550/ARXIV.1212.2129
[8]
Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016. Asynchronous Methods for Deep Reinforcement Learning. CoRR abs/1602.01783 (2016). arXiv:1602.01783http://arxiv.org/abs/1602.01783
[9]
Ahmet Murat Ozbayoglu, Mehmet Ugur Gudelek, and Omer Berat Sezer. 2020. Deep Learning for Financial Applications : A Survey. https://doi.org/10.48550/ARXIV.2002.05786
[10]
I. Rechenberg. 1978. Evolutionsstrategien. In Simulationsmethoden in der Medizin und Biologie, Berthold Schneider and Ulrich Ranft (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 83–114.
[11]
Tim Salimans, Jonathan Ho, Xi Chen, Szymon Sidor, and Ilya Sutskever. 2017. Evolution Strategies as a Scalable Alternative to Reinforcement Learning. https://doi.org/10.48550/ARXIV.1703.03864
[12]
John Schulman, Sergey Levine, Philipp Moritz, Michael I. Jordan, and Pieter Abbeel. 2015. Trust Region Policy Optimization. https://doi.org/10.48550/ARXIV.1502.05477
[13]
John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal Policy Optimization Algorithms. https://doi.org/10.48550/ARXIV.1707.06347
[14]
William F. Sharpe. 1994. The Sharpe Ratio. The Journal of Portfolio Management 21, 1 (1994), 49–58. https://doi.org/10.3905/jpm.1994.409501 arXiv:https://jpm.pm-research.com/content/21/1/49.full.pdf
[15]
Charalampos Stasinakis and Georgios Sermpinis. 2014. Financial Forecasting and Trading Strategies: A Survey. 22–39. https://doi.org/10.4324/9780203084984
[16]
Alan L. Stuart and Harry M. Markowitz. 1959. Portfolio Selection: Efficient Diversification of Investments. A Quarterly Journal of Operations Research 10 (1959), 253.
[17]
Richard S Sutton, David McAllester, Satinder Singh, and Yishay Mansour. 1999. Policy Gradient Methods for Reinforcement Learning with Function Approximation. In Advances in Neural Information Processing Systems, S. Solla, T. Leen, and K. Müller (Eds.). Vol. 12. MIT Press. https://proceedings.neurips.cc/paper/1999/file/464d828b85b0bed98e80ade0a5c43b0f-Paper.pdf
[18]
Kamran Usmani. 2015. An Investigation into the Use of Reinforcement Learning Techniques within the Algorithmic Trading Domain.
[19]
Junfeng Wen, Saurabh Kumar, Ramki Gummadi, and Dale Schuurmans. 2021. Characterizing the Gap Between Actor-Critic and Policy Gradient. In International Conference on Machine Learning.

Index Terms

  1. A General Framework on Enhancing Portfolio Management with Reinforcement Learning

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Other conferences
        ICCMB '24: Proceedings of the 2024 7th International Conference on Computers in Management and Business
        January 2024
        235 pages
        ISBN:9798400716652
        DOI:10.1145/3647782
        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 06 May 2024

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Artificial Intelligence
        2. Deep Learning
        3. Evolution Strategy
        4. Policy Gradient
        5. Portfolio Management
        6. Proximal Policy Optimization
        7. Reinforcement Learning

        Qualifiers

        • Research-article
        • Research
        • Refereed limited

        Conference

        ICCMB 2024

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 59
          Total Downloads
        • Downloads (Last 12 months)59
        • Downloads (Last 6 weeks)8
        Reflects downloads up to 16 Jan 2025

        Other Metrics

        Citations

        View Options

        Login options

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format.

        HTML Format

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media