research-article

A General Framework on Enhancing Portfolio Management with Reinforcement Learning

Authors:

Yijie CaoAuthors Info & Claims

ICCMB '24: Proceedings of the 2024 7th International Conference on Computers in Management and Business

Pages 119 - 125

https://doi.org/10.1145/3647782.3647801

Published: 06 May 2024 Publication History

Abstract

Portfolio management is the art and science in fiance that concerns continuous reallocation of funds and assets across financial instruments to meet the desired returns to risk profile. Deep reinforcement learning (RL) has gained increasing interest in portfolio management, where RL agents are trained base on financial data to optimize the asset reallocation process. Though there are prior efforts in trying to combine RL and portfolio management, previous works did not consider practical aspects such as transaction costs or short selling restrictions, limiting their applicability. To address these limitations, we propose a general RL framework for asset management that enables continuous asset weights, short selling and making decisions with relevant features. We compare the performance of three different RL algorithms: Policy Gradient with Actor-Critic (PGAC), Proximal Policy Optimization (PPO), and Evolution Strategies (ES) and demonstrate their advantages in a simulated environment with transaction costs. Our work aims to provide more options for utilizing RL frameworks in real-life asset management scenarios and can benefit further research in financial applications.

References

[1]

Yue Deng, Feng Bao, Youyong Kong, Zhiquan Ren, and Qionghai Dai. 2017. Deep Direct Reinforcement Learning for Financial Signal Representation and Trading. IEEE Transactions on Neural Networks and Learning Systems 28 (2017), 653–664.

[2]

José E. Figueroa-López. 2005. A selected survey of portfolio optimization problems.

[3]

Thomas G. Fischer. 2018. Reinforcement learning in financial markets - a survey.

[4]

YiFeng Guo, XingYu Fu, Yuyan Shi, and MingWen Liu. 2018. Robust Log-Optimal Strategy with Reinforcement Learning. arXiv: Portfolio Management (2018).

[5]

Zhengyao Jiang, Dixing Xu, and Jinjun Liang. 2017. A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem. https://doi.org/10.48550/ARXIV.1706.10059

[6]

Jerzy J. Korczak, Piotr Lipiński, and Patrick Roger. 2001. Evolution Strategy in Portfolio Optimization. In Artificial Evolution.

[7]

Bin Li and Steven C. H. Hoi. 2012. Online Portfolio Selection: A Survey. https://doi.org/10.48550/ARXIV.1212.2129

[8]

Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, and Koray Kavukcuoglu. 2016. Asynchronous Methods for Deep Reinforcement Learning. CoRR abs/1602.01783 (2016). arXiv:1602.01783http://arxiv.org/abs/1602.01783

[9]

Ahmet Murat Ozbayoglu, Mehmet Ugur Gudelek, and Omer Berat Sezer. 2020. Deep Learning for Financial Applications : A Survey. https://doi.org/10.48550/ARXIV.2002.05786

[10]

I. Rechenberg. 1978. Evolutionsstrategien. In Simulationsmethoden in der Medizin und Biologie, Berthold Schneider and Ulrich Ranft (Eds.). Springer Berlin Heidelberg, Berlin, Heidelberg, 83–114.

[11]

Tim Salimans, Jonathan Ho, Xi Chen, Szymon Sidor, and Ilya Sutskever. 2017. Evolution Strategies as a Scalable Alternative to Reinforcement Learning. https://doi.org/10.48550/ARXIV.1703.03864

[12]

John Schulman, Sergey Levine, Philipp Moritz, Michael I. Jordan, and Pieter Abbeel. 2015. Trust Region Policy Optimization. https://doi.org/10.48550/ARXIV.1502.05477

[13]

John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, and Oleg Klimov. 2017. Proximal Policy Optimization Algorithms. https://doi.org/10.48550/ARXIV.1707.06347

[14]

William F. Sharpe. 1994. The Sharpe Ratio. The Journal of Portfolio Management 21, 1 (1994), 49–58. https://doi.org/10.3905/jpm.1994.409501 arXiv:https://jpm.pm-research.com/content/21/1/49.full.pdf

[15]

Charalampos Stasinakis and Georgios Sermpinis. 2014. Financial Forecasting and Trading Strategies: A Survey. 22–39. https://doi.org/10.4324/9780203084984

[16]

Alan L. Stuart and Harry M. Markowitz. 1959. Portfolio Selection: Efficient Diversification of Investments. A Quarterly Journal of Operations Research 10 (1959), 253.

[17]

Richard S Sutton, David McAllester, Satinder Singh, and Yishay Mansour. 1999. Policy Gradient Methods for Reinforcement Learning with Function Approximation. In Advances in Neural Information Processing Systems, S. Solla, T. Leen, and K. Müller (Eds.). Vol. 12. MIT Press. https://proceedings.neurips.cc/paper/1999/file/464d828b85b0bed98e80ade0a5c43b0f-Paper.pdf

[18]

Kamran Usmani. 2015. An Investigation into the Use of Reinforcement Learning Techniques within the Algorithmic Trading Domain.

[19]

Junfeng Wen, Saurabh Kumar, Ramki Gummadi, and Dale Schuurmans. 2021. Characterizing the Gap Between Actor-Critic and Policy Gradient. In International Conference on Machine Learning.

Index Terms

A General Framework on Enhancing Portfolio Management with Reinforcement Learning
1. Computing methodologies
  1. Artificial intelligence
  2. Machine learning
    1. Machine learning algorithms
    2. Machine learning approaches

Recommendations

Reinforcement Learning with Maskable Stock Representation for Portfolio Management in Customizable Stock Pools
WWW '24: Proceedings of the ACM Web Conference 2024

Portfolio management (PM) is a fundamental financial trading task, which explores the optimal periodical reallocation of capitals into different stocks to pursue long-term profits. Reinforcement learning (RL) has recently shown its potential to train ...
From deterministic to stochastic: an interpretable stochastic model-free reinforcement learning framework for portfolio optimization
Abstract
As a fundamental problem in algorithmic trading, portfolio optimization aims to maximize the cumulative return by continuously investing in various financial derivatives within a given time period. Recent years have witnessed the transformation ...
Reinforcement Learning Driven Trading Algorithm with Optimized Stock Portfolio Management Scheme to Control Financial Risk
Abstract
In recent years, the application of deep learning techniques in financial markets has achieved significant attention for developing effective investment strategies. Traditional schemes focus on optimizing models to maximize returns but often fail ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

ICCMB '24: Proceedings of the 2024 7th International Conference on Computers in Management and Business

January 2024

235 pages

ISBN:9798400716652

DOI:10.1145/3647782

Copyright © 2024 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 06 May 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

ICCMB 2024

ICCMB 2024: 2024 7th International Conference on Computers in Management and Business

January 12 - 14, 2024

Singapore, Singapore

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
59
Total Downloads

Downloads (Last 12 months)59
Downloads (Last 6 weeks)8

Reflects downloads up to 16 Jan 2025

Other Metrics

View Author Metrics

Citations

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Media

Figures

Other

Tables

View Table of Contents