Learning to Minimize Cost to Serve for Multi-Node Multi-Product Order Fulfilment in Electronic Commerce

Authors:

Pranavi Pathakota,

Anulekha Dhara,

Hardik Meisheri,

Harshad Khadilkar

CODS-COMAD '23: Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD)

Pages 176 - 184

https://doi.org/10.1145/3570991.3571016

Published: 04 January 2023

Abstract

In the retail industry, electronic commerce (e-commerce) has grown quickly in the last decade and has further accelerated as a result of movement restrictions during the pandemic. While working with business collaborators in the logistics and retail industries, we found that the cost of delivering products from the most opportune node in the supply chain (a quantity called the cost-to-serve, or CTS) is a key challenge. In this paper, we formally define CTS as a decision-making problem. We then focus on the specific subproblem of delivering multiple products in arbitrary quantities from any warehouse to the customer's doorstep. We find that a reinforcement learning (RL) formulation is able to exceed the performance of state-of-the-art rule-based policies, while being significantly faster than traditional optimisation approaches such as mixed-integer linear programming. We hypothesise that scaling up the RL-based methodology will have a significant impact on the operating margins of retailers in the ‘new normal’.

Index Terms

Learning to Minimize Cost to Serve for Multi-Node Multi-Product Order Fulfilment in Electronic Commerce
1. Applied computing
  1. Electronic commerce
  2. Operations research
    1. Decision analysis
      1. Multi-criterion optimization and decision-making
2. Computing methodologies
  1. Artificial intelligence
    1. Planning and scheduling
  2. Machine learning
    1. Learning paradigms
      1. Reinforcement learning

Published In

CODS-COMAD '23: Proceedings of the 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD)

January 2023

357 pages

ISBN: 9781450397971

DOI: 10.1145/3570991

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article
Research
Refereed limited

Conference

CODS-COMAD 2023

CODS-COMAD 2023: 6th Joint International Conference on Data Science & Management of Data (10th ACM IKDD CODS and 28th COMAD)

January 4 - 7, 2023

Mumbai, India

Acceptance Rates

Overall Acceptance Rate 197 of 680 submissions, 29%
