skip to main content
10.1145/3534678.3539141acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article
Open access

Real-Time Rideshare Driver Supply Values Using Online Reinforcement Learning

Published: 14 August 2022 Publication History

Abstract

In this paper, we present Online Supply Values (OSV), a system for estimating the return of available rideshare drivers to match drivers to ride requests at Lyft. Because a future driver state can be accurately predicted from a request destination, it is possible to estimate the expected action value of assigning a ride request to an available driver as a Markov Decision Process using the Bellman Equation. These estimates are updated using temporal difference and are shown to adapt to changing marketplace conditions in real-time. While reinforcement learning has been studied for rideshare dispatch, fully-online approaches without offline priors or other guardrails had never been evaluated in the real world. This work presents the algorithmic changes needed to bridge this gap. OSV is now deployed globally as a core component of Lyft's dispatch matching system. Our A/B user experiments in major US cities measure a +(0.96±0.53)% increase in the request fulfillment rate and a +(0.73±0.22)% increase to profit per passenger session over the previous algorithm.

References

[1]
Juan Camilo Castillo, Dan Knoepfle, and Glen Weyl. 2017. Surge Pricing Solves the Wild Goose Chase. In Proceedings of the 2017 ACM Conference on Economics and Computation (Cambridge, Massachusetts, USA) (EC '17). Association for Computing Machinery, New York, NY, USA, 241--242.
[2]
Giuseppe DeCandia, Deniz Hastorun, Madan Jampani, Gunavardhan Kakulapati, Avinash Lakshman, Alex Pilchin, Swaminathan Sivasubramanian, Peter Vosshall, and Werner Vogels. 2007. Dynamo: amazon's highly available key-value store. In SOSP. ACM, 205--220.
[3]
Benjamin Han. 2020. Open Source Implementation using TD(0). https://www.biendata.xyz/forum/view_post_category/1048/
[4]
Benjamin Han and Carl Arndt. 2021. Budget Allocation as a Multi-Agent System of Contextual & Continuous Bandits. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD '21). 2937--2945.
[5]
Geoffrey Hinton. 2014. rmsprop: Divide the gradient by a running average of its recent magnitude. https://www.cs.toronto.edu/ tijmen/csc321/slides/lecture_slides_lec6.pdf
[6]
Zhiwei (Tony) Qin, Hongtu Zhu, and Jieping Ye. 2021. Reinforcement Learning for Ridesharing: A Survey. CoRR, Vol. abs/2105.01099 (2021).
[7]
Jaein Song, Yun Ji Cho, Min Hee Kang, and Kee Yeon Hwang. 2020. An Application of Reinforced Learning-Based Dynamic Pricing for Improvement of Ridesharing Platform Service in Seoul. Electronics, Vol. 9, 11 (2020).
[8]
Richard S. Sutton and Andrew G. Barto. 2018. Reinforcement Learning: An Introduction second ed.). The MIT Press. http://incompleteideas.net/book/the-book-2nd.html
[9]
Xiaocheng Tang, Fan Zhang, Zhiwei Qin, Yansheng Wang, Dingyuan Shi, Bingchen Song, Yongxin Tong, Hongtu Zhu, and Jieping Ye. 2021. Value Function is All You Need: A Unified Learning Framework for Ride Hailing Platforms. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining (KDD '21). 3605--3615.
[10]
Uber Technologies. 2021. Hexagonal hierarchical geospatial indexing system. https://h3geo.org/
[11]
Jacob van Gogh, Benjamin Han, and Alex Contryman. 2020. Team Hail Mary 2020 KDD Cup Submission. https://github.com/jrvangogh/kddcup-2020/blob/master/model/agent.py
[12]
Erich Veach, Jesse Rosenstock, Eric Engle, Robert Snedegar, Julien Basch, and Tom Manshreck. 2021. S2 Geometry. http://s2geometry.io/
[13]
Yansheng Wang, Dingyuan Shi, Maoxiaomin Peng, Yi Xu, and Yongxin Tong. 2020. 1st solution for KDD Cup 2020 (RL track). https://github.com/maybeluo/KDDCup2020-RL-1st-solution
[14]
Fanyou Wu and Yang Liu. 2020. 1st solution for KDD Cup 2020 (RL track). https://wufanyou.github.io/assets/pdf/kdd-cup-2020-rl-solution.pdf
[15]
Zhe Xu, Zhixin Li, Qingwen Guan, Dingshui Zhang, Qiang Li, Junxiao Nan, Chunyang Liu, Wei Bian, and Jieping Ye. 2018. Large-Scale Order Dispatch in On-Demand Ride-Hailing Platforms: A Learning and Planning Approach. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '18). 905--913.
[16]
Cheng Zeng and Nir Oren. 2014. Dynamic taxi pricing. Frontiers in Artificial Intelligence and Applications, Vol. 263 (01 2014), 1135--1136.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining
August 2022
5033 pages
ISBN:9781450393850
DOI:10.1145/3534678
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives International 4.0 License.

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 14 August 2022

Check for updates

Author Tags

  1. adaptive
  2. dispatch
  3. matching
  4. multi-agent reinforcement learning
  5. on-policy control
  6. online learning
  7. real-time
  8. rideshare
  9. streaming
  10. temporal difference
  11. transportation

Qualifiers

  • Research-article

Conference

KDD '22
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)623
  • Downloads (Last 6 weeks)38
Reflects downloads up to 06 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media