Deep Reinforcement Learning with Knowledge Transfer for Online Rides Order Dispatching
Citations Over TimeTop 10% of 2018 papers
Abstract
Ride dispatching is a central operation task on a ride-sharing platform to continuously match drivers to trip-requesting passengers. In this work, we model the ride dispatching problem as a Markov Decision Process and propose learning solutions based on deep Q-networks with action search to optimize the dispatching policy for drivers on ride-sharing platforms. We train and evaluate dispatching agents for this challenging decision task using real-world spatio-temporal trip data from the DiDi ride-sharing platform. A large-scale dispatching system typically supports many geographical locations with diverse demand-supply settings. To increase learning adaptability and efficiency, we propose a new transfer learning method Correlated Feature Progressive Transfer, along with two existing methods, enabling knowledge transfer in both spatial and temporal spaces. Through an extensive set of experiments, we demonstrate the learning and optimization capabilities of our deep reinforcement learning algorithms. We further show that dispatching policies learned by transferring knowledge from a source city to target cities or across temporal space within the same city significantly outperform those without transfer learning.
Related Papers
- On the Social Adaptability of the Floating Population(2003)
- Research Review of Driver Adaptability(2010)
- Crops eco-economic adaptability evaluation based on ecological-market potential capacity(2006)
- Preliminary Exploration to Higher Vocational College Students'School Adaptability(2007)
- Empirical Study on the Status of Undergraduate Students' Occupational Adaptability and Countermeasures(2008)