87 research outputs found
Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning
A fundamental question in any peer-to-peer ridesharing system is how to, both
effectively and efficiently, dispatch user's ride requests to the right driver
in real time. Traditional rule-based solutions usually work on a simplified
problem setting, which requires a sophisticated hand-crafted weight design for
either centralized authority control or decentralized multi-agent scheduling
systems. Although recent approaches have used reinforcement learning to provide
centralized combinatorial optimization algorithms with informative weight
values, their single-agent setting can hardly model the complex interactions
between drivers and orders. In this paper, we address the order dispatching
problem using multi-agent reinforcement learning (MARL), which follows the
distributed nature of the peer-to-peer ridesharing problem and possesses the
ability to capture the stochastic demand-supply dynamics in large-scale
ridesharing scenarios. Being more reliable than centralized approaches, our
proposed MARL solutions could also support fully distributed execution through
recent advances in the Internet of Vehicles (IoV) and the Vehicle-to-Network
(V2N). Furthermore, we adopt the mean field approximation to simplify the local
interactions by taking an average action among neighborhoods. The mean field
approximation is capable of globally capturing dynamic demand-supply variations
by propagating many local interactions between agents and the environment. Our
extensive experiments have shown the significant improvements of MARL order
dispatching algorithms over several strong baselines on the gross merchandise
volume (GMV), and order response rate measures. Besides, the simulated
experiments with real data have also justified that our solution can alleviate
the supply-demand gap during the rush hours, thus possessing the capability of
reducing traffic congestion.Comment: 11 pages, 9 figure
CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms
How to optimally dispatch orders to vehicles and how to tradeoff between
immediate and future returns are fundamental questions for a typical
ride-hailing platform. We model ride-hailing as a large-scale parallel ranking
problem and study the joint decision-making task of order dispatching and fleet
management in online ride-hailing platforms. This task brings unique challenges
in the following four aspects. First, to facilitate a huge number of vehicles
to act and learn efficiently and robustly, we treat each region cell as an
agent and build a multi-agent reinforcement learning framework. Second, to
coordinate the agents from different regions to achieve long-term benefits, we
leverage the geographical hierarchy of the region grids to perform hierarchical
reinforcement learning. Third, to deal with the heterogeneous and variant
action space for joint order dispatching and fleet management, we design the
action as the ranking weight vector to rank and select the specific order or
the fleet management destination in a unified formulation. Fourth, to achieve
the multi-scale ride-hailing platform, we conduct the decision-making process
in a hierarchical way where a multi-head attention mechanism is utilized to
incorporate the impacts of neighbor agents and capture the key agent in each
scale. The whole novel framework is named as CoRide. Extensive experiments
based on multiple cities real-world data as well as analytic synthetic data
demonstrate that CoRide provides superior performance in terms of platform
revenue and user experience in the task of city-wide hybrid order dispatching
and fleet management over strong baselines.Comment: CIKM 201
A Better Match for Drivers and Riders: Reinforcement Learning at Lyft
To better match drivers to riders in our ridesharing application, we revised
Lyft's core matching algorithm. We use a novel online reinforcement learning
approach that estimates the future earnings of drivers in real time and use
this information to find more efficient matches. This change was the first
documented implementation of a ridesharing matching algorithm that can learn
and improve in real time. We evaluated the new approach during weeks of
switchback experimentation in most Lyft markets, and estimated how it benefited
drivers, riders, and the platform. In particular, it enabled our drivers to
serve millions of additional riders each year, leading to more than $30 million
per year in incremental revenue. Lyft rolled out the algorithm globally in
2021
CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms
How to optimally dispatch orders to vehicles and how to trade
off between immediate and future returns are fundamental questions for a typical ride-hailing platform. We model ride-hailing as a
large-scale parallel ranking problem and study the joint decisionmaking task of order dispatching and fleet management in online
ride-hailing platforms. This task brings unique challenges in the
following four aspects. First, to facilitate a huge number of vehicles
to act and learn efficiently and robustly, we treat each region cell
as an agent and build a multi-agent reinforcement learning framework. Second, to coordinate the agents from different regions to
achieve long-term benefits, we leverage the geographical hierarchy
of the region grids to perform hierarchical reinforcement learning.
Third, to deal with the heterogeneous and variant action space
for joint order dispatching and fleet management, we design the
action as the ranking weight vector to rank and select the specific
order or the fleet management destination in a unified formulation.
Fourth, to achieve the multi-scale ride-hailing platform, we conduct
the decision-making process in a hierarchical way where a multihead attention mechanism is utilized to incorporate the impacts
of neighbor agents and capture the key agent in each scale. The
whole novel framework is named as CoRide. Extensive experiments
based on multiple cities real-world data as well as analytic synthetic
data demonstrate that CoRide provides superior performance in
terms of platform revenue and user experience in the task of citywide hybrid order dispatching and fleet management over strong
baselines
AI for Explaining Decisions in Multi-Agent Environments
Explanation is necessary for humans to understand and accept decisions made
by an AI system when the system's goal is known. It is even more important when
the AI system makes decisions in multi-agent environments where the human does
not know the systems' goals since they may depend on other agents' preferences.
In such situations, explanations should aim to increase user satisfaction,
taking into account the system's decision, the user's and the other agents'
preferences, the environment settings and properties such as fairness, envy and
privacy. Generating explanations that will increase user satisfaction is very
challenging; to this end, we propose a new research direction: xMASE. We then
review the state of the art and discuss research directions towards efficient
methodologies and algorithms for generating explanations that will increase
users' satisfaction from AI system's decisions in multi-agent environments.Comment: This paper has been submitted to the Blue Sky Track of the AAAI 2020
conference. At the time of submission, it is under review. The tentative
notification date will be November 10, 2019. Current version: Name of first
author had been added in metadat
Data-driven Methodologies and Applications in Urban Mobility
The world is urbanizing at an unprecedented rate where urbanization goes from 39% in 1980 to 58% in 2019 (World Bank, 2019). This poses more and more transportation demand and pressure on the already at or over-capacity old transport infrastructure, especially in urban areas. Along the same timeline, more data generated as a byproduct of daily activity are being collected via the advancement of the internet of things, and computers are getting more and more powerful. These are shown by the statistics such as 90% of the world’s data is generated within the last two years and IBM’s computer is now processing at the speed of 120,000 GPS points per second. Thus, this dissertation discusses the challenges and opportunities arising from the growing demand for urban mobility, particularly in cities with outdated infrastructure, and how to capitalize on the unprecedented growth in data in solving these problems by ways of data-driven transportation-specific methodologies. The dissertation identifies three primary challenges and/or opportunities, which are (1) optimally locating dynamic wireless charging to promote the adoption of electric vehicles, (2) predicting dynamic traffic state using an enormously large dataset of taxi trips, and (3) improving the ride-hailing system with carpooling, smart dispatching, and preemptive repositioning. The dissertation presents potential solutions/methodologies that have become available only recently thanks to the extraordinary growth of data and computers with explosive power, and these methodologies are (1) bi-level optimization planning frameworks for locating dynamic wireless charging facilities, (2) Traffic Graph Convolutional Network for dynamic urban traffic state estimation, and (3) Graph Matching and Reinforcement Learning for the operation and management of mixed autonomous electric taxi fleets. These methodologies are then carefully calibrated, methodically scrutinized under various performance metrics and procedures, and validated with previous research and ground truth data, which is gathered directly from the real world. In order to bridge the gap between scientific discoveries and practical applications, the three methodologies are applied to the case study of (1) Montgomery County, MD, (2) the City of New York, and (3) the City of Chicago and from which, real-world implementation are suggested. This dissertation’s contribution via the provided methodologies, along with the continual increase in data, have the potential to significantly benefit urban mobility and work toward a sustainable transportation system
A multi-functional simulation platform for on-demand ride service operations
On-demand ride services or ride-sourcing services have been experiencing fast
development in the past decade. Various mathematical models and optimization
algorithms have been developed to help ride-sourcing platforms design
operational strategies with higher efficiency. However, due to cost and
reliability issues (implementing an immature algorithm for real operations may
result in system turbulence), it is commonly infeasible to validate these
models and train/test these optimization algorithms within real-world ride
sourcing platforms. Acting as a useful test bed, a simulation platform for
ride-sourcing systems will be very important to conduct algorithm
training/testing or model validation through trails and errors. While previous
studies have established a variety of simulators for their own tasks, it lacks
a fair and public platform for comparing the models or algorithms proposed by
different researchers. In addition, the existing simulators still face many
challenges, ranging from their closeness to real environments of ride-sourcing
systems, to the completeness of different tasks they can implement. To address
the challenges, we propose a novel multi-functional and open-sourced simulation
platform for ride-sourcing systems, which can simulate the behaviors and
movements of various agents on a real transportation network. It provides a few
accessible portals for users to train and test various optimization algorithms,
especially reinforcement learning algorithms, for a variety of tasks, including
on-demand matching, idle vehicle repositioning, and dynamic pricing. In
addition, it can be used to test how well the theoretical models approximate
the simulated outcomes. Evaluated on real-world data based experiments, the
simulator is demonstrated to be an efficient and effective test bed for various
tasks related to on-demand ride service operations
- …