814 research outputs found
A Simple Baseline for Travel Time Estimation using Large-Scale Trip Data
The increased availability of large-scale trajectory data around the world
provides rich information for the study of urban dynamics. For example, New
York City Taxi Limousine Commission regularly releases source-destination
information about trips in the taxis they regulate. Taxi data provide
information about traffic patterns, and thus enable the study of urban flow --
what will traffic between two locations look like at a certain date and time in
the future? Existing big data methods try to outdo each other in terms of
complexity and algorithmic sophistication. In the spirit of "big data beats
algorithms", we present a very simple baseline which outperforms
state-of-the-art approaches, including Bing Maps and Baidu Maps (whose APIs
permit large scale experimentation). Such a travel time estimation baseline has
several important uses, such as navigation (fast travel time estimates can
serve as approximate heuristics for A search variants for path finding) and
trip planning (which uses operating hours for popular destinations along with
travel time estimates to create an itinerary).Comment: 12 page
A method for extracting travel patterns using data polishing
With recent developments in ICT, the interest in using large amounts of accumulated data for traffic policy planning has increased significantly. In recent years, data polishing has been proposed as a new method of big data analysis. Data polishing is a graphical clustering method, which can be used to extract patterns that are similar or related to each other by identifying the cluster structures present in the data. The purpose of this study is to identify the travel patterns of railway passengers by applying data polishing to smart card data collected in the Kagawa Prefecture, Japan. To this end, we consider 9,008,709 data points collected over a period of 15 months, ranging from December 1st, 2013 to February 28th, 2015. This dataset includes various types of information, including trip histories and types of passengers. This study implements data polishing to cluster 4,667,520 combinations of information regarding individual rides in terms of the day of the week, the time of the day, passenger types, and origin and destination stations. Via the analysis, 127 characteristic travel patterns are identified in aggregate
Urban Anomaly Analytics: Description, Detection, and Prediction
Urban anomalies may result in loss of life or property if not handled properly. Automatically alerting anomalies in their early stage or even predicting anomalies before happening is of great value for populations. Recently, data-driven urban anomaly analysis frameworks have been forming, which utilize urban big data and machine learning algorithms to detect and predict urban anomalies automatically. In this survey, we make a comprehensive review of the state-of-the-art research on urban anomaly analytics. We first give an overview of four main types of urban anomalies, traffic anomaly, unexpected crowds, environment anomaly, and individual anomaly. Next, we summarize various types of urban datasets obtained from diverse devices, i.e., trajectory, trip records, CDRs, urban sensors, event records, environment data, social media and surveillance cameras. Subsequently, a comprehensive survey of issues on detecting and predicting techniques for urban anomalies is presented. Finally, research challenges and open problems as discussed.Peer reviewe
Data Fusion for MaaS: Opportunities and Challenges
© 2018 IEEE. Computer Supported Cooperative Work (CSCW) in design is an essential facilitator for the development and implementation of smart cities, where modern cooperative transportation and integrated mobility are highly demanded. Owing to greater availability of different data sources, data fusion problem in intelligent transportation systems (ITS) has been very challenging, where machine learning modelling and approaches are promising to offer an important yet comprehensive solution. In this paper, we provide an overview of the recent advances in data fusion for Mobility as a Service (MaaS), including the basics of data fusion theory and the related machine learning methods. We also highlight the opportunities and challenges on MaaS, and discuss potential future directions of research on the integrated mobility modelling
How machine learning informs ride-hailing services: A survey
In recent years, online ride-hailing services have emerged as an important component of urban transportation system, which not only provide significant ease for residents’ travel activities, but also shape new travel behavior and diversify urban mobility patterns. This study provides a thorough review of machine-learning-based methodologies for on-demand ride-hailing services. The importance of on-demand ride-hailing services in the spatio-temporal dynamics of urban traffic is first highlighted, with machine-learning-based macro-level ride-hailing research demonstrating its value in guiding the design, planning, operation, and control of urban intelligent transportation systems. Then, the research on travel behavior from the perspective of individual mobility patterns, including carpooling behavior and modal choice behavior, is summarized. In addition, existing studies on order matching and vehicle dispatching strategies, which are among the most important components of on-line ride-hailing systems, are collected and summarized. Finally, some of the critical challenges and opportunities in ride-hailing services are discussed
Routine pattern discovery and anomaly detection in individual travel behavior
Discovering patterns and detecting anomalies in individual travel behavior is
a crucial problem in both research and practice. In this paper, we address this
problem by building a probabilistic framework to model individual
spatiotemporal travel behavior data (e.g., trip records and trajectory data).
We develop a two-dimensional latent Dirichlet allocation (LDA) model to
characterize the generative mechanism of spatiotemporal trip records of each
traveler. This model introduces two separate factor matrices for the spatial
dimension and the temporal dimension, respectively, and use a two-dimensional
core structure at the individual level to effectively model the joint
interactions and complex dependencies. This model can efficiently summarize
travel behavior patterns on both spatial and temporal dimensions from very
sparse trip sequences in an unsupervised way. In this way, complex travel
behavior can be modeled as a mixture of representative and interpretable
spatiotemporal patterns. By applying the trained model on future/unseen
spatiotemporal records of a traveler, we can detect her behavior anomalies by
scoring those observations using perplexity. We demonstrate the effectiveness
of the proposed modeling framework on a real-world license plate recognition
(LPR) data set. The results confirm the advantage of statistical learning
methods in modeling sparse individual travel behavior data. This type of
pattern discovery and anomaly detection applications can provide useful
insights for traffic monitoring, law enforcement, and individual travel
behavior profiling
Predicting passenger origin-destination in online taxi-hailing systems
Because of transportation planning, traffic management, and dispatch
optimization importance, passenger origin-destination prediction has become one
of the most important requirements for intelligent transportation systems
management. In this paper, we propose a model to predict the next specified
time window travels' origin and destination. To extract meaningful travel
flows, we use K-means clustering in four-dimensional space with maximum cluster
size limitation for origin and destination zones. Because of the large number
of clusters, we use non-negative matrix factorization to decrease the number of
travel clusters. Also, we use a stacked recurrent neural network model to
predict travel count in each cluster. Comparing our results with other existing
models shows that our proposed model has 5-7% lower mean absolute percentage
error (MAPE) for 1-hour time windows, and 14% lower MAPE for 30-minute time
windows.Comment: 25 pages, 20 figure
- …