3,158 research outputs found
Kernel Regression with Sparse Metric Learning
Kernel regression is a popular non-parametric fitting technique. It aims at
learning a function which estimates the targets for test inputs as precise as
possible. Generally, the function value for a test input is estimated by a
weighted average of the surrounding training examples. The weights are
typically computed by a distance-based kernel function and they strongly depend
on the distances between examples. In this paper, we first review the latest
developments of sparse metric learning and kernel regression. Then a novel
kernel regression method involving sparse metric learning, which is called
kernel regression with sparse metric learning (KRSML), is proposed. The
sparse kernel regression model is established by enforcing a mixed -norm
regularization over the metric matrix. It learns a Mahalanobis distance metric
by a gradient descent procedure, which can simultaneously conduct
dimensionality reduction and lead to good prediction results. Our work is the
first to combine kernel regression with sparse metric learning. To verify the
effectiveness of the proposed method, it is evaluated on 19 data sets for
regression. Furthermore, the new method is also applied to solving practical
problems of forecasting short-term traffic flows. In the end, we compare the
proposed method with other three related kernel regression methods on all test
data sets under two criterions. Experimental results show that the proposed
method is much more competitive
Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting
Spatiotemporal forecasting has various applications in neuroscience, climate
and transportation domain. Traffic forecasting is one canonical example of such
learning task. The task is challenging due to (1) complex spatial dependency on
road networks, (2) non-linear temporal dynamics with changing road conditions
and (3) inherent difficulty of long-term forecasting. To address these
challenges, we propose to model the traffic flow as a diffusion process on a
directed graph and introduce Diffusion Convolutional Recurrent Neural Network
(DCRNN), a deep learning framework for traffic forecasting that incorporates
both spatial and temporal dependency in the traffic flow. Specifically, DCRNN
captures the spatial dependency using bidirectional random walks on the graph,
and the temporal dependency using the encoder-decoder architecture with
scheduled sampling. We evaluate the framework on two real-world large scale
road network traffic datasets and observe consistent improvement of 12% - 15%
over state-of-the-art baselines.Comment: Published as a conference paper at ICLR 201
Deep Learning for Short-Term Traffic Flow Prediction
We develop a deep learning model to predict traffic flows. The main
contribution is development of an architecture that combines a linear model
that is fitted using regularization and a sequence of layers.
The challenge of predicting traffic flows are the sharp nonlinearities due to
transitions between free flow, breakdown, recovery and congestion. We show that
deep learning architectures can capture these nonlinear spatio-temporal
effects. The first layer identifies spatio-temporal relations among predictors
and other layers model nonlinear relations. We illustrate our methodology on
road sensor data from Interstate I-55 and predict traffic flows during two
special events; a Chicago Bears football game and an extreme snowstorm event.
Both cases have sharp traffic flow regime changes, occurring very suddenly, and
we show how deep learning provides precise short term traffic flow predictions
A Hybrid Method for Traffic Flow Forecasting Using Multimodal Deep Learning
Traffic flow forecasting has been regarded as a key problem of intelligent
transport systems. In this work, we propose a hybrid multimodal deep learning
method for short-term traffic flow forecasting, which can jointly and
adaptively learn the spatial-temporal correlation features and long temporal
interdependence of multi-modality traffic data by an attention auxiliary
multimodal deep learning architecture. According to the highly nonlinear
characteristics of multi-modality traffic data, the base module of our method
consists of one-dimensional Convolutional Neural Networks (1D CNN) and Gated
Recurrent Units (GRU) with the attention mechanism. The former is to capture
the local trend features and the latter is to capture the long temporal
dependencies. Then, we design a hybrid multimodal deep learning framework
(HMDLF) for fusing share representation features of different modality traffic
data by multiple CNN-GRU-Attention modules. The experimental results indicate
that the proposed multimodal deep learning model is capable of dealing with
complex nonlinear urban traffic flow forecasting with satisfying accuracy and
effectiveness
Machine Learning for Spatiotemporal Sequence Forecasting: A Survey
Spatiotemporal systems are common in the real-world. Forecasting the
multi-step future of these spatiotemporal systems based on the past
observations, or, Spatiotemporal Sequence Forecasting (STSF), is a significant
and challenging problem. Although lots of real-world problems can be viewed as
STSF and many research works have proposed machine learning based methods for
them, no existing work has summarized and compared these methods from a unified
perspective. This survey aims to provide a systematic review of machine
learning for STSF. In this survey, we define the STSF problem and classify it
into three subcategories: Trajectory Forecasting of Moving Point Cloud
(TF-MPC), STSF on Regular Grid (STSF-RG) and STSF on Irregular Grid (STSF-IG).
We then introduce the two major challenges of STSF: 1) how to learn a model for
multi-step forecasting and 2) how to adequately model the spatial and temporal
structures. After that, we review the existing works for solving these
challenges, including the general learning strategies for multi-step
forecasting, the classical machine learning based methods for STSF, and the
deep learning based methods for STSF. We also compare these methods and point
out some potential research directions
Wireless Traffic Prediction with Scalable Gaussian Process: Framework, Algorithms, and Verification
The cloud radio access network (C-RAN) is a promising paradigm to meet the
stringent requirements of the fifth generation (5G) wireless systems.
Meanwhile, wireless traffic prediction is a key enabler for C-RANs to improve
both the spectrum efficiency and energy efficiency through load-aware network
managements. This paper proposes a scalable Gaussian process (GP) framework as
a promising solution to achieve large-scale wireless traffic prediction in a
cost-efficient manner. Our contribution is three-fold. First, to the best of
our knowledge, this paper is the first to empower GP regression with the
alternating direction method of multipliers (ADMM) for parallel hyper-parameter
optimization in the training phase, where such a scalable training framework
well balances the local estimation in baseband units (BBUs) and information
consensus among BBUs in a principled way for large-scale executions. Second, in
the prediction phase, we fuse local predictions obtained from the BBUs via a
cross-validation based optimal strategy, which demonstrates itself to be
reliable and robust for general regression tasks. Moreover, such a
cross-validation based optimal fusion strategy is built upon a well
acknowledged probabilistic model to retain the valuable closed-form GP
inference properties. Third, we propose a C-RAN based scalable wireless
prediction architecture, where the prediction accuracy and the time consumption
can be balanced by tuning the number of the BBUs according to the real-time
system demands. Experimental results show that our proposed scalable GP model
can outperform the state-of-the-art approaches considerably, in terms of
wireless traffic prediction performance
Nonparametric Basis Pursuit via Sparse Kernel-based Learning
Signal processing tasks as fundamental as sampling, reconstruction, minimum
mean-square error interpolation and prediction can be viewed under the prism of
reproducing kernel Hilbert spaces. Endowing this vantage point with
contemporary advances in sparsity-aware modeling and processing, promotes the
nonparametric basis pursuit advocated in this paper as the overarching
framework for the confluence of kernel-based learning (KBL) approaches
leveraging sparse linear regression, nuclear-norm regularization, and
dictionary learning. The novel sparse KBL toolbox goes beyond translating
sparse parametric approaches to their nonparametric counterparts, to
incorporate new possibilities such as multi-kernel selection and matrix
smoothing. The impact of sparse KBL to signal processing applications is
illustrated through test cases from cognitive radio sensing, microarray data
imputation, and network traffic prediction.Comment: IEEE SIGNAL PROCESSING MAGAZINE, 2013 (TO APPEAR
Dynamical functional prediction and classification, with application to traffic flow prediction
Motivated by the need for accurate traffic flow prediction in transportation
management, we propose a functional data method to analyze traffic flow
patterns and predict future traffic flow. In this study we approach the problem
by sampling traffic flow trajectories from a mixture of stochastic processes.
The proposed functional mixture prediction approach combines functional
prediction with probabilistic functional classification to take distinct
traffic flow patterns into account. The probabilistic classification procedure,
which incorporates functional clustering and discrimination, hinges on subspace
projection. The proposed methods not only assist in predicting traffic flow
trajectories, but also identify distinct patterns in daily traffic flow of
typical temporal trends and variabilities. The proposed methodology is widely
applicable in analysis and prediction of longitudinally recorded functional
data.Comment: Published in at http://dx.doi.org/10.1214/12-AOAS595 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Dynamic Spatio-temporal Graph-based CNNs for Traffic Prediction
Forecasting future traffic flows from previous ones is a challenging problem
because of their complex and dynamic nature of spatio-temporal structures. Most
existing graph-based CNNs attempt to capture the static relations while largely
neglecting the dynamics underlying sequential data. In this paper, we present
dynamic spatio-temporal graph-based CNNs (DST-GCNNs) by learning expressive
features to represent spatio-temporal structures and predict future traffic
flows from surveillance video data. In particular, DST-GCNN is a two stream
network. In the flow prediction stream, we present a novel graph-based
spatio-temporal convolutional layer to extract features from a graph
representation of traffic flows. Then several such layers are stacked together
to predict future flows over time. Meanwhile, the relations between traffic
flows in the graph are often time variant as the traffic condition changes over
time. To capture the graph dynamics, we use the graph prediction stream to
predict the dynamic graph structures, and the predicted structures are fed into
the flow prediction stream. Experiments on real datasets demonstrate that the
proposed model achieves competitive performances compared with the other
state-of-the-art methods
Bayesian Particle Tracking of Traffic Flows
We develop a Bayesian particle filter for tracking traffic flows that is
capable of capturing non-linearities and discontinuities present in flow
dynamics. Our model includes a hidden state variable that captures sudden
regime shifts between traffic free flow, breakdown and recovery. We develop an
efficient particle learning algorithm for real time on-line inference of states
and parameters. This requires a two step approach, first, resampling the
current particles, with a mixture predictive distribution and second,
propagation of states using the conditional posterior distribution. Particle
learning of parameters follows from updating recursions for conditional
sufficient statistics. To illustrate our methodology, we analyze measurements
of daily traffic flow from the Illinois interstate I-55 highway system. We
demonstrate how our filter can be used to inference the change of traffic flow
regime on a highway road segment based on a measurement from freeway
single-loop detectors. Finally, we conclude with directions for future
research
- …