8,080 research outputs found
FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras
In this paper, we develop deep spatio-temporal neural networks to
sequentially count vehicles from low quality videos captured by city cameras
(citycams). Citycam videos have low resolution, low frame rate, high occlusion
and large perspective, making most existing methods lose their efficacy. To
overcome limitations of existing methods and incorporate the temporal
information of traffic video, we design a novel FCN-rLSTM network to jointly
estimate vehicle density and vehicle count by connecting fully convolutional
neural networks (FCN) with long short term memory networks (LSTM) in a residual
learning fashion. Such design leverages the strengths of FCN for pixel-level
prediction and the strengths of LSTM for learning complex temporal dynamics.
The residual learning connection reformulates the vehicle count regression as
learning residual functions with reference to the sum of densities in each
frame, which significantly accelerates the training of networks. To preserve
feature map resolution, we propose a Hyper-Atrous combination to integrate
atrous convolution in FCN and combine feature maps of different convolution
layers. FCN-rLSTM enables refined feature representation and a novel end-to-end
trainable mapping from pixels to vehicle count. We extensively evaluated the
proposed method on different counting tasks with three datasets, with
experimental results demonstrating their effectiveness and robustness. In
particular, FCN-rLSTM reduces the mean absolute error (MAE) from 5.31 to 4.21
on TRANCOS, and reduces the MAE from 2.74 to 1.53 on WebCamT. Training process
is accelerated by 5 times on average.Comment: Accepted by International Conference on Computer Vision (ICCV), 201
Understanding Traffic Density from Large-Scale Web Camera Data
Understanding traffic density from large-scale web camera (webcam) videos is
a challenging problem because such videos have low spatial and temporal
resolution, high occlusion and large perspective. To deeply understand traffic
density, we explore both deep learning based and optimization based methods. To
avoid individual vehicle detection and tracking, both methods map the image
into vehicle density map, one based on rank constrained regression and the
other one based on fully convolution networks (FCN). The regression based
method learns different weights for different blocks in the image to increase
freedom degrees of weights and embed perspective information. The FCN based
method jointly estimates vehicle density map and vehicle count with a residual
learning framework to perform end-to-end dense prediction, allowing arbitrary
image resolution, and adapting to different vehicle scales and perspectives. We
analyze and compare both methods, and get insights from optimization based
method to improve deep model. Since existing datasets do not cover all the
challenges in our work, we collected and labelled a large-scale traffic video
dataset, containing 60 million frames from 212 webcams. Both methods are
extensively evaluated and compared on different counting tasks and datasets.
FCN based method significantly reduces the mean absolute error from 10.99 to
5.31 on the public dataset TRANCOS compared with the state-of-the-art baseline.Comment: Accepted by CVPR 2017. Preprint version was uploaded on
http://welcome.isr.tecnico.ulisboa.pt/publications/understanding-traffic-density-from-large-scale-web-camera-data
DDP-GCN: Multi-Graph Convolutional Network for Spatiotemporal Traffic Forecasting
Traffic speed forecasting is one of the core problems in Intelligent
Transportation Systems. For a more accurate prediction, recent studies started
using not only the temporal speed patterns but also the spatial information on
the road network through the graph convolutional networks. Even though the road
network is highly complex due to its non-Euclidean and directional
characteristics, previous approaches mainly focus on modeling the spatial
dependencies only with the distance. In this paper, we identify two essential
spatial dependencies in traffic forecasting in addition to distance, direction
and positional relationship, for designing basic graph elements as the smallest
building blocks. Using the building blocks, we suggest DDP-GCN (Distance,
Direction, and Positional relationship Graph Convolutional Network) to
incorporate the three spatial relationships into prediction network for traffic
forecasting. We evaluate the proposed model with two large-scale real-world
datasets, and find 7.40% average improvement for 1-hour forecasting in highly
complex urban networks
Neural-Attention-Based Deep Learning Architectures for Modeling Traffic Dynamics on Lane Graphs
Deep neural networks can be powerful tools, but require careful
application-specific design to ensure that the most informative relationships
in the data are learnable. In this paper, we apply deep neural networks to the
nonlinear spatiotemporal physics problem of vehicle traffic dynamics. We
consider problems of estimating macroscopic quantities (e.g., the queue at an
intersection) at a lane level. First-principles modeling at the lane scale has
been a challenge due to complexities in modeling social behaviors like lane
changes, and those behaviors' resultant macro-scale effects. Following domain
knowledge that upstream/downstream lanes and neighboring lanes affect each
others' traffic flows in distinct ways, we apply a form of neural attention
that allows the neural network layers to aggregate information from different
lanes in different manners. Using a microscopic traffic simulator as a testbed,
we obtain results showing that an attentional neural network model can use
information from nearby lanes to improve predictions, and, that explicitly
encoding the lane-to-lane relationship types significantly improves
performance. We also demonstrate the transfer of our learned neural network to
a more complex road network, discuss how its performance degradation may be
attributable to new traffic behaviors induced by increased topological
complexity, and motivate learning dynamics models from many road network
topologies.Comment: To appear at 2019 IEEE Conference on Intelligent Transportation
System
- …