1,910 research outputs found
A Deep Spatio-Temporal Fuzzy Neural Network for Passenger Demand Prediction
In spite of its importance, passenger demand prediction is a highly
challenging problem, because the demand is simultaneously influenced by the
complex interactions among many spatial and temporal factors and other external
factors such as weather. To address this problem, we propose a Spatio-TEmporal
Fuzzy neural Network (STEF-Net) to accurately predict passenger demands
incorporating the complex interactions of all known important factors. We
design an end-to-end learning framework with different neural networks modeling
different factors. Specifically, we propose to capture spatio-temporal feature
interactions via a convolutional long short-term memory network and model
external factors via a fuzzy neural network that handles data uncertainty
significantly better than deterministic methods. To keep the temporal relations
when fusing two networks and emphasize discriminative spatio-temporal feature
interactions, we employ a novel feature fusion method with a convolution
operation and an attention layer. As far as we know, our work is the first to
fuse a deep recurrent neural network and a fuzzy neural network to model
complex spatial-temporal feature interactions with additional uncertain input
features for predictive learning. Experiments on a large-scale real-world
dataset show that our model achieves more than 10% improvement over the
state-of-the-art approaches.Comment: https://epubs.siam.org/doi/abs/10.1137/1.9781611975673.1
Recurrent Attention Models for Depth-Based Person Identification
We present an attention-based model that reasons on human body shape and
motion dynamics to identify individuals in the absence of RGB information,
hence in the dark. Our approach leverages unique 4D spatio-temporal signatures
to address the identification problem across days. Formulated as a
reinforcement learning task, our model is based on a combination of
convolutional and recurrent neural networks with the goal of identifying small,
discriminative regions indicative of human identity. We demonstrate that our
model produces state-of-the-art results on several published datasets given
only depth images. We further study the robustness of our model towards
viewpoint, appearance, and volumetric changes. Finally, we share insights
gleaned from interpretable 2D, 3D, and 4D visualizations of our model's
spatio-temporal attention.Comment: Computer Vision and Pattern Recognition (CVPR) 201
Structured Sequence Modeling with Graph Convolutional Recurrent Networks
This paper introduces Graph Convolutional Recurrent Network (GCRN), a deep
learning model able to predict structured sequences of data. Precisely, GCRN is
a generalization of classical recurrent neural networks (RNN) to data
structured by an arbitrary graph. Such structured sequences can represent
series of frames in videos, spatio-temporal measurements on a network of
sensors, or random walks on a vocabulary graph for natural language modeling.
The proposed model combines convolutional neural networks (CNN) on graphs to
identify spatial structures and RNN to find dynamic patterns. We study two
possible architectures of GCRN, and apply the models to two practical problems:
predicting moving MNIST data, and modeling natural language with the Penn
Treebank dataset. Experiments show that exploiting simultaneously graph spatial
and dynamic information about data can improve both precision and learning
speed
- …