205 research outputs found
Increment entropy as a measure of complexity for time series
Entropy has been a common index to quantify the complexity of time series in
a variety of fields. Here, we introduce increment entropy to measure the
complexity of time series in which each increment is mapped into a word of two
letters, one letter corresponding to direction and the other corresponding to
magnitude. The Shannon entropy of the words is termed as increment entropy
(IncrEn). Simulations on synthetic data and tests on epileptic EEG signals have
demonstrated its ability of detecting the abrupt change, regardless of
energetic (e.g. spikes or bursts) or structural changes. The computation of
IncrEn does not make any assumption on time series and it can be applicable to
arbitrary real-world data.Comment: 12pages,7figure,2 table
Sparse Semantic Map-Based Monocular Localization in Traffic Scenes Using Learned 2D-3D Point-Line Correspondences
Vision-based localization in a prior map is of crucial importance for
autonomous vehicles. Given a query image, the goal is to estimate the camera
pose corresponding to the prior map, and the key is the registration problem of
camera images within the map. While autonomous vehicles drive on the road under
occlusion (e.g., car, bus, truck) and changing environment appearance (e.g.,
illumination changes, seasonal variation), existing approaches rely heavily on
dense point descriptors at the feature level to solve the registration problem,
entangling features with appearance and occlusion. As a result, they often fail
to estimate the correct poses. To address these issues, we propose a sparse
semantic map-based monocular localization method, which solves 2D-3D
registration via a well-designed deep neural network. Given a sparse semantic
map that consists of simplified elements (e.g., pole lines, traffic sign
midpoints) with multiple semantic labels, the camera pose is then estimated by
learning the corresponding features between the 2D semantic elements from the
image and the 3D elements from the sparse semantic map. The proposed sparse
semantic map-based localization approach is robust against occlusion and
long-term appearance changes in the environments. Extensive experimental
results show that the proposed method outperforms the state-of-the-art
approaches
FEND: A Future Enhanced Distribution-Aware Contrastive Learning Framework for Long-tail Trajectory Prediction
Predicting the future trajectories of the traffic agents is a gordian
technique in autonomous driving. However, trajectory prediction suffers from
data imbalance in the prevalent datasets, and the tailed data is often more
complicated and safety-critical. In this paper, we focus on dealing with the
long-tail phenomenon in trajectory prediction. Previous methods dealing with
long-tail data did not take into account the variety of motion patterns in the
tailed data. In this paper, we put forward a future enhanced contrastive
learning framework to recognize tail trajectory patterns and form a feature
space with separate pattern clusters. Furthermore, a distribution aware hyper
predictor is brought up to better utilize the shaped feature space. Our method
is a model-agnostic framework and can be plugged into many well-known
baselines. Experimental results show that our framework outperforms the
state-of-the-art long-tail prediction method on tailed samples by 9.5% on ADE
and 8.5% on FDE, while maintaining or slightly improving the averaged
performance. Our method also surpasses many long-tail techniques on trajectory
prediction task.Comment: Accepted for publication at the IEEE/CVF Conference on Computer
Vision and Pattern Recognition 2023 (CVPR 2023
Behavioral Intention Prediction in Driving Scenes: A Survey
In the driving scene, the road agents usually conduct frequent interactions
and intention understanding of the surroundings. Ego-agent (each road agent
itself) predicts what behavior will be engaged by other road users all the time
and expects a shared and consistent understanding for safe movement. Behavioral
Intention Prediction (BIP) simulates such a human consideration process and
fulfills the early prediction of specific behaviors. Similar to other
prediction tasks, such as trajectory prediction, data-driven deep learning
methods have taken the primary pipeline in research. The rapid development of
BIP inevitably leads to new issues and challenges. To catalyze future research,
this work provides a comprehensive review of BIP from the available datasets,
key factors and challenges, pedestrian-centric and vehicle-centric BIP
approaches, and BIP-aware applications. Based on the investigation, data-driven
deep learning approaches have become the primary pipelines. The behavioral
intention types are still monotonous in most current datasets and methods
(e.g., Crossing (C) and Not Crossing (NC) for pedestrians and Lane Changing
(LC) for vehicles) in this field. In addition, for the safe-critical scenarios
(e.g., near-crashing situations), current research is limited. Through this
investigation, we identify open issues in behavioral intention prediction and
suggest possible insights for future research.Comment: 254 reference
Deep Virtual-to-Real Distillation for Pedestrian Crossing Prediction
Pedestrian crossing is one of the most typical behavior which conflicts with
natural driving behavior of vehicles. Consequently, pedestrian crossing
prediction is one of the primary task that influences the vehicle planning for
safe driving. However, current methods that rely on the practically collected
data in real driving scenes cannot depict and cover all kinds of scene
condition in real traffic world. To this end, we formulate a deep virtual to
real distillation framework by introducing the synthetic data that can be
generated conveniently, and borrow the abundant information of pedestrian
movement in synthetic videos for the pedestrian crossing prediction in real
data with a simple and lightweight implementation. In order to verify this
framework, we construct a benchmark with 4667 virtual videos owning about 745k
frames (called Virtual-PedCross-4667), and evaluate the proposed method on two
challenging datasets collected in real driving situations, i.e., JAAD and PIE
datasets. State-of-the-art performance of this framework is demonstrated by
exhaustive experiment analysis. The dataset and code can be downloaded from the
website \url{http://www.lotvs.net/code_data/}.Comment: Accepted by ITSC 202
- …