71 research outputs found
Person Search with Natural Language Description
Searching persons in large-scale image databases with the query of natural
language description has important applications in video surveillance. Existing
methods mainly focused on searching persons with image-based or attribute-based
queries, which have major limitations for a practical usage. In this paper, we
study the problem of person search with natural language description. Given the
textual description of a person, the algorithm of the person search is required
to rank all the samples in the person database then retrieve the most relevant
sample corresponding to the queried description. Since there is no person
dataset or benchmark with textual description available, we collect a
large-scale person description dataset with detailed natural language
annotations and person samples from various sources, termed as CUHK Person
Description Dataset (CUHK-PEDES). A wide range of possible models and baselines
have been evaluated and compared on the person search benchmark. An Recurrent
Neural Network with Gated Neural Attention mechanism (GNA-RNN) is proposed to
establish the state-of-the art performance on person search
A Holistic Framework Towards Vision-based Traffic Signal Control with Microscopic Simulation
Traffic signal control (TSC) is crucial for reducing traffic congestion that
leads to smoother traffic flow, reduced idling time, and mitigated CO2
emissions. In this study, we explore the computer vision approach for TSC that
modulates on-road traffic flows through visual observation. Unlike traditional
feature-based approaches, vision-based methods depend much less on heuristics
and predefined features, bringing promising potentials for end-to-end learning
and optimization of traffic signals. Thus, we introduce a holistic traffic
simulation framework called TrafficDojo towards vision-based TSC and its
benchmarking by integrating the microscopic traffic flow provided in SUMO into
the driving simulator MetaDrive. This proposed framework offers a versatile
traffic environment for in-depth analysis and comprehensive evaluation of
traffic signal controllers across diverse traffic conditions and scenarios. We
establish and compare baseline algorithms including both traditional and
Reinforecment Learning (RL) approaches. This work sheds insights into the
design and development of vision-based TSC approaches and open up new research
opportunities. All the code and baselines will be made publicly available.Comment: Under review for IEEE publication
CAT: Closed-loop Adversarial Training for Safe End-to-End Driving
Driving safety is a top priority for autonomous vehicles. Orthogonal to prior
work handling accident-prone traffic events by algorithm designs at the policy
level, we investigate a Closed-loop Adversarial Training (CAT) framework for
safe end-to-end driving in this paper through the lens of environment
augmentation. CAT aims to continuously improve the safety of driving agents by
training the agent on safety-critical scenarios that are dynamically generated
over time. A novel resampling technique is developed to turn log-replay
real-world driving scenarios into safety-critical ones via probabilistic
factorization, where the adversarial traffic generation is modeled as the
multiplication of standard motion prediction sub-problems. Consequently, CAT
can launch more efficient physical attacks compared to existing safety-critical
scenario generation methods and yields a significantly less computational cost
in the iterative learning pipeline. We incorporate CAT into the MetaDrive
simulator and validate our approach on hundreds of driving scenarios imported
from real-world driving datasets. Experimental results demonstrate that CAT can
effectively generate adversarial scenarios countering the agent being trained.
After training, the agent can achieve superior driving safety in both
log-replay and safety-critical traffic scenarios on the held-out test set. Code
and data are available at https://metadriverse.github.io/cat.Comment: 7th Conference on Robot Learning (CoRL 2023
Guarded Policy Optimization with Imperfect Online Demonstrations
The Teacher-Student Framework (TSF) is a reinforcement learning setting where
a teacher agent guards the training of a student agent by intervening and
providing online demonstrations. Assuming optimal, the teacher policy has the
perfect timing and capability to intervene in the learning process of the
student agent, providing safety guarantee and exploration guidance.
Nevertheless, in many real-world settings it is expensive or even impossible to
obtain a well-performing teacher policy. In this work, we relax the assumption
of a well-performing teacher and develop a new method that can incorporate
arbitrary teacher policies with modest or inferior performance. We instantiate
an Off-Policy Reinforcement Learning algorithm, termed Teacher-Student Shared
Control (TS2C), which incorporates teacher intervention based on
trajectory-based value estimation. Theoretical analysis validates that the
proposed TS2C algorithm attains efficient exploration and substantial safety
guarantee without being affected by the teacher's own performance. Experiments
on various continuous control tasks show that our method can exploit teacher
policies at different performance levels while maintaining a low training cost.
Moreover, the student policy surpasses the imperfect teacher policy in terms of
higher accumulated reward in held-out testing environments. Code is available
at https://metadriverse.github.io/TS2C.Comment: Accepted at ICLR 2023 (top 25%
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning
Driving safely requires multiple capabilities from human and intelligent
agents, such as the generalizability to unseen environments, the safety
awareness of the surrounding traffic, and the decision-making in complex
multi-agent settings. Despite the great success of Reinforcement Learning (RL),
most of the RL research works investigate each capability separately due to the
lack of integrated environments. In this work, we develop a new driving
simulation platform called MetaDrive to support the research of generalizable
reinforcement learning algorithms for machine autonomy. MetaDrive is highly
compositional, which can generate an infinite number of diverse driving
scenarios from both the procedural generation and the real data importing.
Based on MetaDrive, we construct a variety of RL tasks and baselines in both
single-agent and multi-agent settings, including benchmarking generalizability
across unseen scenes, safe exploration, and learning multi-agent traffic. The
generalization experiments conducted on both procedurally generated scenarios
and real-world scenarios show that increasing the diversity and the size of the
training set leads to the improvement of the generalizability of the RL agents.
We further evaluate various safe reinforcement learning and multi-agent
reinforcement learning algorithms in MetaDrive environments and provide the
benchmarks. Source code, documentation, and demo video are available at
https://metadriverse.github.io/metadrive . More research projects based on
MetaDrive simulator are listed at https://metadriverse.github.ioComment: Source code, documentation, and demo video are available at
https://metadriverse.github.io/metadrive . More research projects based on
MetaDrive simulator are listed at https://metadriverse.github.i
- …