482 research outputs found
Energy-efficient Amortized Inference with Cascaded Deep Classifiers
Deep neural networks have been remarkable successful in various AI tasks but
often cast high computation and energy cost for energy-constrained applications
such as mobile sensing. We address this problem by proposing a novel framework
that optimizes the prediction accuracy and energy cost simultaneously, thus
enabling effective cost-accuracy trade-off at test time. In our framework, each
data instance is pushed into a cascade of deep neural networks with increasing
sizes, and a selection module is used to sequentially determine when a
sufficiently accurate classifier can be used for this data instance. The
cascade of neural networks and the selection module are jointly trained in an
end-to-end fashion by the REINFORCE algorithm to optimize a trade-off between
the computational cost and the predictive accuracy. Our method is able to
simultaneously improve the accuracy and efficiency by learning to assign easy
instances to fast yet sufficiently accurate classifiers to save computation and
energy cost, while assigning harder instances to deeper and more powerful
classifiers to ensure satisfiable accuracy. With extensive experiments on
several image classification datasets using cascaded ResNet classifiers, we
demonstrate that our method outperforms the standard well-trained ResNets in
accuracy but only requires less than 20% and 50% FLOPs cost on the CIFAR-10/100
datasets and 66% on the ImageNet dataset, respectively
FOLT: Fast Multiple Object Tracking from UAV-captured Videos Based on Optical Flow
Multiple object tracking (MOT) has been successfully investigated in computer
vision.
However, MOT for the videos captured by unmanned aerial vehicles (UAV) is
still challenging due to small object size, blurred object appearance, and very
large and/or irregular motion in both ground objects and UAV platforms.
In this paper, we propose FOLT to mitigate these problems and reach fast and
accurate MOT in UAV view.
Aiming at speed-accuracy trade-off, FOLT adopts a modern detector and
light-weight optical flow extractor to extract object detection features and
motion features at a minimum cost.
Given the extracted flow, the flow-guided feature augmentation is designed to
augment the object detection feature based on its optical flow, which improves
the detection of small objects.
Then the flow-guided motion prediction is also proposed to predict the
object's position in the next frame, which improves the tracking performance of
objects with very large displacements between adjacent frames.
Finally, the tracker matches the detected objects and predicted objects using
a spatially matching scheme to generate tracks for every object.
Experiments on Visdrone and UAVDT datasets show that our proposed model can
successfully track small objects with large and irregular motion and outperform
existing state-of-the-art methods in UAV-MOT tasks.Comment: Accepted by ACM Multi-Media 202
Teaching Autonomous Vehicles to Express Interaction Intent during Unprotected Left Turns: A Human-Driving-Prior-Based Trajectory Planning Approach
Incorporating Autonomous Vehicles (AVs) into existing transportation systems
necessitates examining their coexistence with Human-driven Vehicles (HVs) in
mixed traffic environments. Central to this coexistence is the AVs' ability to
emulate human-like interaction intentions within traffic scenarios. We
introduce a novel framework for planning unprotected left-turn trajectories for
AVs, designed to mirror human driving behaviors and effectively communicate
social intentions. This framework consists of three phases: trajectory
generation, evaluation, and selection.In the trajectory generation phase, we
utilize real human-driving trajectory data to establish constraints for a
predicted trajectory space, creating candidate motion trajectories that reflect
intent. The evaluation phase incorporates maximum entropy inverse reinforcement
learning (ME-IRL) to gauge human trajectory preferences, considering aspects
like traffic efficiency, driving comfort, and interactive safety. During the
selection phase, a Boltzmann distribution-based approach is employed to assign
rewards and probabilities to the candidate trajectories, promoting human-like
decision-making. We validate our framework using an authentic trajectory
dataset and conduct a comparative analysis with various baseline methods. Our
results, derived from simulator tests and human-in-the-loop driving
experiments, affirm our framework's superiority in mimicking human-like
driving, expressing intent, and computational efficiency. For additional
information of this research, please visit https://shorturl.at/jqu35
DDM-Lag : A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement
Decision-making stands as a pivotal component in the realm of autonomous
vehicles (AVs), playing a crucial role in navigating the intricacies of
autonomous driving. Amidst the evolving landscape of data-driven methodologies,
enhancing decision-making performance in complex scenarios has emerged as a
prominent research focus. Despite considerable advancements, current
learning-based decision-making approaches exhibit potential for refinement,
particularly in aspects of policy articulation and safety assurance. To address
these challenges, we introduce DDM-Lag, a Diffusion Decision Model, augmented
with Lagrangian-based safety enhancements. This work conceptualizes the
sequential decision-making challenge inherent in autonomous driving as a
problem of generative modeling, adopting diffusion models as the medium for
assimilating patterns of decision-making. We introduce a hybrid policy update
strategy for diffusion models, amalgamating the principles of behavior cloning
and Q-learning, alongside the formulation of an Actor-Critic architecture for
the facilitation of updates. To augment the model's exploration process with a
layer of safety, we incorporate additional safety constraints, employing a
sophisticated policy optimization technique predicated on Lagrangian relaxation
to refine the policy learning endeavor comprehensively. Empirical evaluation of
our proposed decision-making methodology was conducted across a spectrum of
driving tasks, distinguished by their varying degrees of complexity and
environmental contexts. The comparative analysis with established baseline
methodologies elucidates our model's superior performance, particularly in
dimensions of safety and holistic efficacy
- …