126 research outputs found
SIG-VC: A Speaker Information Guided Zero-shot Voice Conversion System for Both Human Beings and Machines
Nowadays, as more and more systems achieve good performance in traditional
voice conversion (VC) tasks, people's attention gradually turns to VC tasks
under extreme conditions. In this paper, we propose a novel method for
zero-shot voice conversion. We aim to obtain intermediate representations for
speaker-content disentanglement of speech to better remove speaker information
and get pure content information. Accordingly, our proposed framework contains
a module that removes the speaker information from the acoustic feature of the
source speaker. Moreover, speaker information control is added to our system to
maintain the voice cloning performance. The proposed system is evaluated by
subjective and objective metrics. Results show that our proposed system
significantly reduces the trade-off problem in zero-shot voice conversion,
while it also manages to have high spoofing power to the speaker verification
system
Probabilistic Traversability Model for Risk-Aware Motion Planning in Off-Road Environments
A key challenge in off-road navigation is that even visually similar terrains
or ones from the same semantic class may have substantially different traction
properties. Existing work typically assumes no wheel slip or uses the expected
traction for motion planning, where the predicted trajectories provide a poor
indication of the actual performance if the terrain traction has high
uncertainty. In contrast, this work proposes to analyze terrain traversability
with the empirical distribution of traction parameters in unicycle dynamics,
which can be learned by a neural network in a self-supervised fashion. The
probabilistic traction model leads to two risk-aware cost formulations that
account for the worst-case expected cost and traction. To help the learned
model generalize to unseen environment, terrains with features that lead to
unreliable predictions are detected via a density estimator fit to the trained
network's latent space and avoided via auxiliary penalties during planning.
Simulation results demonstrate that the proposed approach outperforms existing
work that assumes no slip or uses the expected traction in both navigation
success rate and completion time. Furthermore, avoiding terrains with low
density-based confidence score achieves up to 30% improvement in success rate
when the learned traction model is used in a novel environment.Comment: To appear in IROS23. Video and code:
https://github.com/mit-acl/mppi_numb
RAMP: A Risk-Aware Mapping and Planning Pipeline for Fast Off-Road Ground Robot Navigation
A key challenge in fast ground robot navigation in 3D terrain is balancing
robot speed and safety. Recent work has shown that 2.5D maps (2D
representations with additional 3D information) are ideal for real-time safe
and fast planning. However, the prevalent approach of generating 2D occupancy
grids through raytracing makes the generated map unsafe to plan in, due to
inaccurate representation of unknown space. Additionally, existing planners
such as MPPI do not consider speeds in known free and unknown space separately,
leading to slower overall plans. The RAMP pipeline proposed here solves these
issues using new mapping and planning methods. This work first presents ground
point inflation with persistent spatial memory as a way to generate accurate
occupancy grid maps from classified pointclouds. Then we present an MPPI-based
planner with embedded variability in horizon, to maximize speed in known free
space while retaining cautionary penetration into unknown space. Finally, we
integrate this mapping and planning pipeline with risk constraints arising from
3D terrain, and verify that it enables fast and safe navigation using
simulations and hardware demonstrations.Comment: 7 pages submitted to ICRA 202
A Distributed Pipeline for Scalable, Deconflicted Formation Flying
Reliance on external localization infrastructure and centralized coordination
are main limiting factors for formation flying of vehicles in large numbers and
in unprepared environments. While solutions using onboard localization address
the dependency on external infrastructure, the associated coordination
strategies typically lack collision avoidance and scalability. To address these
shortcomings, we present a unified pipeline with onboard localization and a
distributed, collision-free motion planning strategy that scales to a large
number of vehicles. Since distributed collision avoidance strategies are known
to result in gridlock, we also present a decentralized task assignment solution
to deconflict vehicles. We experimentally validate our pipeline in simulation
and hardware. The results show that our approach for solving the optimization
problem associated with motion planning gives solutions within seconds in cases
where general purpose solvers fail due to high complexity. In addition, our
lightweight assignment strategy leads to successful and quicker formation
convergence in 96-100% of all trials, whereas indefinite gridlocks occur
without it for 33-50% of trials. By enabling large-scale, deconflicted
coordination, this pipeline should help pave the way for anytime, anywhere
deployment of aerial swarms.Comment: 8 main pages, 1 additional page, accepted to RA-L and IROS'2
Energy-Aware, Collision-Free Information Gathering for Heterogeneous Robot Teams
This paper considers the problem of safely coordinating a team of
sensor-equipped robots to reduce uncertainty about a dynamical process, where
the objective trades off information gain and energy cost. Optimizing this
trade-off is desirable, but leads to a non-monotone objective function in the
set of robot trajectories. Therefore, common multi-robot planners based on
coordinate descent lose their performance guarantees. Furthermore, methods that
handle non-monotonicity lose their performance guarantees when subject to
inter-robot collision avoidance constraints. As it is desirable to retain both
the performance guarantee and safety guarantee, this work proposes a
hierarchical approach with a distributed planner that uses local search with a
worst-case performance guarantees and a decentralized controller based on
control barrier functions that ensures safety and encourages timely arrival at
sensing locations. Via extensive simulations, hardware-in-the-loop tests and
hardware experiments, we demonstrate that the proposed approach achieves a
better trade-off between sensing and energy cost than coordinate-descent-based
algorithms.Comment: To appear in Transactions on Robotics; 18 pages and 16 figures. arXiv
admin note: text overlap with arXiv:2101.1109
CGD: Constraint-Guided Diffusion Policies for UAV Trajectory Planning
Traditional optimization-based planners, while effective, suffer from high
computational costs, resulting in slow trajectory generation. A successful
strategy to reduce computation time involves using Imitation Learning (IL) to
develop fast neural network (NN) policies from those planners, which are
treated as expert demonstrators. Although the resulting NN policies are
effective at quickly generating trajectories similar to those from the expert,
(1) their output does not explicitly account for dynamic feasibility, and (2)
the policies do not accommodate changes in the constraints different from those
used during training.
To overcome these limitations, we propose Constraint-Guided Diffusion (CGD),
a novel IL-based approach to trajectory planning. CGD leverages a hybrid
learning/online optimization scheme that combines diffusion policies with a
surrogate efficient optimization problem, enabling the generation of
collision-free, dynamically feasible trajectories. The key ideas of CGD include
dividing the original challenging optimization problem solved by the expert
into two more manageable sub-problems: (a) efficiently finding collision-free
paths, and (b) determining a dynamically-feasible time-parametrization for
those paths to obtain a trajectory. Compared to conventional neural network
architectures, we demonstrate through numerical evaluations significant
improvements in performance and dynamic feasibility under scenarios with new
constraints never encountered during training.Comment: 8 pages, 3 figure
- …