100 research outputs found
ORLA*: Mobile Manipulator-Based Object Rearrangement with Lazy A*
Effectively performing object rearrangement is an essential skill for mobile
manipulators, e.g., setting up a dinner table or organizing a desk. A key
challenge in such problems is deciding an appropriate manipulation order for
objects to effectively untangle dependencies between objects while considering
the necessary motions for realizing the manipulations (e.g., pick and place).
To our knowledge, computing time-optimal multi-object rearrangement solutions
for mobile manipulators remains a largely untapped research direction. In this
research, we propose ORLA*, which leverages delayed (lazy) evaluation in
searching for a high-quality object pick and place sequence that considers both
end-effector and mobile robot base travel. ORLA* also supports multi-layered
rearrangement tasks considering pile stability using machine learning.
Employing an optimal solver for finding temporary locations for displacing
objects, ORLA* can achieve global optimality. Through extensive simulation and
ablation study, we confirm the effectiveness of ORLA* delivering quality
solutions for challenging rearrangement instances. Supplementary materials are
available at: https://gaokai15.github.io/ORLA-Star/Comment: Submitted to ICRA 202
Nonprehensile Planar Manipulation through Reinforcement Learning with Multimodal Categorical Exploration
Developing robot controllers capable of achieving dexterous nonprehensile
manipulation, such as pushing an object on a table, is challenging. The
underactuated and hybrid-dynamics nature of the problem, further complicated by
the uncertainty resulting from the frictional interactions, requires
sophisticated control behaviors. Reinforcement Learning (RL) is a powerful
framework for developing such robot controllers. However, previous RL
literature addressing the nonprehensile pushing task achieves low accuracy,
non-smooth trajectories, and only simple motions, i.e. without rotation of the
manipulated object. We conjecture that previously used unimodal exploration
strategies fail to capture the inherent hybrid-dynamics of the task, arising
from the different possible contact interaction modes between the robot and the
object, such as sticking, sliding, and separation. In this work, we propose a
multimodal exploration approach through categorical distributions, which
enables us to train planar pushing RL policies for arbitrary starting and
target object poses, i.e. positions and orientations, and with improved
accuracy. We show that the learned policies are robust to external disturbances
and observation noise, and scale to tasks with multiple pushers. Furthermore,
we validate the transferability of the learned policies, trained entirely in
simulation, to a physical robot hardware using the KUKA iiwa robot arm. See our
supplemental video: https://youtu.be/vTdva1mgrk4
Progressive Transfer Learning for Dexterous In-Hand Manipulation with Multi-Fingered Anthropomorphic Hand
Dexterous in-hand manipulation for a multi-fingered anthropomorphic hand is
extremely difficult because of the high-dimensional state and action spaces,
rich contact patterns between the fingers and objects. Even though deep
reinforcement learning has made moderate progress and demonstrated its strong
potential for manipulation, it is still faced with certain challenges, such as
large-scale data collection and high sample complexity. Especially, for some
slight change scenes, it always needs to re-collect vast amounts of data and
carry out numerous iterations of fine-tuning. Remarkably, humans can quickly
transfer learned manipulation skills to different scenarios with little
supervision. Inspired by human flexible transfer learning capability, we
propose a novel dexterous in-hand manipulation progressive transfer learning
framework (PTL) based on efficiently utilizing the collected trajectories and
the source-trained dynamics model. This framework adopts progressive neural
networks for dynamics model transfer learning on samples selected by a new
samples selection method based on dynamics properties, rewards and scores of
the trajectories. Experimental results on contact-rich anthropomorphic hand
manipulation tasks show that our method can efficiently and effectively learn
in-hand manipulation skills with a few online attempts and adjustment learning
under the new scene. Compared to learning from scratch, our method can reduce
training time costs by 95%.Comment: 12 pages, 7 figures, submitted to TNNL
Learning Hybrid Actor-Critic Maps for 6D Non-Prehensile Manipulation
Manipulating objects without grasping them is an essential component of human
dexterity, referred to as non-prehensile manipulation. Non-prehensile
manipulation may enable more complex interactions with the objects, but also
presents challenges in reasoning about gripper-object interactions. In this
work, we introduce Hybrid Actor-Critic Maps for Manipulation (HACMan), a
reinforcement learning approach for 6D non-prehensile manipulation of objects
using point cloud observations. HACMan proposes a temporally-abstracted and
spatially-grounded object-centric action representation that consists of
selecting a contact location from the object point cloud and a set of motion
parameters describing how the robot will move after making contact. We modify
an existing off-policy RL algorithm to learn in this hybrid discrete-continuous
action representation. We evaluate HACMan on a 6D object pose alignment task in
both simulation and in the real world. On the hardest version of our task, with
randomized initial poses, randomized 6D goals, and diverse object categories,
our policy demonstrates strong generalization to unseen object categories
without a performance drop, achieving an 89% success rate on unseen objects in
simulation and 50% success rate with zero-shot transfer in the real world.
Compared to alternative action representations, HACMan achieves a success rate
more than three times higher than the best baseline. With zero-shot sim2real
transfer, our policy can successfully manipulate unseen objects in the real
world for challenging non-planar goals, using dynamic and contact-rich
non-prehensile skills. Videos can be found on the project website:
https://hacman-2023.github.io
Legged Robots for Object Manipulation: A Review
Legged robots can have a unique role in manipulating objects in dynamic,
human-centric, or otherwise inaccessible environments. Although most legged
robotics research to date typically focuses on traversing these challenging
environments, many legged platform demonstrations have also included "moving an
object" as a way of doing tangible work. Legged robots can be designed to
manipulate a particular type of object (e.g., a cardboard box, a soccer ball,
or a larger piece of furniture), by themselves or collaboratively. The
objective of this review is to collect and learn from these examples, to both
organize the work done so far in the community and highlight interesting open
avenues for future work. This review categorizes existing works into four main
manipulation methods: object interactions without grasping, manipulation with
walking legs, dedicated non-locomotive arms, and legged teams. Each method has
different design and autonomy features, which are illustrated by available
examples in the literature. Based on a few simplifying assumptions, we further
provide quantitative comparisons for the range of possible relative sizes of
the manipulated object with respect to the robot. Taken together, these
examples suggest new directions for research in legged robot manipulation, such
as multifunctional limbs, terrain modeling, or learning-based control, to
support a number of new deployments in challenging indoor/outdoor scenarios in
warehouses/construction sites, preserved natural areas, and especially for home
robotics.Comment: Preprint of the paper submitted to Frontiers in Mechanical
Engineerin
Versatile Multi-Contact Planning and Control for Legged Loco-Manipulation
Loco-manipulation planning skills are pivotal for expanding the utility of
robots in everyday environments. These skills can be assessed based on a
system's ability to coordinate complex holistic movements and multiple contact
interactions when solving different tasks. However, existing approaches have
been merely able to shape such behaviors with hand-crafted state machines,
densely engineered rewards, or pre-recorded expert demonstrations. Here, we
propose a minimally-guided framework that automatically discovers whole-body
trajectories jointly with contact schedules for solving general
loco-manipulation tasks in pre-modeled environments. The key insight is that
multi-modal problems of this nature can be formulated and treated within the
context of integrated Task and Motion Planning (TAMP). An effective bilevel
search strategy is achieved by incorporating domain-specific rules and
adequately combining the strengths of different planning techniques: trajectory
optimization and informed graph search coupled with sampling-based planning. We
showcase emergent behaviors for a quadrupedal mobile manipulator exploiting
both prehensile and non-prehensile interactions to perform real-world tasks
such as opening/closing heavy dishwashers and traversing spring-loaded doors.
These behaviors are also deployed on the real system using a two-layer
whole-body tracking controller
Energy-saving Trajectory And Control Design For Quadrotors With Slung Payloads
Quadrotors have promising applications such as payload transportation, which can change the future of the package delivery industry. However, many challenges block the way of implementing payload transportation in reality. Slung payload vibrations and quadrotor's energy consumption are among the major challenges, which are related to each other because payload vibrations affect energy consumption.
In this dissertation, the kinematics, dynamics, and energy models are first developed for both a single quadrotor and a transportation system consisting of a quadrotor with a slung payload. The proposed energy model is novel and introduces the concepts of power and energy quotients that, unlike the existing models, do not depend on quadrotor-related parameters such as motor and propeller parameters.
This is the first energy model for such a transportation system.
Second, this dissertation focuses on polynomial trajectories, where a generic framework to design feasible polynomial trajectories of arbitrary degree with a large number of waypoints is presented. This allows for extending the capabilities of polynomial trajectories to overcome some kinematic limitations associated with continuous-path trajectories, e.g., arbitrary kinematic constraints.
Third, extensive vibration analyses of the transportation system and polynomial trajectories are conducted. As a result, a novel controller-independent payload vibration reduction method is proposed. The proposed method is more generic than the existing methods, e.g., anti-swing controllers.
Fourth, the effects of polynomial trajectories, payload mass, and cable length on quadrotor's energy consumption are studied. The comparison with an energy-minimized trajectory shows that polynomial trajectories are not only energy-efficient, but their design is simpler than energy-minimized trajectories and does not require quadrotor-related parameters.
Lastly, a robust energy-saving sliding mode controller with input saturation is designed for the transportation system. The experimental results show that the proposed controller is robust and energy-efficient when, qualitatively, compared with an existing energy-saving controller. The proposed controller is the first energy-saving controllers for such a transportation system.
This dissertation opens the door for package delivery with quadrotors by providing the first energy analysis, and energy-saving trajectories and controllers for quadrotors with slung payloads
Robust Model Predictive Control for Linear Parameter Varying Systems along with Exploration of its Application in Medical Mobile Robots
This thesis seeks to develop a robust model predictive controller (MPC) for Linear Parameter Varying (LPV) systems. LPV models based on input-output display are employed. We aim to improve robust MPC methods for LPV systems with an input-output display. This improvement will be examined from two perspectives. First, the system must be stable in conditions of uncertainty (in signal scheduling or due to disturbance) and perform well in both tracking and regulation problems. Secondly, the proposed method should be practical, i.e., it should have a reasonable computational load and not be conservative.
Firstly, an interpolation approach is utilized to minimize the conservativeness of the MPC. The controller is calculated as a linear combination of a set of offline predefined control laws. The coefficients of these offline controllers are derived from a real-time optimization problem. The control gains are determined to ensure stability and increase the terminal set.
Secondly, in order to test the system's robustness to external disturbances, a free control move was added to the control law. Also, a Recurrent Neural Network (RNN) algorithm is applied for online optimization, showing that this optimization method has better speed and accuracy than traditional algorithms. The proposed controller was compared with two methods (robust MPC and MPC with LPV model based on input-output) in reference tracking and disturbance rejection scenarios. It was shown that the proposed method works well in both parts. However, two other methods could not deal with the disturbance.
Thirdly, a support vector machine was introduced to identify the input-output LPV model to estimate the output. The estimated model was compared with the actual nonlinear system outputs, and the identification was shown to be effective. As a consequence, the controller can accurately follow the reference.
Finally, an interpolation-based MPC with free control moves is implemented for a wheeled mobile robot in a hospital setting, where an RNN solves the online optimization problem. The controller was compared with a robust MPC and MPC-LPV in reference tracking, disturbance rejection, online computational load, and region of attraction. The results indicate that our proposed method surpasses and can navigate quickly and reliably while avoiding obstacles
- …