Search CORE

54,963 research outputs found

On discrete control of nonlinear systems with applications to robotics

Author: Eslami Mansour
Publication venue
Publication date
Field of study

Much progress has been reported in the areas of modeling and control of nonlinear dynamic systems in a continuous-time framework. From implementation point of view, however, it is essential to study these nonlinear systems directly in a discrete setting that is amenable for interfacing with digital computers. But to develop discrete models and discrete controllers for a nonlinear system such as robot is a nontrivial task. Robot is also inherently a variable-inertia dynamic system involving additional complications. Not only the computer-oriented models of these systems must satisfy the usual requirements for such models, but these must also be compatible with the inherent capabilities of computers and must preserve the fundamental physical characteristics of continuous-time systems such as the conservation of energy and/or momentum. Preliminary issues regarding discrete systems in general and discrete models of a typical industrial robot that is developed with full consideration of the principle of conservation of energy are presented. Some research on the pertinent tactile information processing is reviewed. Finally, system control methods and how to integrate these issues in order to complete the task of discrete control of a robot manipulator are also reviewed

NASA Technical Reports Server

Autonomous Reinforcement of Behavioral Sequences in Neural Dynamics

Author: Kazerounian Sohrob
Luciw Matthew
Richter Mathis
Sandamirskaya Yulia
Publication venue
Publication date: 01/01/2012
Field of study

We introduce a dynamic neural algorithm called Dynamic Neural (DN) SARSA(\lambda) for learning a behavioral sequence from delayed reward. DN-SARSA(\lambda) combines Dynamic Field Theory models of behavioral sequence representation, classical reinforcement learning, and a computational neuroscience model of working memory, called Item and Order working memory, which serves as an eligibility trace. DN-SARSA(\lambda) is implemented on both a simulated and real robot that must learn a specific rewarding sequence of elementary behaviors from exploration. Results show DN-SARSA(\lambda) performs on the level of the discrete SARSA(\lambda), validating the feasibility of general reinforcement learning without compromising neural dynamics.Comment: Sohrob Kazerounian, Matthew Luciw are Joint first author

arXiv.org e-Print Archive

CiteSeerX

Control of Locomotion with Shape-Changing Wheels

Author: Kumar Vijay
Mellinger Daniel
Yim Mark
Publication venue: ScholarlyCommons
Publication date: 01/05/2009
Field of study

We present a novel approach to controlling the locomotion of a wheel by changing its shape, leading to applications to the synthesis and closed-loop control of gaits for modular robots. A dynamic model of a planar, continuous deformable ellipse in contact with a ground surface is derived. We present two alternative approaches to controlling this system and a method for mapping the gaits to a discrete rolling polygon. Mathematical models and dynamic simulation of the continuous approximation and the discrete n-body system, and experimental results obtained from a physical modular robot system illustrate the accuracy of the dynamic models and the validity of the approach

Crossref

ScholarlyCommons@Penn

Macro-continuous dynamics for hyper-redundant robots: application to locomotion bio-inspired by elongated animals

Author: Ali Shaukat
Boyer Frédéric
Porez Mathieu
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

International audienceThis article presents a unified dynamic modeling approach of continuum robots. The robot is modeled as a geometrically exact beam continuously actuated through an active strain law. Once included into the geometric mechanics of locomotion, the approach applies to any hyper-redundant or continuous robot devoted to manipulation and/or locomotion. Furthermore, exploiting the nature of the resulting models as being a continuous version of the Newton-Euler models of discrete robots, an algorithm is proposed which is capable of computing the internal control torques (and/or forces) as well as the rigid overall motions of the locomotor robot. The efficiency of the approach is finally illustrated through many examples directly related to the terrestrial locomotion of elongated animals as snakes, worms or caterpillars and their associated bio-mimetic artifacts

HAL Descartes

HAL Mines Nantes

Multi-Robot Transfer Learning: A Dynamical System Perspective

Author: Helwa Mohamed K.
Schoellig Angela P.
Publication venue
Publication date: 26/07/2017
Field of study

Multi-robot transfer learning allows a robot to use data generated by a second, similar robot to improve its own behavior. The potential advantages are reducing the time of training and the unavoidable risks that exist during the training phase. Transfer learning algorithms aim to find an optimal transfer map between different robots. In this paper, we investigate, through a theoretical study of single-input single-output (SISO) systems, the properties of such optimal transfer maps. We first show that the optimal transfer learning map is, in general, a dynamic system. The main contribution of the paper is to provide an algorithm for determining the properties of this optimal dynamic map including its order and regressors (i.e., the variables it depends on). The proposed algorithm does not require detailed knowledge of the robots' dynamics, but relies on basic system properties easily obtainable through simple experimental tests. We validate the proposed algorithm experimentally through an example of transfer learning between two different quadrotor platforms. Experimental results show that an optimal dynamic map, with correct properties obtained from our proposed algorithm, achieves 60-70% reduction of transfer learning error compared to the cases when the data is directly transferred or transferred using an optimal static map.Comment: 7 pages, 6 figures, accepted at the 2017 IEEE/RSJ International Conference on Intelligent Robots and System

arXiv.org e-Print Archive

Crossref

Closed-loop Bayesian Semantic Data Fusion for Collaborative Human-Autonomy Target Search

Author: Ahmed Nisar
Barbier Luke
Burks Luke
Loefgren Ian
McGinley Jamison
Muesing Jeremy
Vunnam Sousheel
Publication venue
Publication date: 02/06/2018
Field of study

In search applications, autonomous unmanned vehicles must be able to efficiently reacquire and localize mobile targets that can remain out of view for long periods of time in large spaces. As such, all available information sources must be actively leveraged -- including imprecise but readily available semantic observations provided by humans. To achieve this, this work develops and validates a novel collaborative human-machine sensing solution for dynamic target search. Our approach uses continuous partially observable Markov decision process (CPOMDP) planning to generate vehicle trajectories that optimally exploit imperfect detection data from onboard sensors, as well as semantic natural language observations that can be specifically requested from human sensors. The key innovation is a scalable hierarchical Gaussian mixture model formulation for efficiently solving CPOMDPs with semantic observations in continuous dynamic state spaces. The approach is demonstrated and validated with a real human-robot team engaged in dynamic indoor target search and capture scenarios on a custom testbed.Comment: Final version accepted and submitted to 2018 FUSION Conference (Cambridge, UK, July 2018

arXiv.org e-Print Archive

Crossref

Learning a Unified Control Policy for Safe Falling

Author: Ha Sehoon
Kumar Visak CV
Liu C Karen
Publication venue
Publication date: 20/04/2017
Field of study

Being able to fall safely is a necessary motor skill for humanoids performing highly dynamic tasks, such as running and jumping. We propose a new method to learn a policy that minimizes the maximal impulse during the fall. The optimization solves for both a discrete contact planning problem and a continuous optimal control problem. Once trained, the policy can compute the optimal next contacting body part (e.g. left foot, right foot, or hands), contact location and timing, and the required joint actuation. We represent the policy as a mixture of actor-critic neural network, which consists of n control policies and the corresponding value functions. Each pair of actor-critic is associated with one of the n possible contacting body parts. During execution, the policy corresponding to the highest value function will be executed while the associated body part will be the next contact with the ground. With this mixture of actor-critic architecture, the discrete contact sequence planning is solved through the selection of the best critics while the continuous control problem is solved by the optimization of actors. We show that our policy can achieve comparable, sometimes even higher, rewards than a recursive search of the action space using dynamic programming, while enjoying 50 to 400 times of speed gain during online execution

arXiv.org e-Print Archive

Crossref