Search CORE

80 research outputs found

Learning visual docking for non-holonomic autonomous vehicles

Author: Duckett Tom
Martinez-Marin Tomas
Publication venue
Publication date: 01/06/2008
Field of study

This paper presents a new method of learning visual docking skills for non-holonomic vehicles by direct interaction with the environment. The method is based on a reinforcement algorithm, which speeds up Q-learning by applying memorybased sweeping and enforcing the “adjoining property”, a filtering mechanism to only allow transitions between states that satisfy a fixed distance. The method overcomes some limitations of reinforcement learning techniques when they are employed in applications with continuous non-linear systems, such as car-like vehicles. In particular, a good approximation to the optimal behaviour is obtained by a small look-up table. The algorithm is tested within an image-based visual servoing framework on a docking task. The training time was less than 1 hour on the real vehicle. In experiments, we show the satisfactory performance of the algorithm

University of Lincoln Institutional Repository

Crossref

Fast reinforcement learning for vision-guided mobile robots

Author: Duckett T.
Martinez-Marin T.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2005
Field of study

This paper presents a new reinforcement learning algorithm for accelerating acquisition of new skills by real mobile robots, without requiring simulation. It speeds up Q-learning by applying memory-based sweeping and enforcing the “adjoining property”, a technique that exploits the natural ordering of sensory state spaces in many robotic applications by only allowing transitions between neighbouring states. The algorithm is tested within an image-based visual servoing framework on a docking task, in which the robot has to position its gripper at a desired configuration relative to an object on a table. In experiments, we compare the performance of the new algorithm with a hand-designed linear controller and a scheme using the linear controller as a bias to further accelerate the learning. By analysis of the controllability and docking time, we show that the biased learner could improve on the performance of the linear controller, while requiring substantially lower training time than unbiased learning (less than 1 hour on the real robot)

University of Lincoln Institutional Repository

Crossref

Vision-based reinforcement learning using approximate policy iteration

Author: Duckett Tom
Shaker Marwan
Yue Shigang
Publication venue
Publication date: 01/01/2009
Field of study

A major issue for reinforcement learning (RL) applied to robotics is the time required to learn a new skill. While RL has been used to learn mobile robot control in many simulated domains, applications involving learning on real robots are still relatively rare. In this paper, the Least-Squares Policy Iteration (LSPI) reinforcement learning algorithm and a new model-based algorithm Least-Squares Policy Iteration with Prioritized Sweeping (LSPI+), are implemented on a mobile robot to acquire new skills quickly and efficiently. LSPI+ combines the benefits of LSPI and prioritized sweeping, which uses all previous experience to focus the computational effort on the most “interesting” or dynamic parts of the state space. The proposed algorithms are tested on a household vacuum cleaner robot for learning a docking task using vision as the only sensor modality. In experiments these algorithms are compared to other model-based and model-free RL algorithms. The results show that the number of trials required to learn the docking task is significantly reduced using LSPI compared to the other RL algorithms investigated, and that LSPI+ further improves on the performance of LSPI

University of Lincoln Institutional Repository

CiteSeerX

Visual Servoing from Deep Neural Networks

Author: Bateux Quentin
Chaumette Francois
Corke Peter
Leitner Jürgen
Marchand Eric
Publication venue
Publication date: 01/01/2017
Field of study

We present a deep neural network-based method to perform high-precision, robust and real-time 6 DOF visual servoing. The paper describes how to create a dataset simulating various perturbations (occlusions and lighting conditions) from a single real-world image of the scene. A convolutional neural network is fine-tuned using this dataset to estimate the relative pose between two images of the same scene. The output of the network is then employed in a visual servoing control scheme. The method converges robustly even in difficult real-world settings with strong lighting variations and occlusions.A positioning error of less than one millimeter is obtained in experiments with a 6 DOF robot.Comment: fixed authors lis

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Queensland University of Technology ePrints Archive

HAL-Rennes 1

Learning to reach by reinforcement learning using a receptive field based function approximation approach with continuous actions

Author: A Arleo
B Espiau
C Breazeal
CJ Watkins
DJ Foster
F Chaumette
F Chaumette
F Wörgötter
Florentin Wörgötter
G Tesauro
GJ Gordon
J Peters
J Soechting
J Soechting
M Moussa
M Moussa
M Tamosiunaite
Minija Tamosiunaite
R Dillmann
R Horaud
R Sutton
RJ Williams
RS Sutton
RS Sutton
T Strösslin
Tamim Asfour
V Ruis de Angulo
Publication venue: Springer-Verlag
Publication date: 01/01/2009
Field of study

Reinforcement learning methods can be used in robotics applications especially for specific target-oriented problems, for example the reward-based recalibration of goal directed actions. To this end still relatively large and continuous state-action spaces need to be efficiently handled. The goal of this paper is, thus, to develop a novel, rather simple method which uses reinforcement learning with function approximation in conjunction with different reward-strategies for solving such problems. For the testing of our method, we use a four degree-of-freedom reaching problem in 3D-space simulated by a two-joint robot arm system with two DOF each. Function approximation is based on 4D, overlapping kernels (receptive fields) and the state-action space contains about 10,000 of these. Different types of reward structures are being compared, for example, reward-on- touching-only against reward-on-approach. Furthermore, forbidden joint configurations are punished. A continuous action space is used. In spite of a rather large number of states and the continuous action space these reward/punishment strategies allow the system to find a good solution usually within about 20 trials. The efficiency of our method demonstrated in this test scenario suggests that it might be possible to use it on a real robot for problems where mixed rewards can be defined in situations where other types of learning might be difficult

Crossref

Springer - Publisher Connector

Vytautas Magnus University Institutional Repository (VMU ePub)

PubMed Central

A review of aerial manipulation of small-scale rotorcraft unmanned robotic systems

Author: Ding X
Guo P
Xu K.
Yu Yushu
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

Small-scale rotorcraft unmanned robotic systems (SRURSs) are a kind of unmanned rotorcraft with manipulating devices. This review aims to provide an overview on aerial manipulation of SRURSs nowadays and promote relative research in the future. In the past decade, aerial manipulation of SRURSs has attracted the interest of researchers globally. This paper provides a literature review of the last 10 years (2008–2017) on SRURSs, and details achievements and challenges. Firstly, the definition, current state, development, classification, and challenges of SRURSs are introduced. Then, related papers are organized into two topical categories: mechanical structure design, and modeling and control. Following this, research groups involved in SRURS research and their major achievements are summarized and classified in the form of tables. The research groups are introduced in detail from seven parts. Finally, trends and challenges are compiled and presented to serve as a resource for researchers interested in aerial manipulation of SRURSs. The problem, trends, and challenges are described from three aspects. Conclusions of the paper are presented, and the future of SRURSs is discussed to enable further research interests

Directory of Open Access Journals

Chalmers Research

Recommended from our members

Visual Dynamics Models for Robotic Planning and Control

Author: Lee Alex Xavier
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

For a robot to interact with its environment, it must perceive the world and understand how the world evolves as a consequence of its actions. This thesis studies a few methods that a robot can use to respond to its observations, with a focus on instances that can leverage visual dynamic models. In general, these are models of how the visual observations of a robot evolves as a consequence of its actions. This could be in the form of predictive models that directly predict the future in the space of image pixels, in the space of visual features extracted from these images, or in the space of compact learned latent representations. The three instances that this thesis studies are in the context of visual servoing, visual planning, and representation learning for reinforcement learning. In the first case, we combine learned visual features with learning single-step predictive dynamics models and reinforcement learning to learn visual servoing mechanisms. In the second case, we use a deterministic multi-step video prediction model to achieve various manipulation tasks through visual planning. In addition, we show that conventional video prediction models are unequipped to model uncertainty and multiple futures, which could limit the planning capabilities of the robot. To address this, we propose a stochastic video prediction model that is trained with a combination of variational losses, adversarial losses, and perceptual losses, and show that this model can predict futures that are more realistic, diverse, and accurate. Unlike the first two cases, in which the dynamics model is used to make predictions for decision-making, the third case learns the model solely for representation learning. We learn a stochastic sequential latent variable model to learn a latent representation, and then use it as an intermediate representation for reinforcement learning. We show that this approach improves final performance and sample efficiency

eScholarship - University of California

Survey of Visual and Force/Tactile Control of Robots for Physical Interaction in Spain

Author: Arbib
Bachiller
Bruyninckx
Cervera
Cervera
Cervera
Chaumette
Chaumette
Chaumette
Christensen
Cutkosky
Dahiya
De Fazio
Fernando Torres
Fraile
Gabriel Garcia
Galvez
Galvez
Garcia
Garcia
Garcia
Garcia
Gil
Hogan
Howe
Hutchinson
Isard
Jimenez
Jinjun
Jorge Pomares
Juan Corrales
Kobayashi
Kopacek
Lee
Lopez-Coronado
López-Nicolás
Maldonado-Lopez
Malis
Mason
Mejias
Merino
Mezouar
Nabulsi
Nickels
Ortiz
Papanikolopoulos
Patarinski
Payo
Pedreno-Molina
Perez-Vidal
Pomares
Pomares
Pomares
Prats
Puangmali
Raibert
Schramm
Sebastian
Shirai
Tegin
Valera
Vargas
Villani
Wells
Xie
Publication venue: Molecular Diversity Preservation International (MDPI)
Publication date: 01/01/2009
Field of study

Sensors provide robotic systems with the information required to perceive the changes that happen in unstructured environments and modify their actions accordingly. The robotic controllers which process and analyze this sensory information are usually based on three types of sensors (visual, force/torque and tactile) which identify the most widespread robotic control strategies: visual servoing control, force control and tactile control. This paper presents a detailed review on the sensor architectures, algorithmic techniques and applications which have been developed by Spanish researchers in order to implement these mono-sensor and multi-sensor controllers which combine several sensors

Repositorio Institucional de la Universidad de Alicante

Multidisciplinary Digital Publishing Institute

CiteSeerX

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

PubMed Central

A direct visual servoing scheme for automatic nanopositioning.

Author: Le Fort - Piat Nadine
Marchand Eric
Tamadazte Brahim
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

International audienceThis paper demonstrates an accurate nanopositioning scheme based on a direct visual servoing process. This technique uses only the pure image signal (photometric information) to design the visual servoing control law. With respect to traditional visual servoing approaches that use geometric visual features (points, lines ...), the visual features used in the control law is the pixel intensity. The proposed approach has been tested in term of accuracy and robustness in several experimental conditions. The obtained results have demonstrated a good behavior of the control law and very good positioning accuracy. The obtained accuracies are 89 nm, 14 nm, and 0.001 degrees in the x, y and axes of a positioning platform, respectively

HAL-CentraleSupelec

HAL - Université de Franche-Comté

INRIA a CCSD electronic archive server

HAL-Rennes 1