Search CORE

452,919 research outputs found

Model-based Reinforcement Learning with Parametrized Physical Models and Optimism-Driven Exploration

Author: Abbeel Pieter
Levine Sergey
Moldovan Teodor
Patil Sachin
Xie Christopher
Publication venue
Publication date: 15/03/2016
Field of study

In this paper, we present a robotic model-based reinforcement learning method that combines ideas from model identification and model predictive control. We use a feature-based representation of the dynamics that allows the dynamics model to be fitted with a simple least squares procedure, and the features are identified from a high-level specification of the robot's morphology, consisting of the number and connectivity structure of its links. Model predictive control is then used to choose the actions under an optimistic model of the dynamics, which produces an efficient and goal-directed exploration strategy. We present real time experimental results on standard benchmark problems involving the pendulum, cartpole, and double pendulum systems. Experiments indicate that our method is able to learn a range of benchmark tasks substantially faster than the previous best methods. To evaluate our approach on a realistic robotic control task, we also demonstrate real time control of a simulated 7 degree of freedom arm.Comment: 8 page

arXiv.org e-Print Archive

CiteSeerX

Crossref

Semantic Robot Programming for Goal-Directed Manipulation in Cluttered Scenes

Author: Jenkins Odest Chadwicke
Sui Zhiqiang
Zeng Zhen
Zhou Zheming
Publication venue
Publication date: 18/10/2018
Field of study

We present the Semantic Robot Programming (SRP) paradigm as a convergence of robot programming by demonstration and semantic mapping. In SRP, a user can directly program a robot manipulator by demonstrating a snapshot of their intended goal scene in workspace. The robot then parses this goal as a scene graph comprised of object poses and inter-object relations, assuming known object geometries. Task and motion planning is then used to realize the user's goal from an arbitrary initial scene configuration. Even when faced with different initial scene configurations, SRP enables the robot to seamlessly adapt to reach the user's demonstrated goal. For scene perception, we propose the Discriminatively-Informed Generative Estimation of Scenes and Transforms (DIGEST) method to infer the initial and goal states of the world from RGBD images. The efficacy of SRP with DIGEST perception is demonstrated for the task of tray-setting with a Michigan Progress Fetch robot. Scene perception and task execution are evaluated with a public household occlusion dataset and our cluttered scene dataset.Comment: published in ICRA 201

arXiv.org e-Print Archive

Crossref

The integrated theory of emotional behavior follows a radically goal-directed approach

Author: Moors Agnes
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2017
Field of study

status: publishe

Lirias

Ghent University Academic Bibliography

Speech Styles Used by Young Female and Male Teachers in Teaching English to Their Older Students

Author: Kuntjara E. H. (Esther)
Ronsumbre F. N. (Febriani)
Publication venue: 'Petra Christian University'
Publication date: 01/01/2015
Field of study

This study discussed about the speech styles used by a young female and a young male teacher in teaching English to their older students, the tendencies of using the features of each speech styles, and the use of opposite gendered speech styles. Through the classroom observation, the findings showed that the young female teacher applied six features of female speech styles and five features of male speech styles while the findings of theyoung male teacher showed that he only applied five features of male speech styles and applied six features of female speech styles.From the findings, it could be concluded that the most dominant speech styles used by the young female teacher is ‘Female Speech Style' and the most dominant feature is ‘Co-operative'. And, the most dominant speech styles used by the young male teacher is ‘Male Speech Style' and the most dominant feature is ‘Co-operative' feature of female speech styl

Neliti

Learning Deployable Navigation Policies at Kilometer Scale from a Single Traversal

Author: Bruce Jake
Hadsell Raia
Milford Michael
Mirowski Piotr
Sünderhauf Niko
Publication venue
Publication date: 01/01/2018
Field of study

Model-free reinforcement learning has recently been shown to be effective at learning navigation policies from complex image input. However, these algorithms tend to require large amounts of interaction with the environment, which can be prohibitively costly to obtain on robots in the real world. We present an approach for efficiently learning goal-directed navigation policies on a mobile robot, from only a single coverage traversal of recorded data. The navigation agent learns an effective policy over a diverse action space in a large heterogeneous environment consisting of more than 2km of travel, through buildings and outdoor regions that collectively exhibit large variations in visual appearance, self-similarity, and connectivity. We compare pretrained visual encoders that enable precomputation of visual embeddings to achieve a throughput of tens of thousands of transitions per second at training time on a commodity desktop computer, allowing agents to learn from millions of trajectories of experience in a matter of hours. We propose multiple forms of computationally efficient stochastic augmentation to enable the learned policy to generalise beyond these precomputed embeddings, and demonstrate successful deployment of the learned policy on the real robot without fine tuning, despite environmental appearance differences at test time. The dataset and code required to reproduce these results and apply the technique to other datasets and robots is made publicly available at rl-navigation.github.io/deployable

arXiv.org e-Print Archive

Queensland University of Technology ePrints Archive

Goal-Directed Behavior under Variational Predictive Coding: Dynamic Organization of Visual Attention and Working Memory

Author: Jung Minju
Matsumoto Takazumi
Tani Jun
Publication venue
Publication date: 12/03/2019
Field of study

Mental simulation is a critical cognitive function for goal-directed behavior because it is essential for assessing actions and their consequences. When a self-generated or externally specified goal is given, a sequence of actions that is most likely to attain that goal is selected among other candidates via mental simulation. Therefore, better mental simulation leads to better goal-directed action planning. However, developing a mental simulation model is challenging because it requires knowledge of self and the environment. The current paper studies how adequate goal-directed action plans of robots can be mentally generated by dynamically organizing top-down visual attention and visual working memory. For this purpose, we propose a neural network model based on variational Bayes predictive coding, where goal-directed action planning is formulated by Bayesian inference of latent intentional space. Our experimental results showed that cognitively meaningful competencies, such as autonomous top-down attention to the robot end effector (its hand) as well as dynamic organization of occlusion-free visual working memory, emerged. Furthermore, our analysis of comparative experiments indicated that introduction of visual working memory and the inference mechanism using variational Bayes predictive coding significantly improve the performance in planning adequate goal-directed actions

arXiv.org e-Print Archive

Crossref

OIST Institutional Repository