Search CORE

13,097 research outputs found

Metric State Space Reinforcement Learning for a Vision-Capable Mobile Robot

Author: Gomez Faustino
Hutter Marcus
Schmidhuber Juergen
Zhumatiy Viktor
Publication venue
Publication date: 01/01/2006
Field of study

We address the problem of autonomously learning controllers for vision-capable mobile robots. We extend McCallum's (1995) Nearest-Sequence Memory algorithm to allow for general metrics over state-action trajectories. We demonstrate the feasibility of our approach by successfully running our algorithm on a real mobile robot. The algorithm is novel and unique in that it (a) explores the environment and learns directly on a mobile robot without using a hand-made computer model as an intermediate step, (b) does not require manual discretization of the sensor input space, (c) works in piecewise continuous perceptual spaces, and (d) copes with partial observability. Together this allows learning from much less experience compared to previous methods.Comment: 14 pages, 8 figure

arXiv.org e-Print Archive

CiteSeerX

The Australian National University

Airborne chemical sensing with mobile robots

Author: Achim Lilienthal
Albert
Amy Loutfi
Arshak
Atema
Blumenthal
Braitenberg
Carey
Deveza
Dickinson
Duda
Eiben
Engelberger
Gardner
Gardner
Hangartner
Haug
Hayes
Hayes
Hinze
Ishida
Ishida
Ishida
Ishida
Ishida
Ishida
Ishida
Ishida
Ishida
Ishida
Kaneyasu
Kanzaki
Kanzaki
Kleperis
Kolb
Kreutz
Kuwana
Larionova
Lilienthal
Lilienthal
Lilienthal
Loutfi
Loutfi
Martinez
Moncrieff
Munoz
Murlis
Nagle
Nakamoto
Nakamoto
Olafsson
Organisation
Partridge
Parzen
Pearce
Penza
Persaud
Pluijm
Pyk
Raub
Rechenberg
Roberts
Russell
Russell
Russell
Russell
Russell
Sauerbrey
Schaller
Schwefel
Smyth
Sudd
Tom Duckett
Townsend
Wilkens
Winquist
Publication venue: 'MDPI AG'
Publication date: 01/01/2006
Field of study

Airborne chemical sensing with mobile robots has been an active research areasince the beginning of the 1990s. This article presents a review of research work in this field,including gas distribution mapping, trail guidance, and the different subtasks of gas sourcelocalisation. Due to the difficulty of modelling gas distribution in a real world environmentwith currently available simulation techniques, we focus largely on experimental work and donot consider publications that are purely based on simulations

University of Lincoln Institutional Repository

Multidisciplinary Digital Publishing Institute

CiteSeerX

Crossref

Directory of Open Access Journals

Publikationer från Örebro universitet

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Evolutionary robotics and neuroscience

Author: Husbands Phil
Moioli Renan
O'Shea Michael
Philippides Andy
Shim Yoonsik
Vargas Patricia
Publication venue: 'MIT Press - Journals'
Publication date: 01/03/2014
Field of study

No description supplie

Heriot Watt Pure

Sussex Research Online

Beyond Gazing, Pointing, and Reaching: A Survey of Developmental Robotics

Author: Lungarella Max
Metta Giogio
Publication venue: Lund University Cognitive Studies
Publication date: 01/01/2003
Field of study

Developmental robotics is an emerging field located at the intersection of developmental psychology and robotics, that has lately attracted quite some attention. This paper gives a survey of a variety of research projects dealing with or inspired by developmental issues, and outlines possible future directions

CiteSeerX

CogPrints Cognitive Sciences Eprint Archive

Information driven self-organization of complex robotic behaviors

Author: Ay Nihat
Der Ralf
Martius Georg
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 27/03/2013
Field of study

Information theory is a powerful tool to express principles to drive autonomous systems because it is domain invariant and allows for an intuitive interpretation. This paper studies the use of the predictive information (PI), also called excess entropy or effective measure complexity, of the sensorimotor process as a driving force to generate behavior. We study nonlinear and nonstationary systems and introduce the time-local predicting information (TiPI) which allows us to derive exact results together with explicit update rules for the parameters of the controller in the dynamical systems framework. In this way the information principle, formulated at the level of behavior, is translated to the dynamics of the synapses. We underpin our results with a number of case studies with high-dimensional robotic systems. We show the spontaneous cooperativity in a complex physical system with decentralized control. Moreover, a jointly controlled humanoid robot develops a high behavioral variety depending on its physics and the environment it is dynamically embedded into. The behavior can be decomposed into a succession of low-dimensional modes that increasingly explore the behavior space. This is a promising way to avoid the curse of dimensionality which hinders learning systems to scale well.Comment: 29 pages, 12 figure

arXiv.org e-Print Archive

Directory of Open Access Journals

PubMed Central

FigShare

Autonomous Reinforcement of Behavioral Sequences in Neural Dynamics

Author: Kazerounian Sohrob
Luciw Matthew
Richter Mathis
Sandamirskaya Yulia
Publication venue
Publication date: 01/01/2012
Field of study

We introduce a dynamic neural algorithm called Dynamic Neural (DN) SARSA(\lambda) for learning a behavioral sequence from delayed reward. DN-SARSA(\lambda) combines Dynamic Field Theory models of behavioral sequence representation, classical reinforcement learning, and a computational neuroscience model of working memory, called Item and Order working memory, which serves as an eligibility trace. DN-SARSA(\lambda) is implemented on both a simulated and real robot that must learn a specific rewarding sequence of elementary behaviors from exploration. Results show DN-SARSA(\lambda) performs on the level of the discrete SARSA(\lambda), validating the feasibility of general reinforcement learning without compromising neural dynamics.Comment: Sohrob Kazerounian, Matthew Luciw are Joint first author

arXiv.org e-Print Archive

CiteSeerX

Symbol Emergence in Robotics: A Survey

Author: Asoh Hideki
Iwahashi Naoto
Nagai Takayuki
Nakamura Tomoaki
Ogata Tetsuya
Taniguchi Tadahiro
Publication venue
Publication date: 29/09/2015
Field of study

Humans can learn the use of language through physical interaction with their environment and semiotic communication with other people. It is very important to obtain a computational understanding of how humans can form a symbol system and obtain semiotic skills through their autonomous mental development. Recently, many studies have been conducted on the construction of robotic systems and machine-learning methods that can learn the use of language through embodied multimodal interaction with their environment and other systems. Understanding human social interactions and developing a robot that can smoothly communicate with human users in the long term, requires an understanding of the dynamics of symbol systems and is crucially important. The embodied cognition and social interaction of participants gradually change a symbol system in a constructive manner. In this paper, we introduce a field of research called symbol emergence in robotics (SER). SER is a constructive approach towards an emergent symbol system. The emergent symbol system is socially self-organized through both semiotic communications and physical interactions with autonomous cognitive developmental agents, i.e., humans and developmental robots. Specifically, we describe some state-of-art research topics concerning SER, e.g., multimodal categorization, word discovery, and a double articulation analysis, that enable a robot to obtain words and their embodied meanings from raw sensory--motor information, including visual information, haptic information, auditory information, and acoustic speech signals, in a totally unsupervised manner. Finally, we suggest future directions of research in SER.Comment: submitted to Advanced Robotic

arXiv.org e-Print Archive