Search CORE

8,199 research outputs found

Role Playing Learning for Socially Concomitant Mobile Robot Navigation

Author: Ge Shuzhi Sam
Jiang Rui
Lee Tong Heng
Li Mingming
Publication venue
Publication date: 29/05/2017
Field of study

In this paper, we present the Role Playing Learning (RPL) scheme for a mobile robot to navigate socially with its human companion in populated environments. Neural networks (NN) are constructed to parameterize a stochastic policy that directly maps sensory data collected by the robot to its velocity outputs, while respecting a set of social norms. An efficient simulative learning environment is built with maps and pedestrians trajectories collected from a number of real-world crowd data sets. In each learning iteration, a robot equipped with the NN policy is created virtually in the learning environment to play itself as a companied pedestrian and navigate towards a goal in a socially concomitant manner. Thus, we call this process Role Playing Learning, which is formulated under a reinforcement learning (RL) framework. The NN policy is optimized end-to-end using Trust Region Policy Optimization (TRPO), with consideration of the imperfectness of robot's sensor measurements. Simulative and experimental results are provided to demonstrate the efficacy and superiority of our method

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

ScholarBank@NUS

Covert Perceptual Capability Development

Author: Huang Xiao
Weng Juyang
Publication venue: Lund University Cognitive Studies
Publication date: 01/01/2005
Field of study

In this paper, we propose a model to develop robots’ covert perceptual capability using reinforcement learning. Covert perceptual behavior is treated as action selected by a motivational system. We apply this model to vision-based navigation. The goal is to enable a robot to learn road boundary type. Instead of dealing with problems in controlled environments with a low-dimensional state space, we test the model on images captured in non-stationary environments. Incremental Hierarchical Discriminant Regression is used to generate states on the fly. Its coarse-to-fine tree structure guarantees real-time retrieval in high-dimensional state space. K Nearest-Neighbor strategy is adopted to further reduce training time complexity

CogPrints Cognitive Sciences Eprint Archive

Time representation in reinforcement learning models of the basal ganglia

Author: Alvaro F. Nieto Guil
Emilio L. Malchiodi
Maria Eugenia Bernis
Mariana eOksdath
Marisa M. Fernandez
Santiago eQuiroga
Sebastian eDupraz
Silvana B. Rosso
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2013
Field of study

Reinforcement learning (RL) models have been influential in understanding many aspects of basal ganglia function, from reward prediction to action selection. Time plays an important role in these models, but there is still no theoretical consensus about what kind of time representation is used by the basal ganglia. We review several theoretical accounts and their supporting evidence. We then discuss the relationship between RL models and the timing mechanisms that have been attributed to the basal ganglia. We hypothesize that a single computational system may underlie both RL and interval timing—the perception of duration in the range of seconds to hours. This hypothesis, which extends earlier models by incorporating a time-sensitive action selection mechanism, may have important implications for understanding disorders like Parkinson's disease in which both decision making and timing are impaired

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

DSpace@MIT

CONICET Digital

Directory of Open Access Journals

Frontiers - Publisher Connector

PubMed Central

Warwick Research Archives Portal Repository

Western Sydney ResearchDirect

Optimal inference with suboptimal models:Addiction and active Bayesian inference

Author: Adams
Adams
Bechara
Beck
Beck
Belin
Berke
Berridge
Berridge
Bickel
Bishop
Blum
Blum
Blum
Bowers
Bowers
Brighton
Cain
Charness
Christoph Mathys
Clark
Clark
Comings
Conant
Daunizeau
Daw
Dayan
Dayan
Diana
Dolan
Doyon
Edwards
Everitt
Everitt
Eysenck
FitzGerald
Friedrich Wurst
Friston
Friston
Friston
Friston
Friston
Friston
Geisler
Gigerenzer
Gillan
Graybiel
Grether
Griffiths
Griffiths
Henden
Hommer
Jones
Kahnemann
Karl Friston
Kersten
Knill
Lee
MacKay
Mahmood
Maia
Martin Kronbichler
Miedl
Montague
Monterosso
Pellicano
Peters
Peters
Philipp Schwartenbeck
Pouget
Ray Dolan
Redish
Redish
Robinson
Schwartenbeck
Simon
Simon
Stephan
Story
Sutton
Taber
Tenenbaum
Thomas H.B. FitzGerald
Tversky
Tversky
Weiss
Whalley
Willuhn
Wolpert
Yu
Publication venue: 'Elsevier BV'
Publication date: 15/12/2014
Field of study

When casting behaviour as active (Bayesian) inference, optimal inference is defined with respect to an agent's beliefs - based on its generative model of the world. This contrasts with normative accounts of choice behaviour, in which optimal actions are considered in relation to the true structure of the environment - as opposed to the agent's beliefs about worldly states (or the task). This distinction shifts an understanding of suboptimal or pathological behaviour away from aberrant inference as such, to understanding the prior beliefs of a subject that cause them to behave less 'optimally' than our prior beliefs suggest they should behave. Put simply, suboptimal or pathological behaviour does not speak against understanding behaviour in terms of (Bayes optimal) inference, but rather calls for a more refined understanding of the subject's generative model upon which their (optimal) Bayesian inference is based. Here, we discuss this fundamental distinction and its implications for understanding optimality, bounded rationality and pathological (choice) behaviour. We illustrate our argument using addictive choice behaviour in a recently described 'limited offer' task. Our simulations of pathological choices and addictive behaviour also generate some clear hypotheses, which we hope to pursue in ongoing empirical work

Crossref

UCL Discovery

PubMed Central

Sissa Digital Library

University of East Anglia digital repository

MPG.PuRe