74,847 research outputs found

    A comparative study between motivated learning and reinforcement learning

    This paper analyzes advanced reinforcement learning techniques and compares some of them to motivated learning. Motivated learning is briefly discussed, indicating its relation to reinforcement learning. A black-box scenario for comparative analysis of learning efficiency in autonomous agents is developed and described, and is then used to analyze selected algorithms. Reported results demonstrate that, in the selected category of problems, motivated learning outperformed all of the reinforcement learning algorithms against which it was compared.
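    The abstract does not spell out the black-box comparison protocol; the sketch below is only a minimal illustration of how such a harness could be wired up, assuming a hypothetical `Agent` interface and environment API, with average return per episode standing in for whatever learning-efficiency metric the authors actually use.

```python
# Minimal sketch of a black-box learning-efficiency harness (illustrative only;
# the Agent interface, environment API, and metric are assumptions, not the
# paper's actual protocol).
from typing import Any, Protocol


class Agent(Protocol):
    def act(self, observation: Any) -> Any: ...
    def learn(self, observation: Any, action: Any, reward: float,
              next_observation: Any, done: bool) -> None: ...


def learning_efficiency(agent: Agent, env: Any, episodes: int = 100) -> float:
    """Average return per episode, treating the agent as a black box."""
    total_return = 0.0
    for _ in range(episodes):
        obs, done = env.reset(), False
        while not done:
            action = agent.act(obs)
            next_obs, reward, done = env.step(action)
            agent.learn(obs, action, reward, next_obs, done)
            total_return += reward
            obs = next_obs
    return total_return / episodes


# Usage (illustrative): run the same environment through several learners.
# scores = {name: learning_efficiency(make_agent(name), make_env())
#           for name in ["motivated_learning", "q_learning", "ppo"]}
```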

    Psychological factors affecting equine performance

    For optimal individual performance within any equestrian discipline, horses must be in peak physical condition and have the correct psychological state. This review discusses the psychological factors that affect the performance of the horse and, in turn, identifies areas within the competition horse industry where current behavioral research and established behavioral modification techniques could be applied to further enhance the performance of animals. In particular, the role of affective processes underpinning temperament, mood and emotional reaction in determining discipline-specific performance is discussed. A comparison is then made between the training and the competition environment, and the review concludes with a discussion of how behavioral modification techniques and general husbandry can be used advantageously from a performance perspective.

    Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards

    Intrinsic rewards were introduced to simulate how human intelligence works; they are usually evaluated by intrinsically motivated play, i.e., playing games without extrinsic rewards but evaluating performance with extrinsic rewards. However, none of the existing intrinsic reward approaches can achieve human-level performance under this very challenging setting of intrinsically motivated play. In this work, we propose a novel megalomania-driven intrinsic reward (called mega-reward), which, to our knowledge, is the first approach that achieves human-level performance in intrinsically motivated play. Intuitively, mega-reward comes from the observation that infants' intelligence develops when they try to gain more control over entities in an environment; therefore, mega-reward aims to maximize the control capabilities of agents over given entities in a given environment. To formalize mega-reward, a relational transition model is proposed to bridge the gap between direct and latent control. Experimental studies show that mega-reward (i) greatly outperforms all state-of-the-art intrinsic reward approaches, (ii) generally achieves the same level of performance as Ex-PPO and professional human-level scores, and (iii) also performs better when it is combined with extrinsic rewards.
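    The relational transition model is not described here in enough detail to reproduce; as a rough sketch only, the snippet below shows the generic plumbing of mixing an intrinsic bonus with an optional extrinsic reward, with `intrinsic_reward` left as a hypothetical placeholder for the control-based signal mega-reward actually computes.

```python
import numpy as np


def intrinsic_reward(obs: np.ndarray, next_obs: np.ndarray) -> float:
    """Hypothetical placeholder for a control-based intrinsic bonus.

    In mega-reward this signal would come from a relational transition model
    estimating the agent's control over entities; here a dummy observation
    change magnitude stands in purely to illustrate the interface.
    """
    return float(np.linalg.norm(next_obs - obs))


def shaped_reward(extrinsic: float, obs: np.ndarray, next_obs: np.ndarray,
                  use_extrinsic: bool = False, beta: float = 1.0) -> float:
    """Combine the intrinsic bonus with an optional extrinsic reward.

    In intrinsically motivated play the extrinsic term is dropped during
    training and only consulted at evaluation time.
    """
    reward = beta * intrinsic_reward(obs, next_obs)
    if use_extrinsic:
        reward += extrinsic
    return reward
```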

    SOVEREIGN: An Autonomous Neural System for Incrementally Learning Planned Action Sequences to Navigate Towards a Rewarded Goal

    How do reactive and planned behaviors interact in real time? How are sequences of such behaviors released at appropriate times during autonomous navigation to realize valued goals? Controllers for both animals and mobile robots, or animats, need reactive mechanisms for exploration and learned plans to reach goal objects once an environment becomes familiar. The SOVEREIGN (Self-Organizing, Vision, Expectation, Recognition, Emotion, Intelligent, Goal-oriented Navigation) animat model embodies these capabilities and is tested in a 3D virtual reality environment. SOVEREIGN includes several interacting subsystems which model complementary properties of cortical What and Where processing streams and which clarify similarities between mechanisms for navigation and arm movement control. As the animat explores an environment, visual inputs are processed by networks that are sensitive to visual form and motion in the What and Where streams, respectively. Position-invariant and size-invariant recognition categories are learned by real-time incremental learning in the What stream. Estimates of target position relative to the animat are computed in the Where stream and can activate approach movements toward the target. Motion cues from animat locomotion can elicit head-orienting movements to bring a new target into view. Approach and orienting movements are alternately performed during animat navigation. Cumulative estimates of each movement are derived from interacting proprioceptive and visual cues. Movement sequences are stored within a motor working memory. Sequences of visual categories are stored in a sensory working memory. These working memories trigger learning of sensory and motor sequence categories, or plans, which together control planned movements. Predictively effective chunk combinations are selectively enhanced via reinforcement learning when the animat is rewarded. Selected planning chunks effect a gradual transition from variable reactive exploratory movements to efficient goal-oriented planned movement sequences. Volitional signals gate interactions between model subsystems and the release of overt behaviors. The model can control different motor sequences under different motivational states and learns more efficient sequences to rewarded goals as exploration proceeds.
    Funding: Riverside Research Institute; Defense Advanced Research Projects Agency (N00014-92-J-4015); Air Force Office of Scientific Research (F49620-92-J-0225); National Science Foundation (IRI 90-24877, SBE-0345378); Office of Naval Research (N00014-92-J-1309, N00014-91-J-4100, N00014-01-1-0624); Pacific Sierra Research (PSR 91-6075-2)

    A new conceptual framework for revenge firesetting

    Revenge has frequently been acknowledged to account for a relatively large proportion of motives in deliberate firesetting. However, very little is actually known about the aetiology of revenge firesetting. Theoretical approaches to revenge-seeking behaviour are discussed, and a brief review of how revenge is accounted for in existing theoretical explanations of deliberate firesetting, together with the known characteristics of revenge firesetters, is provided. On this basis, the authors suggest that revenge firesetting as a motive has to date been misconceptualised. A new conceptual framework is thus proposed, paying particular attention to the contextual, affective, cognitive, volitional and behavioural factors which may influence and generate a single episode of revenge firesetting. Treatment implications and suggestions for future research are also provided.