Search CORE

17,143 research outputs found

TALplanner in IPC-2002: Extensions and Control Rules

Author: Kvarnström J.
Magnusson M.
Publication venue: 'AI Access Foundation'
Publication date: 26/06/2011
Field of study

TALplanner is a forward-chaining planner that relies on domain knowledge in the shape of temporal logic formulas in order to prune irrelevant parts of the search space. TALplanner recently participated in the third International Planning Competition, which had a clear emphasis on increasing the complexity of the problem domains being used as benchmark tests and the expressivity required to represent these domains in a planning system. Like many other planners, TALplanner had support for some but not all aspects of this increase in expressivity, and a number of changes to the planner were required. After a short introduction to TALplanner, this article describes some of the changes that were made before and during the competition. We also describe the process of introducing suitable domain knowledge for several of the competition domains

arXiv.org e-Print Archive

Crossref

A Deep Hierarchical Approach to Lifelong Learning in Minecraft

Author: Givony Shahar
Mankowitz Daniel J.
Mannor Shie
Tessler Chen
Zahavy Tom
Publication venue
Publication date: 30/11/2016
Field of study

We propose a lifelong learning system that has the ability to reuse and transfer knowledge from one task to another while efficiently retaining the previously learned knowledge-base. Knowledge is transferred by learning reusable skills to solve tasks in Minecraft, a popular video game which is an unsolved and high-dimensional lifelong learning problem. These reusable skills, which we refer to as Deep Skill Networks, are then incorporated into our novel Hierarchical Deep Reinforcement Learning Network (H-DRLN) architecture using two techniques: (1) a deep skill array and (2) skill distillation, our novel variation of policy distillation (Rusu et. al. 2015) for learning skills. Skill distillation enables the HDRLN to efficiently retain knowledge and therefore scale in lifelong learning, by accumulating knowledge and encapsulating multiple reusable skills into a single distilled network. The H-DRLN exhibits superior performance and lower learning sample complexity compared to the regular Deep Q Network (Mnih et. al. 2015) in sub-domains of Minecraft

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Substructure and Boundary Modeling for Continuous Action Recognition

Author: Huang Thomas
Lin Kai-Hsiang
Wang Jinjun
Wang Zhaowen
Xiao Jing
Publication venue
Publication date: 01/01/2012
Field of study

This paper introduces a probabilistic graphical model for continuous action recognition with two novel components: substructure transition model and discriminative boundary model. The first component encodes the sparse and global temporal transition prior between action primitives in state-space model to handle the large spatial-temporal variations within an action class. The second component enforces the action duration constraint in a discriminative way to locate the transition boundaries between actions more accurately. The two components are integrated into a unified graphical structure to enable effective training and inference. Our comprehensive experimental results on both public and in-house datasets show that, with the capability to incorporate additional information that had not been explicitly or efficiently modeled by previous methods, our proposed algorithm achieved significantly improved performance for continuous action recognition.Comment: Detailed version of the CVPR 2012 paper. 15 pages, 6 figure

arXiv.org e-Print Archive

CiteSeerX

Simultaneous Learning of Nonlinear Manifold and Dynamical Models for High-dimensional Time Series

Author: Li Rui
Sclaroff Stan
Tian Tai-Peng
Publication venue: Boston University Computer Science Department
Publication date: 01/01/2007
Field of study

The goal of this work is to learn a parsimonious and informative representation for high-dimensional time series. Conceptually, this comprises two distinct yet tightly coupled tasks: learning a low-dimensional manifold and modeling the dynamical process. These two tasks have a complementary relationship as the temporal constraints provide valuable neighborhood information for dimensionality reduction and conversely, the low-dimensional space allows dynamics to be learnt efficiently. Solving these two tasks simultaneously allows important information to be exchanged mutually. If nonlinear models are required to capture the rich complexity of time series, then the learning problem becomes harder as the nonlinearities in both tasks are coupled. The proposed solution approximates the nonlinear manifold and dynamics using piecewise linear models. The interactions among the linear models are captured in a graphical model. By exploiting the model structure, efficient inference and learning algorithms are obtained without oversimplifying the model of the underlying dynamical process. Evaluation of the proposed framework with competing approaches is conducted in three sets of experiments: dimensionality reduction and reconstruction using synthetic time series, video synthesis using a dynamic texture database, and human motion synthesis, classification and tracking on a benchmark data set. In all experiments, the proposed approach provides superior performance.National Science Foundation (IIS 0308213, IIS 0329009, CNS 0202067

CiteSeerX

Boston University Institutional Repository (OpenBU)

How Life Experience Shapes Cognitive Control Strategies: The Case of Air Traffic Control Training

Author: Arbula Sandra
Capizzi Mariagrazia
Lombardo Nicoletta
Vallesi Antonino
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2016
Field of study

Although human flexible behavior relies on cognitive control, it would be implausible to assume that there is only one, general mode of cognitive control strategy adopted by all individuals. For instance, different reliance on proactive versus reactive control strategies could explain inter-individual variability. In particular, specific life experiences, like a highly demanding training for future Air Traffic Controllers (ATCs), could modulate cognitive control functions. A group of ATC trainees and a matched group of university students were tested longitudinally on task-switching and Stroop paradigms that allowed us to measure indices of cognitive control. The results showed that the ATCs, with respect to the control group, had substantially smaller mixing costs during long cue-target intervals (CTI) and a reduced Stroop interference effect. However, this advantage was present also prior to the training phase. Being more capable in managing multiple task sets and less distracted by interfering events suggests a more efficient selection and maintenance of task relevant information as an inherent characteristic of the ATC group, associated with proactive control. Critically, the training that the ATCs underwent improved their accuracy in general and reduced response time switching costs during short CTIs only. These results indicate a training-induced change in reactive control, which is described as a transient process in charge of stimulus-driven task detection and resolution. This experience-based enhancement of reactive control strategy denotes how cognitive control and executive functions in general can be shaped by real-life training and underlines the importance of experience in explaining inter-individual variability in cognitive functioning

Directory of Open Access Journals

PubMed Central

Archivio istituzionale della ricerca - Università di Padova

FigShare