Search CORE

39,023 research outputs found

Safety-Aware Apprenticeship Learning

Author: A Solar-Lezama
H Hansson
M Kwiatkowska
M Kwiatkowska
M Kwiatkowska
R Bellman
R Wimmer
S Jha
S Junges
T Han
Y-J Kuo
Publication venue
Publication date: 28/04/2018
Field of study

Apprenticeship learning (AL) is a kind of Learning from Demonstration techniques where the reward function of a Markov Decision Process (MDP) is unknown to the learning agent and the agent has to derive a good policy by observing an expert's demonstrations. In this paper, we study the problem of how to make AL algorithms inherently safe while still meeting its learning objective. We consider a setting where the unknown reward function is assumed to be a linear combination of a set of state features, and the safety property is specified in Probabilistic Computation Tree Logic (PCTL). By embedding probabilistic model checking inside AL, we propose a novel counterexample-guided approach that can ensure safety while retaining performance of the learnt policy. We demonstrate the effectiveness of our approach on several challenging AL scenarios where safety is essential.Comment: Accepted by International Conference on Computer Aided Verification (CAV) 201

arXiv.org e-Print Archive

Safety-aware apprenticeship learning

Author: Zhou Weichao
Publication venue
Publication date: 03/07/2018
Field of study

It is well acknowledged in the AI community that finding a good reward function for reinforcement learning is extremely challenging. Apprenticeship learning (AL) is a class of “learning from demonstration” techniques where the reward function of a Markov Decision Process (MDP) is unknown to the learning agent and the agent uses inverse reinforcement learning (IRL) methods to recover expert policy from a set of expert demonstrations. However, as the agent learns exclusively from observations, given a constraint on the probability of the agent running into unwanted situations, there is no verification, nor guarantee, for the learnt policy on the satisfaction of the restriction. In this dissertation, we study the problem of how to guide AL to learn a policy that is inherently safe while still meeting its learning objective. By combining formal methods with imitation learning, a Counterexample-Guided Apprenticeship Learning algorithm is proposed. We consider a setting where the unknown reward function is assumed to be a linear combination of a set of state features, and the safety property is specified in Probabilistic Computation Tree Logic (PCTL). By embedding probabilistic model checking inside AL, we propose a novel counterexample-guided approach that can ensure both safety and performance of the learnt policy. This algorithm guarantees that given some formal safety specification defined by probabilistic temporal logic, the learnt policy shall satisfy this specification. We demonstrate the effectiveness of our approach on several challenging AL scenarios where safety is essential

Boston University Institutional Repository (OpenBU)

Learning from the best: examples of best practice from providers of apprenticeships in under performing vocational areas

Author
Publication venue: Office for Standards in Education, Children’s Services and Skills (OFSTED)
Publication date: 01/01/2010
Field of study

Apprenticeship in supporting teaching and learning in schools: framework issue number 3.4 (interim)

Author
Publication venue: Training and Development Agency for Schools (TDA)
Publication date: 01/01/2010
Field of study

Quality and standards in education and training in Wales: a report on the quality of work-based learning and Jobcentre Plus programmes in Icon Vocational Training

Author
Publication venue: Estyn
Publication date
Field of study

Exeter College : Training Standards Council inspection report 2000; Adult Learning Inspectorate re-inspection November 2001

Author
Publication venue: Training Standards Council
Publication date: 01/01/2001
Field of study

Defining and measuring training activity

Author: Davies Ben
Gore Katie
Manson Ken
Newton John
Shury Jan
Winterbotham Mark
Publication venue: UK Commission for Employment and Skills
Publication date: 01/01/2011
Field of study

Understanding employer networks

Author: Breuer Zoey
Cox Annette
Garrett Richard
Higgins Tom
Marangozov Rachel
Publication venue: UK Commission for Employment and Skills
Publication date: 01/01/2013
Field of study

Student apprenticeship evaluation

Author: McCoshan Andrew
Williams Jenny
Publication venue: DfES
Publication date: 01/01/2002
Field of study

Career Development Program for Refugee and Migrant Youth

Author: Gallegos D.
Tilbury F.
Publication venue: Murdoch University
Publication date: 01/01/2006
Field of study

The Career Guidance for Refugee and Migrant Young People project is an initiative of the South Metropolitan Migrant Resource Centre funded by the Department of Education and Training. It aims to develop, pilot and evaluate a career development and planning program that specifically meets the learning levels and needs of refugee youth with low levels of education, cultural life skills and English language ability

Research Repository