6 research outputs found

    Towards Better Interpretability in Deep Q-Networks

    Deep reinforcement learning techniques have demonstrated superior performance in a wide variety of environments. As improvements in training algorithms continue at a brisk pace, theoretical and empirical studies of what these networks actually learn lag far behind. In this paper we propose an interpretable neural network architecture for Q-learning which provides a global explanation of the model's behavior using key-value memories, attention, and reconstructible embeddings. With a directed exploration strategy, our model can reach training rewards comparable to the state-of-the-art deep Q-learning models. However, results suggest that the features extracted by the neural network are extremely shallow, and subsequent testing on out-of-sample examples shows that the agent easily overfits to trajectories seen during training. Comment: Accepted at AAAI-19 (16 pages, 18 figures).
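    The attention-over-memory idea in this abstract can be illustrated with a toy sketch. The function below is a hypothetical simplification, not the paper's actual architecture: a state embedding attends over a learned key-value memory, and the attention-weighted value slots yield the Q-values. All names and shapes here are illustrative assumptions.

```python
import numpy as np

def attention_q_values(state_embedding, memory_keys, memory_values):
    """Toy sketch (hypothetical names/shapes) of Q-values computed by
    softmax attention over a key-value memory.

    state_embedding: (d,)            embedded observation
    memory_keys:     (m, d)          learned keys
    memory_values:   (m, n_actions)  learned per-slot Q-value rows
    Returns attention-weighted Q-values, shape (n_actions,).
    """
    scores = memory_keys @ state_embedding      # similarity to each key, (m,)
    weights = np.exp(scores - scores.max())     # numerically stable softmax
    weights /= weights.sum()                    # attention distribution over slots
    return weights @ memory_values              # blend of value rows, (n_actions,)

# Illustrative call with random parameters.
rng = np.random.default_rng(0)
q = attention_q_values(rng.normal(size=8),
                       rng.normal(size=(4, 8)),
                       rng.normal(size=(4, 3)))
print(q.shape)  # (3,)
```

    Because every Q-value is an attention-weighted blend of a small number of memory slots, inspecting the attention weights gives the kind of global, per-slot explanation of behavior the abstract describes.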

    Deep Learning, transparency and trust in Human Robot Teamwork

    For autonomous AI systems to be accepted and trusted, users should be able to understand the system's reasoning process (i.e., the system should be transparent). Robotics presents unique programming difficulties in that systems need to map from complicated sensor inputs, such as camera feeds and laser scans, to outputs such as joint angles and velocities. Advances in deep neural networks now make it possible to replace laborious handcrafted features and control code by learning control policies directly from high-dimensional sensor inputs. Because Atari games, where these capabilities were first demonstrated, replicate the robotics problem, they are ideal for investigating how humans might come to understand and interact with agents that have not been explicitly programmed. We present computational and human results for making deep reinforcement learning networks (DRLNs) more transparent using object saliency visualizations of internal states, and we test the effectiveness of expressing saliency through teleological verbal explanations.
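    One common way to compute saliency maps of the kind this abstract mentions is perturbation-based: occlude a region of the input frame and measure how much the agent's output changes. The sketch below is a minimal illustration under that assumption; `value_fn` is a hypothetical stand-in for the agent's value or Q output, not the paper's actual method.

```python
import numpy as np

def occlusion_saliency(frame, value_fn, patch=4):
    """Hedged sketch of perturbation-based saliency (illustrative, not
    the paper's exact technique). Occlude each patch of the frame with
    the frame's mean value and record the change in the agent's output;
    large changes mark regions the agent's decision depends on.

    frame:    (h, w) grayscale observation
    value_fn: callable mapping a frame to a scalar agent output
    Returns a (h // patch, w // patch) saliency map.
    """
    base = value_fn(frame)
    h, w = frame.shape
    sal = np.zeros((h // patch, w // patch))
    for i in range(0, h, patch):
        for j in range(0, w, patch):
            occluded = frame.copy()
            occluded[i:i + patch, j:j + patch] = frame.mean()  # blur out one patch
            sal[i // patch, j // patch] = abs(value_fn(occluded) - base)
    return sal
```

    Upsampling the resulting map and overlaying it on the frame yields the kind of visualization a human teammate can inspect, which is what makes saliency a candidate bridge between the network's internal state and a verbal explanation.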

    Automated Deduction – CADE 28

    This open access book constitutes the proceedings of the 28th International Conference on Automated Deduction, CADE 28, held virtually in July 2021. The 29 full papers and 7 system descriptions presented together with 2 invited papers were carefully reviewed and selected from 76 submissions. CADE is the major forum for the presentation of research in all aspects of automated deduction, including foundations, applications, implementations, and practical experience. The papers are organized under the following topics: logical foundations; theory and principles; implementation and application; ATP and AI; and system descriptions.