146 research outputs found

    Iterative Policy-Space Expansion in Reinforcement Learning

    Humans and animals solve a difficult problem much more easily when they are presented with a sequence of problems that starts simple and slowly increases in difficulty. We explore this idea in the context of reinforcement learning. Rather than being given an externally provided curriculum of progressively more difficult tasks, the agent solves a single task using a decreasingly constrained policy space. The algorithm we propose first learns to categorize features into positive and negative before gradually learning a more refined policy. Experimental results in Tetris demonstrate a superior learning rate for our approach compared to existing algorithms. Comment: Workshop on Biological and Artificial Reinforcement Learning at the 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), Vancouver, Canada
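
    The two-stage idea can be illustrated with a brief sketch. The Python below is a minimal illustration under our own assumptions, not the authors' algorithm: a linear policy whose weights are first restricted to feature signs ({-1, +1}) and only later expanded to real values. The helper rollout_return, which evaluates a weight vector by playing episodes (e.g., of Tetris), is a hypothetical stand-in.

    import numpy as np

    def learn_sign_policy(n_features, rollout_return, n_iters=100, rng=None):
        # Stage 1: search only over feature signs, i.e. weights in {-1, +1}^n.
        # rollout_return(weights) is an assumed helper that scores a linear policy.
        rng = rng or np.random.default_rng()
        best = np.ones(n_features)
        best_return = rollout_return(best)
        for _ in range(n_iters):
            candidate = best.copy()
            candidate[rng.integers(n_features)] *= -1   # flip one feature's sign
            score = rollout_return(candidate)
            if score > best_return:
                best, best_return = candidate, score
        return best

    def refine_policy(signs, rollout_return, n_iters=1000, step=0.1, rng=None):
        # Stage 2: expand to real-valued weights, initialised from the learned signs.
        rng = rng or np.random.default_rng()
        weights = signs.astype(float)
        best_return = rollout_return(weights)
        for _ in range(n_iters):
            candidate = weights + step * rng.normal(size=weights.shape)
            score = rollout_return(candidate)
            if score > best_return:
                weights, best_return = candidate, score
        return weights

    Restricting the first stage to signs keeps the search space small, which is the sense in which the policy space is decreasingly constrained.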

    Using Relative Novelty to Identify Useful Temporal Abstractions in Reinforcement Learning

    We present a new method for automatically creating useful temporal abstractions in reinforcement learning. We argue that states that allow the agent to transition to a different region of the state space are useful subgoals, and propose a method for identifying them using the concept of relative novelty. When such a state is identified, a temporally extended activity (e.g., an option) is generated that takes the agent efficiently to this state. We illustrate the utility of the method in a number of tasks.
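
    A minimal sketch of the underlying idea, not the paper's exact formulation: score each state in a trajectory by the novelty of the states that follow it relative to the novelty of the states that precede it, and flag high-scoring states as candidate subgoals. The novelty measure (1/sqrt of visit count) and the window and threshold values are illustrative choices.

    from collections import defaultdict
    import math

    visit_counts = defaultdict(int)

    def novelty(state):
        # Less-visited states are more novel.
        return 1.0 / math.sqrt(visit_counts[state])

    def candidate_subgoals(trajectory, window=5, threshold=2.0):
        for state in trajectory:
            visit_counts[state] += 1
        candidates = []
        for t in range(window, len(trajectory) - window):
            before = sum(novelty(s) for s in trajectory[t - window:t]) / window
            after = sum(novelty(s) for s in trajectory[t + 1:t + 1 + window]) / window
            if after / before > threshold:   # relative novelty of what follows
                candidates.append(trajectory[t])
        return candidates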

    Betweenness Centrality as a Basis for Forming Skills

    We show that betweenness centrality, a graph-theoretic measure widely used in social network analysis, provides a sound basis for autonomously forming useful high-level behaviors, or skills, from available primitives, the smallest behavioral units available to an autonomous agent.
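
    As a rough sketch of how such subgoals could be extracted in practice, assuming an interaction graph built from observed transitions and using networkx (the skill-generation step itself is omitted):

    import networkx as nx

    def skill_subgoals(transitions, top_k=5):
        # transitions: iterable of (state, next_state) pairs observed by the agent.
        graph = nx.DiGraph()
        graph.add_edges_from(transitions)
        # States that lie on many shortest paths between other states.
        centrality = nx.betweenness_centrality(graph)
        return sorted(centrality, key=centrality.get, reverse=True)[:top_k]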

    Creating Multi-Level Skill Hierarchies in Reinforcement Learning

    What is a useful skill hierarchy for an autonomous agent? We propose an answer based on the graphical structure of an agent's interaction with its environment. Our approach uses hierarchical graph partitioning to expose the structure of the graph at varying timescales, producing a skill hierarchy with multiple levels of abstraction. At each level of the hierarchy, skills move the agent between regions of the state space that are well connected within themselves but weakly connected to each other. We illustrate the utility of the proposed skill hierarchy in a wide variety of domains in the context of reinforcement learning.
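
    A minimal sketch of the general recipe, assuming an undirected state-transition graph and substituting Kernighan-Lin bisection for the hierarchical partitioning method used in the paper: recursively split the graph so that each level of nesting exposes regions that are well connected internally but weakly connected to each other.

    import networkx as nx
    from networkx.algorithms.community import kernighan_lin_bisection

    def partition_hierarchy(graph, depth):
        # graph: undirected nx.Graph of states linked by observed transitions.
        # Returns a nested structure: leaves are sets of states (regions);
        # each level of nesting corresponds to one level of the skill hierarchy.
        if depth == 0 or graph.number_of_nodes() < 4:
            return set(graph.nodes)
        left, right = kernighan_lin_bisection(graph)
        return [partition_hierarchy(graph.subgraph(left).copy(), depth - 1),
                partition_hierarchy(graph.subgraph(right).copy(), depth - 1)]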

    Explaining Reinforcement Learning with Shapley Values

    For reinforcement learning systems to be widely adopted, their users must understand and trust them. We present a theoretical analysis of explaining reinforcement learning using Shapley values, following a principled approach from game theory for identifying the contribution of individual players to the outcome of a cooperative game. We call this general framework Shapley Values for Explaining Reinforcement Learning (SVERL). Our analysis exposes the limitations of earlier uses of Shapley values in reinforcement learning. We then develop an approach that uses Shapley values to explain agent performance. In a variety of domains, SVERL produces meaningful explanations that match and supplement human intuition.
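
    The performance-explanation idea can be sketched with a standard permutation-sampling Shapley estimator. The code below illustrates the general Shapley computation, not the SVERL method itself; expected_return is an assumed helper that estimates the agent's return when only the features in a given coalition are visible to it (the rest masked, e.g. replaced by a baseline).

    import random

    def shapley_performance(features, expected_return, n_samples=200):
        # features: list of observation-feature indices.
        # expected_return(coalition): assumed helper giving estimated return when
        # the agent observes only the features in `coalition` (others masked).
        values = {f: 0.0 for f in features}
        for _ in range(n_samples):
            order = random.sample(features, len(features))  # random feature ordering
            coalition = set()
            previous = expected_return(coalition)
            for f in order:
                coalition.add(f)
                current = expected_return(coalition)
                values[f] += (current - previous) / n_samples
                previous = current
        return values

    Averaging each feature's marginal contribution over random orderings is the standard Monte Carlo approximation to its Shapley value.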
