Use of Inverse Reinforcement Learning for Identity Prediction

Bao, Jonathan; Beling, Peter; Hayes, Roy; Horowitz, Barry

Abstract

We adopt Markov Decision Processes (MDPs) to model sequential decision problems, in which the current decision made by a human decision maker has an uncertain impact on future opportunities. We hypothesize that the individuality of decision makers can be modeled as differences in the reward function under a common MDP model. A machine learning technique, Inverse Reinforcement Learning (IRL), was used to learn an individual's reward function from limited observations of his or her decision choices. This work serves as an initial investigation into using IRL to analyze decision making, conducted through a human experiment in a cyber shopping environment. Specifically, we assess the ability to determine the demographic identity of users through prediction analysis and supervised learning. The results show that IRL can be used to correctly identify participants, at a rate of 68% for gender and 66% for one of three college major categories.
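
The hypothesis in the abstract, that individuals share one MDP and differ only in their reward function, can be illustrated with a minimal sketch. The example below is not the authors' model or experiment: the three-state, two-action MDP, the transition probabilities, the discount factor, and the two reward vectors are all hypothetical values chosen for illustration. It solves the forward problem with standard value iteration and shows that two reward functions produce different policies under identical dynamics. IRL addresses the inverse problem, recovering a reward function from observed choices, and is not implemented here.

```python
# Toy illustration (all numbers are hypothetical, not from the paper):
# two decision makers share the same MDP dynamics but have different rewards.

def value_iteration(P, reward, gamma=0.9, tol=1e-8):
    """Return the greedy policy (one action index per state).

    P[a][s][t] is the probability of moving from state s to state t
    under action a; reward[s] is the reward collected in state s.
    """
    n_states = len(reward)
    n_actions = len(P)
    V = [0.0] * n_states
    while True:
        # Q[s][a]: immediate reward plus discounted expected future value
        Q = [[reward[s] + gamma * sum(P[a][s][t] * V[t] for t in range(n_states))
              for a in range(n_actions)]
             for s in range(n_states)]
        new_V = [max(q) for q in Q]
        if max(abs(x - y) for x, y in zip(new_V, V)) < tol:
            return [q.index(max(q)) for q in Q]
        V = new_V

# Shared dynamics over states 0, 1, 2 arranged in a cycle.
# Action 0 ("stay") mostly keeps the current state;
# action 1 ("move") mostly advances to the next state.
P = [
    [[0.9, 0.1, 0.0], [0.0, 0.9, 0.1], [0.1, 0.0, 0.9]],  # action 0
    [[0.1, 0.9, 0.0], [0.0, 0.1, 0.9], [0.9, 0.0, 0.1]],  # action 1
]

reward_a = [1.0, 0.0, 0.0]  # hypothetical person A values state 0
reward_b = [0.0, 0.0, 1.0]  # hypothetical person B values state 2

policy_a = value_iteration(P, reward_a)
policy_b = value_iteration(P, reward_b)
print("policy A:", policy_a)
print("policy B:", policy_b)
```

Because the two policies differ while the dynamics are shared, observed choices carry information about the underlying reward function. That is the signal an IRL method would try to recover, and the recovered rewards could then serve as features for a supervised classifier.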

Available Versions

NASA Technical Reports Server

oai:casi.ntrs.nasa.gov:2011001...

Last time updated on 03/08/2016