4,452 research outputs found
Bounded Risk-Sensitive Markov Games: Forward Policy Design and Inverse Reward Learning with Iterative Reasoning and Cumulative Prospect Theory
Classical game-theoretic approaches for multi-agent systems in both the
forward policy design problem and the inverse reward learning problem often
make strong rationality assumptions: agents perfectly maximize expected
utilities under uncertainties. Such assumptions, however, substantially
mismatch with observed humans' behaviors such as satisficing with sub-optimal,
risk-seeking, and loss-aversion decisions. In this paper, we investigate the
problem of bounded risk-sensitive Markov Game (BRSMG) and its inverse reward
learning problem for modeling human realistic behaviors and learning human
behavioral models. Drawing on iterative reasoning models and cumulative
prospect theory, we embrace that humans have bounded intelligence and maximize
risk-sensitive utilities in BRSMGs. Convergence analysis for both the forward
policy design and the inverse reward learning problems are established under
the BRSMG framework. We validate the proposed forward policy design and inverse
reward learning algorithms in a navigation scenario. The results show that the
behaviors of agents demonstrate both risk-averse and risk-seeking
characteristics. Moreover, in the inverse reward learning task, the proposed
bounded risk-sensitive inverse learning algorithm outperforms a baseline
risk-neutral inverse learning algorithm by effectively recovering not only more
accurate reward values but also the intelligence levels and the risk-measure
parameters given demonstrations of agents' interactive behaviors.Comment: Accepted by 2021 AAAI Conference on Artificial Intelligenc
Teach Me What You Want to Play: Learning Variants of Connect Four through Human-Robot Interaction
This paper investigates the use of game theoretic representations to
represent and learn how to play interactive games such as Connect Four. We
combine aspects of learning by demonstration, active learning, and game theory
allowing a robot to leverage its developing representation of the game to
conduct question/answer sessions with a person, thus filling in gaps in its
knowledge. The paper demonstrates a method for teaching a robot the win
conditions of the game Connect Four and its variants using a single
demonstration and a few trial examples with a question and answer session led
by the robot. Our results show that the robot can learn arbitrary win
conditions for the game with little prior knowledge of the win conditions and
then play the game with a human utilizing the learned win conditions. Our
experiments also show that some questions are more important for learning the
game's win conditions. We believe that this method could be broadly applied to
a variety of interactive learning scenarios.Comment: The final authenticated publication is available online at
https://doi.org/10.1007/978-3-030-62056-1_4
Human Motion Trajectory Prediction: A Survey
With growing numbers of intelligent autonomous systems in human environments,
the ability of such systems to perceive, understand and anticipate human
behavior becomes increasingly important. Specifically, predicting future
positions of dynamic agents and planning considering such predictions are key
tasks for self-driving vehicles, service robots and advanced surveillance
systems. This paper provides a survey of human motion trajectory prediction. We
review, analyze and structure a large selection of work from different
communities and propose a taxonomy that categorizes existing methods based on
the motion modeling approach and level of contextual information used. We
provide an overview of the existing datasets and performance metrics. We
discuss limitations of the state of the art and outline directions for further
research.Comment: Submitted to the International Journal of Robotics Research (IJRR),
37 page
Social Interaction-Aware Dynamical Models and Decision Making for Autonomous Vehicles
Interaction-aware Autonomous Driving (IAAD) is a rapidly growing field of
research that focuses on the development of autonomous vehicles (AVs) that are
capable of interacting safely and efficiently with human road users. This is a
challenging task, as it requires the autonomous vehicle to be able to
understand and predict the behaviour of human road users. In this literature
review, the current state of IAAD research is surveyed in this work. Commencing
with an examination of terminology, attention is drawn to challenges and
existing models employed for modelling the behaviour of drivers and
pedestrians. Next, a comprehensive review is conducted on various techniques
proposed for interaction modelling, encompassing cognitive methods, machine
learning approaches, and game-theoretic methods. The conclusion is reached
through a discussion of potential advantages and risks associated with IAAD,
along with the illumination of pivotal research inquiries necessitating future
exploration
- …