Search CORE

792 research outputs found

Should Collaborative Robots be Transparent?

Author: Gandhi Soham
Losey Dylan P.
Sagheb Shahabedin
Publication venue
Publication date: 05/09/2023
Field of study

We often assume that robots which collaborate with humans should behave in ways that are transparent (e.g., legible, explainable). These transparent robots intentionally choose actions that convey their internal state to nearby humans: for instance, a transparent robot might exaggerate its trajectory to indicate its goal. But while transparent behavior seems beneficial for human-robot interaction, is it actually optimal? In this paper we consider collaborative settings where the human and robot have the same objective, and the human is uncertain about the robot's type (i.e., the robot's internal state). We extend a recursive combination of Bayesian Nash equilibrium and the Bellman equation to solve for optimal robot policies. Interestingly, we discover that it is not always optimal for collaborative robots to be transparent; instead, human and robot teams can sometimes achieve higher rewards when the robot is opaque. In contrast to transparent robots, opaque robots select actions that withhold information from the human. Our analysis suggests that opaque behavior becomes optimal when either (a) human-robot interactions have a short time horizon or (b) users are slow to learn from the robot's actions. We extend this theoretical analysis to user studies across 43 total participants in both online and in-person settings. We find that -- during short interactions -- users reach higher rewards when working with opaque partners, and subjectively rate opaque robots as about equal to transparent robots. See videos of our experiments here: https://youtu.be/u8q1Z7WHUu

arXiv.org e-Print Archive

Modeling of Human Motor Control and Its Application in Human Interaction with Machines

Author: Jian Jianan
Publication venue
Publication date: 24/01/2018
Field of study

Human civilization started with the invention of tools which enhanced and expanded human motor capability. With the recent development of virtual reality technology and artificial intelligence, the interaction between humans and machines has become more and more intricate. A better understanding of our motor system and the way it interacts with machines will allow us to better design intelligent devices. However, previous works in motor control modeling mostly focused on linear dynamics and had limitations in incorporating the process of learning. A musculoskeletal model based on mechanical principles and a motor control model based on Bayesian probability are proposed in this study. The probability-theoretical formulation of the problem not only facilitates the understanding of motor learning but also transforms nonlinear dynamics into linear problems. Using these models, the interactions in which both human and machine are capable of learning and adapting are formulated and analyzed. Intelligent control policies for machine imitating the human motor control are proposed. Simulation results are also presented

D-Scholarship@Pitt

An Adversarial Interpretation of Information-Theoretic Bounded Rationality

Author: Lee Daniel D.
Ortega Pedro A.
Publication venue
Publication date: 22/04/2014
Field of study

Recently, there has been a growing interest in modeling planning with information constraints. Accordingly, an agent maximizes a regularized expected utility known as the free energy, where the regularizer is given by the information divergence from a prior to a posterior policy. While this approach can be justified in various ways, including from statistical mechanics and information theory, it is still unclear how it relates to decision-making against adversarial environments. This connection has previously been suggested in work relating the free energy to risk-sensitive control and to extensive form games. Here, we show that a single-agent free energy optimization is equivalent to a game between the agent and an imaginary adversary. The adversary can, by paying an exponential penalty, generate costs that diminish the decision maker's payoffs. It turns out that the optimal strategy of the adversary consists in choosing costs so as to render the decision maker indifferent among its choices, which is a definining property of a Nash equilibrium, thus tightening the connection between free energy optimization and game theory.Comment: 7 pages, 4 figures. Proceedings of AAAI-1

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Principles of sensorimotor learning.

Author: A d'Avella
A Hadipour-Niktarash
A Haith
A Keisler
A Resulaj
AA Ahmed
AA Faisal
AAG Mattar
AJ Nagengast
AJ Nagengast
AR Luft
C Takahashi
CJ Burke
CM Bütefisch
CM Heyes
CS Green
D Kahneman
D Marr
D Mitrovic
D Nozaki
D Nozaki
D Pelisson
DA Braun
DA Braun
DA Braun
DA Braun
DA Braun
DA Braun
DA Braun
Daniel M. Wolpert
DJ Ostry
DM Wolpert
DM Wolpert
DW Franklin
DW Franklin
E Burdet
E Oztop
E Todorov
E Todorov
EK Cressman
FJ Valero-Cuevas
G Caithness
G Rizzolatti
G Rizzolatti
G Rotman
GC Sing
H Golla
H Tanaka
IS Howard
IS Howard
J Anguera
J Bongard
J Classen
J Diedrichsen
J Diedrichsen
J Diedrichsen
J Diedrichsen
J Fernandez-Ruiz
J Izawa
J Kluzik
J Krakauer
J Najemnik
J Reis
J Triesch
J Trommershäuser
J-Y Lee
J. Randall Flanagan
JA Hosp
JA Pruszynski
JA Pruszynski
JI Gold
JM Galea
JN Ingram
JR Flanagan
JR Flanagan
JR Flanagan
JR Lackner
Jörn Diedrichsen
K Kording
K Reed
K Seki
KA Thoroughman
KM Mosier
KP Kording
L Itti
L Madelain
LLCD Bursztyn
M Abe
M Berniker
M Brass
M Burstedt
M Deisenroth
M Ernst
M Hayhoe
M Jordan
M Land
M Land
MA Smith
MA Smith
MD Hesse
MJ Wagner
MM Churchland
N Cothros
N Cothros
N Malfait
NI Krouchev
O Donchin
O White
P Baraduc
P Cisek
P Vetter
PM Bays
R Gupta
R Shadmehr
R Srimal
RJ van Beers
RM Brown
RS Johansson
RS Johansson
RS Sutton
S Vaziri
SB Most
SM Aglioti
SM Morton
SM Nasir
T Brashers-Krug
T Martin
T Verstynen
U Sailer
V Brooks
V Della-Maggiore
VS Huang
VS Huang
VS Huang
X Liu
Y Kojima
Y-W Tseng
Publication venue: Scholarship@Western
Publication date: 27/10/2011
Field of study

The exploits of Martina Navratilova and Roger Federer represent the pinnacle of motor learning. However, when considering the range and complexity of the processes that are involved in motor learning, even the mere mortals among us exhibit abilities that are impressive. We exercise these abilities when taking up new activities - whether it is snowboarding or ballroom dancing - but also engage in substantial motor learning on a daily basis as we adapt to changes in our environment, manipulate new objects and refine existing skills. Here we review recent research in human motor learning with an emphasis on the computational mechanisms that are involved

Scholarship@Western

Crossref

A Survey of Monte Carlo Tree Search Methods

Author: Browne Cameron B
Colton Simon
Cowling Peter I
Lucas Simon M
Perez Diego
Powley Edward
Rohlfshagen Philipp
Samothrakis Spyridon
Tavener Stephen
Whitehouse Daniel
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2012
Field of study

Monte Carlo tree search (MCTS) is a recently proposed search method that combines the precision of tree search with the generality of random sampling. It has received considerable interest due to its spectacular success in the difficult problem of computer Go, but has also proved beneficial in a range of other domains. This paper is a survey of the literature to date, intended to provide a snapshot of the state of the art after the first five years of MCTS research. We outline the core algorithm's derivation, impart some structure on the many variations and enhancements that have been proposed, and summarize the results from the key game and nongame domains to which MCTS methods have been applied. A number of open research questions indicate that the field is ripe for future work

University of Essex Research Repository

CiteSeerX

Maastricht University Research Portal

Recommended from our members

Intelligent and High-Performance Behavior Design of Autonomous Systems via Learning, Optimization and Control

Author: Sun Liting
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

Nowadays, great societal demands have rapidly boosted the development of autonomous systems that densely interact with humans in many application domains, from manufacturing to transportation and from workplaces to daily lives. The shift from isolated working environments to human-dominated space requires autonomous systems to be empowered to handle not only environmental uncertainties such as external vibrations but also interaction uncertainties arising from human behavior which is in nature probabilistic, causal but not strictly rational, internally hierarchical and socially compliant.This dissertation is concerned with the design of intelligent and high-performance behavior of such autonomous systems, leveraging the strength from control, optimization, learning, and cognitive science. The work consists of two parts. In Part I, the problem of high-level hybrid human-machine behavior design is addressed. The goal is to achieve safe, efficient and human-like interaction with people. A framework based on the theory of mind, utility theories and imitation learning is proposed to efficiently represent and learn the complicated behavior of humans. Built upon that, machine behaviors at three different levels - the perceptual level, the reasoning level, and the action level - are designed via imitation learning, optimization, and online adaptation, allowing the system to interpret, reason and behave as human, particularly when a variety of uncertainties exist. Applications to autonomous driving are considered throughout Part I. Part II is concerned with the design of high-performance low-level individual machine behavior in the presence of model uncertainties and external disturbances. Advanced control laws based on adaptation, iterative learning and the internal structures of uncertainties/disturbances are developed to assure that the high-level interactive behaviors can be reliably executed. Applications on robot manipulators and high-precision motion systems are discussed in this part

eScholarship - University of California