792 research outputs found
Should Collaborative Robots be Transparent?
We often assume that robots which collaborate with humans should behave in
ways that are transparent (e.g., legible, explainable). These transparent
robots intentionally choose actions that convey their internal state to nearby
humans: for instance, a transparent robot might exaggerate its trajectory to
indicate its goal. But while transparent behavior seems beneficial for
human-robot interaction, is it actually optimal? In this paper we consider
collaborative settings where the human and robot have the same objective, and
the human is uncertain about the robot's type (i.e., the robot's internal
state). We extend a recursive combination of Bayesian Nash equilibrium and the
Bellman equation to solve for optimal robot policies. Interestingly, we
discover that it is not always optimal for collaborative robots to be
transparent; instead, human and robot teams can sometimes achieve higher
rewards when the robot is opaque. In contrast to transparent robots, opaque
robots select actions that withhold information from the human. Our analysis
suggests that opaque behavior becomes optimal when either (a) human-robot
interactions have a short time horizon or (b) users are slow to learn from the
robot's actions. We extend this theoretical analysis to user studies across 43
total participants in both online and in-person settings. We find that --
during short interactions -- users reach higher rewards when working with
opaque partners, and subjectively rate opaque robots as about equal to
transparent robots. See videos of our experiments here:
https://youtu.be/u8q1Z7WHUu
Modeling of Human Motor Control and Its Application in Human Interaction with Machines
Human civilization started with the invention of tools which enhanced and expanded human motor capability. With the recent development of virtual reality technology and artificial intelligence, the interaction between humans and machines has become more and more intricate. A better understanding of our motor system and the way it interacts with machines will allow us to better design intelligent devices. However, previous works in motor control modeling mostly focused on linear dynamics and had limitations in incorporating the process of learning.
A musculoskeletal model based on mechanical principles and a motor control model based on Bayesian probability are proposed in this study. The probability-theoretical formulation of the problem not only facilitates the understanding of motor learning but also transforms nonlinear dynamics into linear problems. Using these models, the interactions in which both human and machine are capable of learning and adapting are formulated and analyzed. Intelligent control policies for machine imitating the human motor control are proposed. Simulation results are also presented
An Adversarial Interpretation of Information-Theoretic Bounded Rationality
Recently, there has been a growing interest in modeling planning with
information constraints. Accordingly, an agent maximizes a regularized expected
utility known as the free energy, where the regularizer is given by the
information divergence from a prior to a posterior policy. While this approach
can be justified in various ways, including from statistical mechanics and
information theory, it is still unclear how it relates to decision-making
against adversarial environments. This connection has previously been suggested
in work relating the free energy to risk-sensitive control and to extensive
form games. Here, we show that a single-agent free energy optimization is
equivalent to a game between the agent and an imaginary adversary. The
adversary can, by paying an exponential penalty, generate costs that diminish
the decision maker's payoffs. It turns out that the optimal strategy of the
adversary consists in choosing costs so as to render the decision maker
indifferent among its choices, which is a definining property of a Nash
equilibrium, thus tightening the connection between free energy optimization
and game theory.Comment: 7 pages, 4 figures. Proceedings of AAAI-1
Principles of sensorimotor learning.
The exploits of Martina Navratilova and Roger Federer represent the pinnacle of motor learning. However, when considering the range and complexity of the processes that are involved in motor learning, even the mere mortals among us exhibit abilities that are impressive. We exercise these abilities when taking up new activities - whether it is snowboarding or ballroom dancing - but also engage in substantial motor learning on a daily basis as we adapt to changes in our environment, manipulate new objects and refine existing skills. Here we review recent research in human motor learning with an emphasis on the computational mechanisms that are involved
A Survey of Monte Carlo Tree Search Methods
Monte Carlo tree search (MCTS) is a recently proposed search method that combines the precision of tree search with the generality of random sampling. It has received considerable interest due to its spectacular success in the difficult problem of computer Go, but has also proved beneficial in a range of other domains. This paper is a survey of the literature to date, intended to provide a snapshot of the state of the art after the first five years of MCTS research. We outline the core algorithm's derivation, impart some structure on the many variations and enhancements that have been proposed, and summarize the results from the key game and nongame domains to which MCTS methods have been applied. A number of open research questions indicate that the field is ripe for future work
Recommended from our members
Intelligent and High-Performance Behavior Design of Autonomous Systems via Learning, Optimization and Control
Nowadays, great societal demands have rapidly boosted the development of autonomous systems that densely interact with humans in many application domains, from manufacturing to transportation and from workplaces to daily lives. The shift from isolated working environments to human-dominated space requires autonomous systems to be empowered to handle not only environmental uncertainties such as external vibrations but also interaction uncertainties arising from human behavior which is in nature probabilistic, causal but not strictly rational, internally hierarchical and socially compliant.This dissertation is concerned with the design of intelligent and high-performance behavior of such autonomous systems, leveraging the strength from control, optimization, learning, and cognitive science. The work consists of two parts. In Part I, the problem of high-level hybrid human-machine behavior design is addressed. The goal is to achieve safe, efficient and human-like interaction with people. A framework based on the theory of mind, utility theories and imitation learning is proposed to efficiently represent and learn the complicated behavior of humans. Built upon that, machine behaviors at three different levels - the perceptual level, the reasoning level, and the action level - are designed via imitation learning, optimization, and online adaptation, allowing the system to interpret, reason and behave as human, particularly when a variety of uncertainties exist. Applications to autonomous driving are considered throughout Part I. Part II is concerned with the design of high-performance low-level individual machine behavior in the presence of model uncertainties and external disturbances. Advanced control laws based on adaptation, iterative learning and the internal structures of uncertainties/disturbances are developed to assure that the high-level interactive behaviors can be reliably executed. Applications on robot manipulators and high-precision motion systems are discussed in this part
- …