123 research outputs found
Expected Value of Communication for Planning in Ad Hoc Teamwork
You are viewing an article from Good Systems from February 2021Office of the VP for Researc
Stubborn: An Environment for Evaluating Stubbornness between Agents with Aligned Incentives
Recent research in multi-agent reinforcement learning (MARL) has shown
success in learning social behavior and cooperation. Social dilemmas between
agents in mixed-sum settings have been studied extensively, but there is little
research into social dilemmas in fullycooperative settings, where agents have
no prospect of gaining reward at another agent's expense.
While fully-aligned interests are conducive to cooperation between agents,
they do not guarantee it. We propose a measure of "stubbornness" between agents
that aims to capture the human social behavior from which it takes its name: a
disagreement that is gradually escalating and potentially disastrous. We would
like to promote research into the tendency of agents to be stubborn, the
reactions of counterpart agents, and the resulting social dynamics.
In this paper we present Stubborn, an environment for evaluating stubbornness
between agents with fully-aligned incentives. In our preliminary results, the
agents learn to use their partner's stubbornness as a signal for improving the
choices that they make in the environment
Comments from ARPA/AFML
I want to take a moment to give you a few reflections from the sponsoring agency. The Advanced Research Projects Agency has been set up to take chances on high risk R and 0. I have a warm spot in my heart for this particular program because it\u27s the first one that I was fortunate enough to pull together on an integrated basis. However, I have to admit that when it was first suggested by Mike Buckley that I should put a few chips in NOE, I felt kind of blah about it. I felt it was not a risky area, that it was not the colorful type of thing that ARPA should be getting into. The more I ~rd about it, however, the more I realized the importance of this area and that a good investment in some people that Don had pulled together would pay off handsomely. I want to compliment Don and the excellent team he has gotten together. I am particularly excited tonight to have the atteniton of people like Secretary Brownman who, I can appreciate, understands technology and understands that it needs diffusing out into the services. The program is still risky unless we can pull that off. Let\u27s hope that getting the attention of people like Secretary Brownman and others will aid in a technology transfer so that new developments will not be lost for another decade or two, or until someone rediscovers it
ΠΡΠΈΡ ΠΎΠ»ΠΎΠ³ΠΎ-ΠΏΠ΅Π΄Π°Π³ΠΎΠ³ΠΈΡΠ΅ΡΠΊΠΈΠ΅ Π°ΡΠΏΠ΅ΠΊΡΡ ΠΏΡΠ΅ΠΏΠΎΠ΄Π°Π²Π°Π½ΠΈΡ Π΄ΠΈΡΡΠΈΠΏΠ»ΠΈΠ½Ρ "ΠΎΠ±ΡΠ°Ρ Π±ΠΈΠΎΠ»ΠΎΠ³ΠΈΡ" ΠΈΠ½ΠΎΡΡΡΠ°Π½Π½ΡΠΌ ΡΡΠ°ΡΠΈΠΌΡΡ ΠΏΠΎΠ΄Π³ΠΎΡΠΎΠ²ΠΈΡΠ΅Π»ΡΠ½ΠΎΠ³ΠΎ ΡΠ°ΠΊΡΠ»ΡΡΠ΅ΡΠ°
Π ΡΡΠ°ΡΡΠ΅ ΠΎΠ±ΠΎΡΠ½ΠΎΠ²ΡΠ²Π°Π΅ΡΡΡ Π½Π΅ΠΎΠ±Ρ
ΠΎΠ΄ΠΈΠΌΠΎΡΡΡ ΠΈΡΠΏΠΎΠ»ΡΠ·ΠΎΠ²Π°Π½ΠΈΡ ΠΏΡΠΈΡ
ΠΎΠ»ΠΎΠ³ΠΎ-ΠΏΠ΅Π΄Π°Π³ΠΎΠ³ΠΈΡΠ΅ΡΠΊΠΈΡ
ΠΏΠΎΠ΄Ρ
ΠΎΠ΄ΠΎΠ² Π² ΠΏΡΠΎΡΠ΅ΡΡΠ΅ ΠΏΡΠ΅ΠΏΠΎΠ΄Π°Π²Π°Π½ΠΈΡ ΠΎΠ±ΡΠ΅ΠΉ Π±ΠΈΠΎΠ»ΠΎΠ³ΠΈΠΈ ΠΈΠ½ΠΎΡΡΡΠ°Π½Π½ΡΠΌ ΡΡΠ°ΡΠΈΠΌΡΡ Π½Π° Π΄ΠΎΠ²ΡΠ·ΠΎΠ²ΡΠΊΠΎΠΌ ΡΡΠ°ΠΏΠ΅ ΠΎΠ±ΡΡΠ΅Π½ΠΈΡ. ΠΡΠ΅Π΄Π»Π°Π³Π°ΡΡΡΡ ΡΠΏΠ΅ΡΠΈΠ°Π»ΡΠ½ΡΠ΅ ΠΌΠ΅ΡΠΎΠ΄ΠΈΡΠ΅ΡΠΊΠΈΠ΅ ΠΏΡΠΈΠ΅ΠΌΡ ΠΎΠ²Π»Π°Π΄Π΅Π½ΠΈΡ ΡΡΠ°ΡΠΈΠΌΠΈΡΡ ΡΠ·ΡΠΊΠΎΠΌ Π΄ΠΈΡΡΠΈΠΏΠ»ΠΈΠ½Ρ ΡΠ΅ΡΠΌΠΈΠ½ΠΎΠ»ΠΎΠ³ΠΈΡΠ΅ΡΠΊΠΎΠΉ Π»Π΅ΠΊΡΠΈΠΊΠΈ, ΠΎΠ±Π΅ΡΠΏΠ΅ΡΠΈΠ²Π°ΡΡΠΈΠ΅ Π°ΠΊΡΠΈΠ²Π½ΡΡ ΡΠ·ΡΠΊΠΎΠ²ΡΡ ΠΊΠΎΠΌΠΏΠ΅ΡΠ΅Π½ΡΠΈΡ ΠΏΠΎ ΠΏΡΠ΅Π΄ΠΌΠ΅ΡΡ. ΠΠΎΠ΄ΡΠ΅ΡΠΊΠΈΠ²Π°Π΅ΡΡΡ Π½Π΅ΠΎΠ±Ρ
ΠΎΠ΄ΠΈΠΌΠΎΡΡΡ Π΄Π»Ρ ΡΠΎΠ²Π΅ΡΡΠ΅Π½ΡΡΠ²ΠΎΠ²Π°Π½ΠΈΡ ΡΡΠ΅Π±Π½ΠΎΠ³ΠΎ ΠΏΡΠΎΡΠ΅ΡΡΠ° ΠΈ ΡΠ΅Π°Π»ΠΈΠ·Π°ΡΠΈΠΈ Π»ΠΈΡΠ½ΠΎΡΡΠ½ΠΎ ΠΎΡΠΈΠ΅Π½ΡΠΈΡΠΎΠ²Π°Π½Π½ΠΎΠ³ΠΎ ΠΏΠΎΠ΄Ρ
ΠΎΠ΄Π° ΠΏΡΠ΅Π΄Π²Π°ΡΠΈΡΠ΅Π»ΡΠ½ΠΎ ΠΎΡΠ΅Π½ΠΈΡΡ ΠΏΡΠΈΡ
ΠΎΡΠΈΠ·ΠΈΠΎΠ»ΠΎΠ³ΠΈΡΠ΅ΡΠΊΠΈΠΉ ΡΡΠ°ΡΡΡ ΠΈΠ½ΠΎΡΡΡΠ°Π½Π½ΡΡ
ΡΡΠ°ΡΠΈΡ
ΡΡ, ΡΡΠΎ Π²ΠΊΠ»ΡΡΠ°Π΅Ρ Π² ΡΠ΅Π±Ρ ΠΈΡΡΠ»Π΅Π΄ΠΎΠ²Π°Π½ΠΈΠ΅ Π°Π½Π°Π»ΠΈΡΠΈΠΊΠΎ-ΡΠΈΠ½ΡΠ΅ΡΠΈΡΠ΅ΡΠΊΠΎΠΉ Π΄Π΅ΡΡΠ΅Π»ΡΠ½ΠΎΡΡΠΈ, ΡΠ°Π·Π²ΠΈΡΠΈΠ΅ ΡΠ°Π·Π»ΠΈΡΠ½ΡΡ
Π²ΠΈΠ΄ΠΎΠ² Π·Π°ΠΏΠΎΠΌΠΈΠ½Π°Π½ΠΈΡ, ΠΈΠ½ΡΠ΅Π½ΡΠΈΠ²Π½ΠΎΡΡΠΈ ΠΈ ΡΠ΅ΠΌΠΏΠΎΠ² ΡΠΌΡΡΠ²Π΅Π½Π½ΠΎΠΉ Π΄Π΅ΡΡΠ΅Π»ΡΠ½ΠΎΡΡΠΈ, Π»ΠΈΡΠ½ΠΎΡΡΠ½ΡΡ
ΠΎΡΠΎΠ±Π΅Π½Π½ΠΎΡΡΠ΅ΠΉ ΡΡΠ°ΡΠΈΡ
ΡΡ. ΠΠ°ΠΈΠ±ΠΎΠ»Π΅Π΅ Π²Π°ΠΆΠ½ΡΠ΅ ΠΈΠ½ΡΡΡΡΠΌΠ΅Π½ΡΡ ΠΏΡΠΈΡ
ΠΎΠ»ΠΎΠ³ΠΎ-ΠΏΠ΅Π΄Π°Π³ΠΎΠ³ΠΈΡΠ΅ΡΠΊΠΎΠΉ Π°Π΄Π°ΠΏΡΠ°ΡΠΈΠΈ: ΠΎΡΠ΅Π½ΠΊΠ° ΡΡΠΎΠ²Π½Ρ Π²ΠΎΡΠΏΡΠΈΡΡΠΈΡ ΡΡΠ΅Π±Π½ΠΎΠ³ΠΎ ΠΌΠ°ΡΠ΅ΡΠΈΠ°Π»Π° ΠΏΡΠΈ ΡΠ°Π·Π»ΠΈΡΠ½ΡΡ
ΡΠΎΡΠΌΠ°Ρ
Π΅Π³ΠΎ ΠΏΡΠ΅Π·Π΅Π½ΡΠ°ΡΠΈΠΈ (ΠΊΠ»Π°ΡΡΠΈΡΠ΅ΡΠΊΠ°Ρ Π»Π΅ΠΊΡΠΈΡ, ΠΌΠΈΠΊΡΠΎΠ»Π΅ΠΊΡΠΈΡ, ΠΏΡΠ°ΠΊΡΠΈΡΠ΅ΡΠΊΠΈΠ΅ Π·Π°Π½ΡΡΠΈΡ)
Editorial : Advances in Goal, Plan and Activity Recognition
Funding Information: The editors would like to thank the authors and reviewers for their time and effort and also for providing new insights and reflections into the growing field of goal recognition research. We are indebted to Dr. Marta Compigotto, Senior Journal Specialist, and her editorial team for their editorial assistance.Peer reviewedPublisher PD
ICRA Roboethics Challenge 2023: Intelligent Disobedience in an Elderly Care Home
With the projected surge in the elderly population, service robots offer a
promising avenue to enhance their well-being in elderly care homes. Such robots
will encounter complex scenarios which will require them to perform decisions
with ethical consequences. In this report, we propose to leverage the
Intelligent Disobedience framework in order to give the robot the ability to
perform a deliberation process over decisions with potential ethical
implications. We list the issues that this framework can assist with, define it
formally in the context of the specific elderly care home scenario, and
delineate the requirements for implementing an intelligently disobeying robot.
We conclude this report with some critical analysis and suggestions for future
work.Comment: This report is part of ICRA roboethics competition :
https://competition.raiselab.ca/competition-details-2023_1/ethics-challenge/submitted-proposals/submission-
- β¦