Search CORE

39 research outputs found

Game theory of mind

Author: A Benveniste
A Ng
A Traulsen
AN Hampton
B Skyrms
CF Camerer
CF Camerer
CF Camerer
CJCH Watkins
D Fudenberg
D Fudenberg
D Kahneman
D Wilson
DG Premack
DM Kreps
DO Stahl
E Fehr
E Fehr
E Todorov
H Gintis
H Gintis
HA Simon
HL Gallagher
J Moll
JM Smith
JM Smith
K McCabe
Karl J. Friston
KJ Friston
M Costa-Gomes
P Davies
P Milgrom
PA Haile
PJ Gmytrasiewicz
R Bellman
R McKelvey
Ray J. Dolan
RS Sutton
S Avner
Tim Behrens
U Frith
W Nelson
Wako Yoshida
Publication venue
Publication date: 01/01/2008
Field of study

This paper introduces a model of ‘theory of mind’, namely, how we represent the intentions and goals of others to optimise our mutual interactions. We draw on ideas from optimum control and game theory to provide a ‘game theory of mind’. First, we consider the representations of goals in terms of value functions that are prescribed by utility or rewards. Critically, the joint value functions and ensuing behaviour are optimised recursively, under the assumption that I represent your value function, your representation of mine, your representation of my representation of yours, and so on ad infinitum. However, if we assume that the degree of recursion is bounded, then players need to estimate the opponent's degree of recursion (i.e., sophistication) to respond optimally. This induces a problem of inferring the opponent's sophistication, given behavioural exchanges. We show it is possible to deduce whether players make inferences about each other and quantify their sophistication on the basis of choices in sequential games. This rests on comparing generative models of choices with, and without, inference. Model comparison is demonstrated using simulated and real data from a ‘stag-hunt’. Finally, we note that exactly the same sophisticated behaviour can be achieved by optimising the utility function itself (through prosocial utility), producing unsophisticated but apparently altruistic agents. This may be relevant ethologically in hierarchal game theory and coevolution

CiteSeerX

Crossref

Directory of Open Access Journals

UCL Discovery

PubMed Central

MPG.PuRe

A flexible coupling approach to multi-agent planning under incomplete information

Author: A Barrett
A Blum
Alejandro Torreño
C Boutilier
C Micacchi
D Weld
D Weld
E Argente
E Ephrati
EH Durfee
Eva Onaindia
F Wu
GA Kaminka
H Tonino
H Younes
J Hoffmann
K Decker
M Brenner
M desJardins
M Helmert
M Kone
M Tambe
M Weerdt de
N Nguyen
P Gmytrasiewicz
R Fikes
S Kambhampati
S Kraus
S Parsons
S Richter
V Lesser
Óscar Sapena
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

The final publication is available at Springer via http://dx.doi.org/10.1007/s10115-012-0569-7Multi-agent planning (MAP) approaches are typically oriented at solving loosely coupled problems, being ineffective to deal with more complex, strongly related problems. In most cases, agents work under complete information, building complete knowledge bases. The present article introduces a general-purpose MAP framework designed to tackle problems of any coupling levels under incomplete information. Agents in our MAP model are partially unaware of the information managed by the rest of agents and share only the critical information that affects other agents, thus maintaining a distributed vision of the task. Agents solve MAP tasks through the adoption of an iterative refinement planning procedure that uses single-agent planning technology. In particular, agents will devise refinements through the partial-order planning paradigm, a flexible framework to build refinement plans leaving unsolved details that will be gradually completed by means of new refinements. Our proposal is supported with the implementation of a fully operative MAP system and we show various experiments when running our system over different types of MAP problems, from the most strongly related to the most loosely coupled.This work has been partly supported by the Spanish MICINN under projects Consolider Ingenio 2010 CSD2007-00022 and TIN2011-27652-C03-01, and the Valencian Prometeo project 2008/051.Torreño Lerma, A.; Onaindia De La Rivaherrera, E.; Sapena Vercher, O. (2014). A flexible coupling approach to multi-agent planning under incomplete information. Knowledge and Information Systems. 38:141-178. https://doi.org/10.1007/s10115-012-0569-7S14117838Argente E, Botti V, Carrascosa C, Giret A, Julian V, Rebollo M (2011) An abstract architecture for virtual organizations: the THOMAS approach. Knowl Inf Syst 29(2):379–403Barrett A, Weld DS (1994) Partial-order planning: evaluating possible efficiency gains. Artif Intell 67(1):71–112Belesiotis A, Rovatsos M, Rahwan I (2010) Agreeing on plans through iterated disputes. In: Proceedings of the 9th international conference on autonomous agents and multiagent systems. pp 765–772Bellifemine F, Poggi A, Rimassa G (2001) JADE: a FIPA2000 compliant agent development environment. In: Proceedings of the 5th international conference on autonomous agents (AAMAS). ACM, pp 216–217Blum A, Furst ML (1997) Fast planning through planning graph analysis. Artif Intell 90(1–2):281–300Boutilier C, Brafman R (2001) Partial-order planning with concurrent interacting actions. J Artif Intell Res 14(105):136Brafman R, Domshlak C (2008) From one to many: planning for loosely coupled multi-agent systems. In: Proceedings of the 18th international conference on automated planning and scheduling (ICAPS). pp 28–35Brenner M, Nebel B (2009) Continual planning and acting in dynamic multiagent environments. J Auton Agents Multiag Syst 19(3):297–331Coles A, Coles A, Fox M, Long D (2010) Forward-chaining partial-order planning. In: Proceedings of the 20th international conference on automated planning and scheduling (ICAPS). pp 42–49Coles A, Fox M, Long D, Smith A (2008) Teaching forward-chaining planning with JavaFF. In: Colloquium on AI education, 23rd AAAI conference on artificial intelligenceCox J, Durfee E, Bartold T (2005) A distributed framework for solving the multiagent plan coordination problem. In: Proceedings of the 4th international joint conference on autonomous agents and multiagent systems (AAMAS). ACM, pp 821–827de Weerdt M, Clement B (2009) Introduction to planning in multiagent systems. Multiag Grid Syst 5(4):345–355Decker K, Lesser VR (1992) Generalizing the partial global planning algorithm. Int J Coop Inf Syst 2(2):319–346desJardins M, Durfee E, Ortiz C, Wolverton M (1999) A survey of research in distributed continual planning. AI Mag 20(4):13–22Doshi P (2007) On the role of interactive epistemology in multiagent planning. In: Artificial intelligence and, pattern recognition. pp 208–213Dréo J, Savéant P, Schoenauer M, Vidal V (2011) Divide-and-evolve: the marriage of descartes and darwin. In: Proceedings of the 7th international planning competition (IPC). Freiburg, GermanyDurfee EH (2001) Distributed problem solving and planning. In: Multi-agents systems and applications: selected tutorial papers from the 9th ECCAI advanced course (ACAI) and agentLink’s third European agent systems summer school (EASSS), vol LNAI 2086. Springer, pp 118–149Durfee EH, Lesser V (1991) Partial global planning: a coordination framework for distributed hypothesis formation. IEEE Trans Syst Man Cybern Special Issue Distrib Sens Netw 21(5):1167–1183Ephrati E, Rosenschein JS (1996) Deriving consensus in multiagent systems. Artif Intell 87(1–2):21–74Fikes R, Nilsson N (1971) STRIPS: a new approach to the application of theorem proving to problem solving. Artif Intell 2(3):189–208Fogués R, Alberola J, Such J, Espinosa A, Garcia-Fornes A (2010) Towards dynamic agent interaction support in open multiagent systems. In: Proceedings of the 2010 conference on artificial intelligence research and development: proceedings of the 13th international conference of the Catalan association for artificial intelligence’. IOS Press, pp 89–98Gerevini A, Long D (2006) Preferences and soft constraints in PDDL3. In: ICAPS workshop on planning with preferences and soft constraints, vol 6. Citeseer, pp 46–53Ghallab M, Howe A, Knoblock C, McDermott D, Ram A, Veloso M, Weld D, Wilkins D (1998) PDDL-the Planning Domain Definition Language. In: AIPS-98 planning committeeGmytrasiewicz P, Doshi P (2005) A framework for sequential planning in multi-agent settings. J Artif Intell Res 24:49–79Haslum P, Jonsson P (1999) Some results on the complexity of planning with incomplete information. In: Proceedings of the 5th European conference on, planning (ECP). pp 308–318Helmert M (2006) The fast downward planning system. J Artif Intell Res 26(1):191–246Hoffmann J, Nebel B (2001) The FF planning system: fast planning generation through heuristic search. J Artif Intell Res 14:253–302Jonsson A, Rovatsos M (2011) Scaling up multiagent planning: a best-response approach. In: Proceedings of the 21st international conference on automated planning and scheduling (ICAPS). AAAI, pp 114–121Kambhampati S (1997) Refinement planning as a unifying framework for plan synthesis. AI Mag 18(2):67–97Kaminka GA, Pynadath DV, Tambe M (2002) Monitoring teams by overhearing: a multi-agent plan-recognition approach. J Artif Intell Res 17:83–135Kone M, Shimazu A, Nakajima T (2000) The state of the art in agent communication languages. Knowl Inf Syst 2(3):259–284Kovacs DL (2011) Complete BNF description of PDDL3.1. Technical reportKraus S (1997) Beliefs, time and incomplete information in multiple encounter negotiations among autonomous agents. Ann Math Artif Intell 20(1–4):111–159Kumar A, Zilberstein S, Toussaint M (2011) Scalable multiagent planning using probabilistic inference. In: Proceedings of the 22nd international joint conference on artificial intelligence (IJCAI)’. Barcelona, Spain, pp 2140–2146Kvarnström J. (2011) Planning for loosely coupled agents using partial order forward-chaining. In: Proceedings of the 21st international conference on automated planning and scheduling (ICAPS). AAAI, pp 138–145Lesser V, Decker K, Wagner T, Carver N, Garvey A, Horling B, Neiman D, Podorozhny R, Prasad M, Raja A et al (2004) Evolution of the GPGP/TAEMS domain-independent coordination framework. Auton Agents Multi Agent Syst 9(1):87–143Lipovetzky N, Geffner H (2011) Searching for plans with carefully designed probes. In: Proceedings of the 21th international conference on automated planning and scheduling (ICAPS)Micacchi C, Cohen R (2008) A framework for simulating real-time multi-agent systems. Knowl Inf Syst 17(2):135–166Nguyen N, Katarzyniak R (2009) Actions and social interactions in multi-agent systems. Knowl Inf Syst 18(2):133–136Nguyen X, Kambhampati S (2001) Reviving partial order planning. In: Proceedings of the 17th international joint conference on artificial intelligence (IJCAI). Morgan Kaufmann, pp 459–464Nissim R, Brafman R, Domshlak C (2010) A general, fully distributed multi-agent planning algorithm. In: Proceedings of the 9th international conference on autonomous agents and multiagent systems (AAMAS). pp 1323–1330Pajares S, Onaindia E (2012) Defeasible argumentation for multi-agent planning in ambient intelligence applications. In: Proceedings of the 11th international conference on autonomous agents and multiagent systems (AAMAS) pp 509–516Paolucci M, Shehory O, Sycara K, Kalp D, Pannu A (2000) A planning component for RETSINA agents. Intelligent Agents VI. Agent Theories Architectures, and Languages pp 147–161Parsons S, Sierra C, Jennings N (1998) Agents that reason and negotiate by arguing. J Logic Comput 8(3):261Penberthy J, Weld D (1992) UCPOP: a sound, complete, partial order planner for ADL. In: Proceedings of the 3rd international conference on principles of knowledge representation and reasoning (KR). Morgan Kaufmann, pp 103–114Richter S, Westphal M (2010) The LAMA planner: guiding cost-based anytime planning with landmarks. J Artif Intell Res 39(1):127–177Sycara K, Pannu A (1998) The RETSINA multiagent system (video session): towards integrating planning, execution and information gathering. In: Proceedings of the 2nd international conference on autonomous agents (Agents). ACM, pp 350–351Tambe M (1997) Towards flexible teamwork. J Artif Intell Res 7:83–124Tang Y, Norman T, Parsons S (2010) A model for integrating dialogue and the execution of joint plans. Argumentation in multi-agent systems, pp 60–78Tonino H, Bos A, de Weerdt M, Witteveen C (2002) Plan coordination by revision in collective agent based systems. Artif Intell 142(2):121–145Van Der Krogt R, De Weerdt M (2005), Plan repair as an extension of planning. In: Proceedings of the 15th international conference on automated planning and scheduling (ICAPS). pp 161–170Weld D (1994) An introduction to least commitment planning. AI Mag 15(4):27Weld D (1999) Recent advances in AI planning. AI Mag 20(2):93–123Wilkins D, Myers K (1998) A multiagent planning architecture. In: Proceedings of the 4th international conference on artificial intelligence planning systems (AIPS), pp 154–162Wu F, Zilberstein S, Chen X (2011) Online planning for multi-agent systems with bounded communication. Artif Intell 175(2):487–511Younes H, Simmons R (2003) VHPOP: versatile heuristic partial order planner. J Artif Intell Res 20: 405–430Zhang J, Nguyen X, Kowalczyk R (2007) Graph-based multi-agent replanning algorithm. In: Proceedings of the 6th conference on autonomous agents and multiagent systems (AAMAS

Crossref

RiuNet

Arguing with behavior influence: A model for web-based group decision support systems

Author: Allbeck J.
Aronson J. E.
Ball G.
Carneiro J.
Carneiro J.
Diogo Martinho
Fan X.
Gmytrasiewicz P. J.
Goreti Marreiros
João Carneiro
Lewicki R. J.
Lunenburg F. C.
Luthans F.
Marakas G. M.
Ortony A.
Padgham L.
Paulo Novais
Santos R.
Santos R.
Velsquez J.
Walton D.
Zamfirescu C.-B.
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 01/01/2019
Field of study

In this work, we propose an argumentation-based dialogue model designed for Web-based Group Decision Support Systems, that considers the decision-makers' intentions. The intentions are modeled as behavior styles which allow agents to interact with each other as humans would in face-to-face meetings. In addition, we propose a set of arguments that can be used by the agents to perform and evaluate requests, while considering the agents' behavior style. The inclusion of decision-makers' intentions intends to create a more reliable and realistic process. Our model proved, in different contexts, that higher levels of consensus and satisfaction are achieved when using agents modeled with behavior styles compared to agents without any features to represent the decision-makers' intentions.- (undefined

Universidade do Minho: RepositoriUM

Crossref

Can bounded and self-interested agents be teammates? Application to planning in ad hoc teams

Author: A Brandenburger
B Goodwine
C Boutilier
C Camerer
C Guestrin
CF Camerer
D Koller
DS Bernstein
DV Pynadath
E Kalai
GW Brown
I Gilboa
J Mertens
JA Tatman
JC Harsanyi
K Binmore
L Panait
M Bowling
Muthukumaran Chandrasekaran
P Doshi
P Doshi
P Gmytrasiewicz
Prashant Doshi
R Nair
R Wageman
RJ Aumann
S Seuken
Y Zeng
Yifeng Zeng
Yingke Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 23/11/2016
Field of study

Planning for ad hoc teamwork is challenging because it involves agents collaborating without any prior coordination or communication. The focus is on principled methods for a single agent to cooperate with others. This motivates investigating the ad hoc teamwork problem in the context of self-interested decision-making frameworks. Agents engaged in individual decision making in multiagent settings face the task of having to reason about other agents’ actions, which may in turn involve reasoning about others. An established approximation that operationalizes this approach is to bound the infinite nesting from below by introducing level 0 models. For the purposes of this study, individual, self-interested decision making in multiagent settings is modeled using interactive dynamic influence diagrams (I-DID). These are graphical models with the benefit that they naturally offer a factored representation of the problem, allowing agents to ascribe dynamic models to others and reason about them. We demonstrate that an implication of bounded, finitely-nested reasoning by a self-interested agent is that we may not obtain optimal team solutions in cooperative settings, if it is part of a team. We address this limitation by including models at level 0 whose solutions involve reinforcement learning. We show how the learning is integrated into planning in the context of I-DIDs. This facilitates optimal teammate behavior, and we demonstrate its applicability to ad hoc teamwork on several problem domains and configurations

Northumbria Research Link

Crossref

Teeside University's Research Repository

Human Behavior Models for Agents in Simulators and Games: Part I: Enabling Science with PMFserv

Author: Badler N.
Barry G. Silverman
Cornwell J. B.
Gillis P. D.
Gmytrasiewicz P.
Jason Cornwell
Kevin O'Brien
Laird J.
Lustick I. S.
Michael Johns
Silverman B. G.
Silverman B. G.
Silverman B. G.
Tambe M.
Publication venue: 'MIT Press - Journals'
Publication date
Field of study

Crossref

Formal models and algorithms for decentralized decision making under uncertainty

Author: C.H. Papadimitriou
C.H. Papadimitriou
C.H. Papadimitriou
C.V. Goldman
C.V. Goldman
D.P. Farias de
D.S. Bernstein
D.V. Pynadath
E. Kalai
I. Suzuki
K.J. Aström
L.P. Kaelbling
M. Tambe
M.J. Osborne
M.L. Puterman
O. Madani
P.J. Gmytrasiewicz
S. Russell
Shlomo Zilberstein
Sven Seuken
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Gaining Competitive Advantage Through Learning Agent Models

Author: D. Suryadi
D. Zeng
J. Vidal
L. Garrido
M. Tambe
P. Gmytrasiewicz
P. Gmytrasiewicz
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2004
Field of study

Crossref

Agent Modeling in Antiair Defense

Author: A Jameson
AS Rao
EH Durfee
J Pearl
KL Poh
KL Poh
M Tambe
MP Wellman
PJ Gmytrasiewicz
PJ Gmytrasiewicz
S Sen
SJ Russell
T Kellogg
Y Mor
Publication venue: SpringerVerlag
Publication date: 01/01/1997
Field of study

. This research addresses rational decision making and coordination among antiair units whose mission is to defend a specified territory from a number of attacking missiles. The automated units have to decide which missiles to attempt to intercept, given the characteristics of the threat, and given the other units' anticipated actions, in their attempt to minimize the expected overall damages to the defended territory. Thus, an automated defense unit needs to model the other agents, either human or automated, that control the other defense batteries. For the purpose of this case study, we assume that the units cannot communicate among themselves, say, due to an imposed radio silence. We use the Recursive Modeling Method (RMM), which enables an agent to select his rational action by examining the expected utility of his alternative behaviors, and to coordinate with other agents by modeling their decision making in a distributed multiagent environment. We describe how decision making usi..

CiteSeerX

Crossref

On Self-adaptive Resource Allocation through Reinforcement Learning

Author: D. Sciuto
F. Sironi
G. Beltrame
J. Panerati
M. Carminati
M. D. Santambrogio
M. Maggio
P. J. Gmytrasiewicz
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2013
Field of study

Autonomic computing was proposed as a promising solution to overcome the complexity of modern systems, which is causing management operations to become increasingly difficult for human beings. This work proposes the Adaptation Manager, a comprehensive framework to implement autonomic managers capable of pursuing some of the objectives of autonomic computing (i.e., self-optimization and self-healing). The Adaptation Manager features an active performance monitoring infrastructure and two dynamic knobs to tune the scheduling decisions of an operating system and the working frequency of cores. The Adaptation Manager exploits artificial intelligence and reinforcement learning to close the Monitor-Plan-Analyze- Execute with Knowledge adaptation loop at the very base of every autonomic manager. We evaluate the Adaptation Manager, and especially the adaptation policies it learns by means of reinforcement learning, using a set of representative applications for multicore processors and show the effectiveness of our prototype on commodity computing systems

Archivio istituzionale della ricerca - Politecnico di Milano

Lund University Publications

Crossref

PolyPublie

Sentiment Variations in Text for Persuasion Technology

Author: E. Reiter
F. Grasso
F. Rosis de
H. Hernault
H. Lee
J.A.A. Sillince
M. Guerini
M. Kaptein
M. Kaptein
P.J. Gmytrasiewicz
R. Giora
S.T. Dumais
Publication venue
Publication date: 01/01/2014
Field of study

Crossref

Archivio della ricerca - Fondazione Bruno Kessler