Search CORE

1,179 research outputs found

Learning the Structure and Parameters of Large-Population Graphical Games from Behavioral Data

Author: Honorio Jean
Ortiz Luis
Publication venue
Publication date: 03/05/2015
Field of study

We consider learning, from strictly behavioral data, the structure and parameters of linear influence games (LIGs), a class of parametric graphical games introduced by Irfan and Ortiz (2014). LIGs facilitate causal strategic inference (CSI): Making inferences from causal interventions on stable behavior in strategic settings. Applications include the identification of the most influential individuals in large (social) networks. Such tasks can also support policy-making analysis. Motivated by the computational work on LIGs, we cast the learning problem as maximum-likelihood estimation (MLE) of a generative model defined by pure-strategy Nash equilibria (PSNE). Our simple formulation uncovers the fundamental interplay between goodness-of-fit and model complexity: good models capture equilibrium behavior within the data while controlling the true number of equilibria, including those unobserved. We provide a generalization bound establishing the sample complexity for MLE in our framework. We propose several algorithms including convex loss minimization (CLM) and sigmoidal approximations. We prove that the number of exact PSNE in LIGs is small, with high probability; thus, CLM is sound. We illustrate our approach on synthetic data and real-world U.S. congressional voting records. We briefly discuss our learning framework's generality and potential applicability to general graphical games.Comment: Journal of Machine Learning Research. (accepted, pending publication.) Last conference version: submitted March 30, 2012 to UAI 2012. First conference version: entitled, Learning Influence Games, initially submitted on June 1, 2010 to NIPS 201

arXiv.org e-Print Archive

CiteSeerX

Structure in the Value Function of Two-Player Zero-Sum Games of Incomplete Information

Author: Oliehoek Frans A.
Roijers Diederik M.
Wiggers Auke J.
Publication venue
Publication date: 01/01/2016
Field of study

Zero-sum stochastic games provide a rich model for competitive decision making. However, under general forms of state uncertainty as considered in the Partially Observable Stochastic Game (POSG), such decision making problems are still not very well understood. This paper makes a contribution to the theory of zero-sum POSGs by characterizing structure in their value function. In particular, we introduce a new formulation of the value function for zs-POSGs as a function of the "plan-time sufficient statistics" (roughly speaking the information distribution in the POSG), which has the potential to enable generalization over such information distributions. We further delineate this generalization capability by proving a structural result on the shape of value function: it exhibits concavity and convexity with respect to appropriately chosen marginals of the statistic space. This result is a key pre-cursor for developing solution methods that may be able to exploit such structure. Finally, we show how these results allow us to reduce a zs-POSG to a "centralized" model with shared observations, thereby transferring results for the latter, narrower class, to games with individual (private) observations

arXiv.org e-Print Archive

University of Liverpool Repository

UvA-DARE

International Migration, Integration and Social Cohesion online publications

Predicting the expected behavior of agents that learn about agents: the CLRI framework

Author: Durfee Edmund H.
Vidal Jose M.
Publication venue
Publication date: 01/01/2002
Field of study

We describe a framework and equations used to model and predict the behavior of multi-agent systems (MASs) with learning agents. A difference equation is used for calculating the progression of an agent's error in its decision function, thereby telling us how the agent is expected to fare in the MAS. The equation relies on parameters which capture the agent's learning abilities, such as its change rate, learning rate and retention rate, as well as relevant aspects of the MAS such as the impact that agents have on each other. We validate the framework with experimental results using reinforcement learning agents in a market system, as well as with other experimental results gathered from the AI literature. Finally, we use PAC-theory to show how to calculate bounds on the values of the learning parameters

arXiv.org e-Print Archive

CiteSeerX

Scholar Commons - Institutional Repository of the University of South Carolina

Deep Blue Documents at the University of Michigan

If multi-agent learning is the answer, what is the question?

Author: Rob Powers
Trond Grenager
Yoav Shoham
Publication venue
Publication date
Field of study

Research Papers in Economics

A Projective Simulation Scheme for Partially-Observable Multi-Agent Systems

Author: Kheiri Rasoul
Publication venue
Publication date: 22/05/2019
Field of study

We introduce a kind of partial observability to the projective simulation (PS) learning method. It is done by adding a belief projection operator and an observability parameter to the original framework of the efficiency of the PS model. I provide theoretical formulations, network representations, and situated scenarios derived from the invasion toy problem as a starting point for some multi-agent PS models.Comment: 28 pages, 21 figure

arXiv.org e-Print Archive