Exploratory Control with Tsallis Entropy for Latent Factor Models
We study optimal control in models with latent factors where the agent controls the distribution over actions, rather than the actions themselves, in both discrete and continuous time. To encourage exploration of the state space, we reward exploration with Tsallis entropy and derive the optimal distribution over states, which we prove is q-Gaussian distributed with location characterized through the solution of a BSΔE and a BSDE in discrete and continuous time, respectively. We discuss the relation between the solutions of the optimal exploration problems and the standard dynamic optimal control solution. Finally, we develop the optimal policy in a model-agnostic setting along the lines of soft Q-learning. The approach may be applied in, e.g., developing more robust statistical arbitrage trading strategies.
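As a concrete reference point for the regulariser used above, here is a minimal sketch (not from the paper; function names and the q-exponential parametrisation are the standard textbook forms) of the discrete Tsallis entropy and the q-exponential that underlies q-Gaussian densities:

```python
import numpy as np

def tsallis_entropy(p, q):
    """Discrete Tsallis entropy S_q(p) = (1 - sum_i p_i^q) / (q - 1).

    Recovers the Shannon entropy in the limit q -> 1.
    """
    p = np.asarray(p, dtype=float)
    if np.isclose(q, 1.0):
        nz = p[p > 0]                      # Shannon limit, skip zero entries
        return float(-np.sum(nz * np.log(nz)))
    return float((1.0 - np.sum(p ** q)) / (q - 1.0))

def q_exponential(u, q):
    """Tsallis q-exponential e_q(u); reduces to exp(u) as q -> 1.

    An unnormalised q-Gaussian density is q_exponential(-x**2, q).
    """
    if np.isclose(q, 1.0):
        return np.exp(u)
    base = np.maximum(1.0 + (1.0 - q) * u, 0.0)  # truncation keeps it real
    return base ** (1.0 / (1.0 - q))

uniform = np.ones(4) / 4
s2 = tsallis_entropy(uniform, q=2.0)       # = 1 - 4 * (1/4)^2 = 0.75
```

For the uniform distribution on four points, S_2 = 1 - 4·(1/16) = 0.75, while the q → 1 case returns ln 4.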
LIPIcs, Volume 251, ITCS 2023, Complete Volume
Graphical Nonlinear System Analysis
We use the recently introduced concept of a Scaled Relative Graph (SRG) to
develop a graphical analysis of input-output properties of feedback systems.
The SRG of a nonlinear operator generalizes the Nyquist diagram of an LTI
system. In the spirit of classical control theory, important robustness
indicators of nonlinear feedback systems are measured as distances between
SRGs.
Mean-field games of speedy information access with observation costs
We investigate a mean-field game (MFG) in which agents can exercise control
actions that affect their speed of access to information. The agents can
dynamically decide to receive observations with less delay by paying higher
observation costs. Agents seek to exploit their active information gathering by
making further decisions to influence their state dynamics to maximize rewards.
In the mean field equilibrium, each generic agent solves individually a
partially observed Markov decision problem in which the way partial
observations are obtained is itself subject to dynamic control actions by
the agent. Based on a finite characterisation of the agents' belief states, we
show how the mean field game with controlled costly information access can be
formulated as an equivalent standard mean field game on a suitably augmented
but finite state space. We prove that, with sufficient entropy regularisation,
a fixed point iteration converges to the unique MFG equilibrium and yields an
ε-Nash equilibrium for a large but finite population size.
We illustrate our MFG by an example from epidemiology, where medical testing
results at different speeds and costs can be chosen by the agents.
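The entropy-regularised fixed-point iteration mentioned in the abstract can be illustrated on a generic finite-action toy model. Everything below (the softmax best response, the congestion-style reward, the damping) is an illustrative assumption, not the paper's construction:

```python
import numpy as np

def entropy_regularized_br(rewards, tau):
    """Softmax (entropy-regularised) best response to a reward vector."""
    z = rewards / tau
    z = z - z.max()                       # numerical stabilisation
    e = np.exp(z)
    return e / e.sum()

def solve_toy_mfg(reward_fn, n_actions, tau=0.5, damping=0.5,
                  tol=1e-10, max_iter=1000):
    """Damped fixed-point iteration: mean field -> best response -> mean field."""
    m = np.ones(n_actions) / n_actions    # initial mean field
    for _ in range(max_iter):
        pi = entropy_regularized_br(reward_fn(m), tau)
        m_new = (1.0 - damping) * m + damping * pi
        if np.max(np.abs(m_new - m)) < tol:
            return m_new
        m = m_new
    return m

# illustrative congestion-style reward: a base payoff minus crowding
reward_fn = lambda m: np.array([1.0, 0.0]) - m
m_star = solve_toy_mfg(reward_fn, n_actions=2)
```

At the fixed point, the mean field coincides with the representative agent's entropy-regularised best response to it, which is the defining consistency condition of an MFG equilibrium.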
Control of McKean--Vlasov SDEs with Contagion Through Killing at a State-Dependent Intensity
We consider a novel McKean--Vlasov control problem with contagion through
killing of particles and common noise. Each particle is killed at an
exponential rate according to an intensity process that increases whenever the
particle is located in a specific region. The removal of a particle pushes
others towards the removal region, which can trigger cascades that see
particles exiting the system in rapid succession. We study the control of such
a system by a central agent who intends to preserve particles at minimal cost.
Our theoretical contribution is twofold. Firstly, we rigorously justify the
McKean--Vlasov control problem as the limit of a corresponding controlled
finite particle system. Our proof is based on a controlled martingale problem
and tightness arguments. Secondly, we connect our framework with models in
which particles are killed once they hit the boundary of the removal region. We
show that these models appear in the limit as the exponential rate tends to
infinity. As a corollary, we obtain new existence results for McKean--Vlasov
SDEs with singular interaction through hitting times which extend those in the
established literature. We conclude the paper with numerical investigations of
our model applied to government control of systemic risk in financial systems.
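An uncontrolled toy simulation of the killing mechanism described above can be sketched as follows. The removal region x ≤ -1, the intensity rate, and the linear contagion push are all illustrative assumptions, not the paper's specification:

```python
import numpy as np

def simulate(n=500, T=1.0, dt=1e-3, barrier=-1.0, kappa=5.0, push=0.5, seed=0):
    """Euler scheme for a toy particle system with contagion through killing.

    Each particle follows a Brownian motion, accrues killing intensity kappa
    while below `barrier`, is removed once its accumulated intensity exceeds
    an independent exponential clock, and every removal pushes the survivors
    down by push/n (the contagion feedback). Returns the surviving fraction.
    """
    rng = np.random.default_rng(seed)
    x = np.zeros(n)                       # particle positions
    lam = np.zeros(n)                     # accumulated intensity
    clocks = rng.exponential(1.0, n)      # exponential killing thresholds
    alive = np.ones(n, dtype=bool)
    for _ in range(int(T / dt)):
        x[alive] += rng.normal(0.0, np.sqrt(dt), n)[alive]
        lam[alive] += kappa * dt * (x[alive] <= barrier)
        killed = alive & (lam >= clocks)
        if killed.any():
            alive &= ~killed
            x[alive] -= push * killed.sum() / n   # contagion push on survivors
    return alive.mean()

surv = simulate()
```

Letting kappa grow large approximates the hard-killing regime discussed in the abstract, where particles are removed on hitting the removal region.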
Solvability of nonlinear elliptic boundary value problems
This dissertation focuses on the study of steady states of reaction-diffusion problems that are motivated by applications. In particular, we focus on elliptic boundary value problems where the nonlinear reaction may appear in the interior or on the boundary of a domain in Euclidean space. First, we study linear elliptic problems with a nonlinear reaction on the boundary. In this case, we establish the existence of maximal and minimal solutions for both monotone and non-monotone cases, and we then extend these results to systems. Next, we prove existence, nonexistence, multiplicity and global bifurcation results for positive solutions of superlinear problems. To support our analytical results, we numerically approximate solutions using finite difference methods, including an existence and stability analysis. Second, we study problems that are nonlinear inside the domain and linear on the boundary, in the context of a model arising in mathematical ecology. To begin with, we perform computational simulations for the problem in the one-dimensional setting. Then, motivated by the resulting bifurcation diagrams, we prove several analytical results such as existence, uniqueness and nonexistence.
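The finite-difference approach mentioned for the numerical part can be sketched on a one-dimensional model problem. The Bratu-type nonlinearity, grid size, and Newton tolerances below are illustrative assumptions, not the dissertation's actual problems:

```python
import numpy as np

def solve_bvp(f, fprime, lam, n=100, iters=50, tol=1e-12):
    """Newton's method for the finite-difference discretisation of
    -u'' = lam * f(u) on (0, 1) with u(0) = u(1) = 0."""
    h = 1.0 / (n + 1)
    # standard second-order tridiagonal approximation of -u'' on interior nodes
    A = (2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)) / h**2
    u = np.zeros(n)
    for _ in range(iters):
        residual = A @ u - lam * f(u)
        J = A - lam * np.diag(fprime(u))      # Jacobian of the residual
        du = np.linalg.solve(J, residual)
        u -= du
        if np.max(np.abs(du)) < tol:
            break
    return u

# Bratu-type nonlinearity f(u) = e^u; lam = 1 sits on the small-solution branch
u = solve_bvp(np.exp, np.exp, lam=1.0)
```

Starting Newton from different initial guesses (or continuing in lam) is the usual way such solvers trace out the bifurcation diagrams mentioned above.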
Approximating the set of Nash equilibria for convex games
In Feinstein and Rudloff (2023), it was shown that the set of Nash equilibria
for any non-cooperative N-player game coincides with the set of Pareto
optimal points of a certain vector optimization problem with non-convex
ordering cone. To avoid dealing with a non-convex ordering cone, an equivalent
characterization of the set of Nash equilibria as the intersection of the
Pareto optimal points of multi-objective problems (i.e., with the natural
ordering cone) is proven. So far, algorithms to compute the exact set of Pareto
optimal points of a multi-objective problem exist only for the class of linear
problems, which reduces the possibility of finding the true set of Nash
equilibria by those algorithms to linear games only.
In this paper, we will consider the larger class of convex games. As,
typically, only approximate solutions can be computed for convex vector
optimization problems, we first show, in total analogy to the result above,
that the set of ε-approximate Nash equilibria can be characterized by
the intersection of ε-approximate Pareto optimal points for convex
multi-objective problems. Then, we propose an algorithm based on results from
vector optimization and convex projections that allows for the computation of a
set that, on one hand, contains the set of all true Nash equilibria, and is, on
the other hand, contained in the set of ε-approximate Nash equilibria.
In addition to the joint convexity of the cost function for each player, this
algorithm works provided the players are restricted by either shared polyhedral
constraints or independent convex constraints.
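The ε-approximate Nash property that the computed set is sandwiched between can be checked directly from its definition. The sketch below uses a hypothetical two-player quadratic game with closed-form best responses; all names and parameters are illustrative:

```python
import numpy as np

def is_eps_nash(costs, best_responses, x, eps):
    """Return True if no player can lower their own cost by more than eps
    through a unilateral deviation (checked here against an exact best
    response supplied for each player)."""
    for i, (cost, br) in enumerate(zip(costs, best_responses)):
        y = x.copy()
        y[i] = br(x)                      # player i deviates optimally
        if cost(x) - cost(y) > eps:
            return False
    return True

# two-player quadratic game: c_i(x) = (x_i - 0.5 * x_{-i})^2, convex in x_i
costs = [lambda x: (x[0] - 0.5 * x[1]) ** 2,
         lambda x: (x[1] - 0.5 * x[0]) ** 2]
brs = [lambda x: 0.5 * x[1],
       lambda x: 0.5 * x[0]]

# (0, 0) is the unique Nash equilibrium of this game
at_equilibrium = is_eps_nash(costs, brs, np.array([0.0, 0.0]), eps=1e-9)
```

In the convex setting of the paper, the best responses would themselves come from solving per-player convex programs rather than closed-form formulas.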
Best-Response Dynamics in Tullock Contests with Convex Costs
We study the convergence of best-response dynamics in Tullock contests with
convex cost functions (these games always have a unique pure-strategy Nash
equilibrium). We show that best-response dynamics rapidly converges to the
equilibrium for homogeneous agents. For two homogeneous agents, we show
convergence to an ε-approximate equilibrium in steps. For more than two
agents, the dynamics is not unique, because at each step multiple agents can
make non-trivial moves. We consider the model proposed by Ghosh and Goldberg
(2023), where the agent making the move is randomly selected at each time
step. We show convergence to an ε-approximate equilibrium in steps with a
probability governed by a parameter of the agent selection process, e.g.,
when agents are selected uniformly at random at each time step. We complement
this result with a lower bound applicable to any agent selection process.

Comment: 43 pages. WINE '23 version.
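The random-mover best-response dynamics can be sketched on a toy homogeneous instance. The unit prize, quadratic cost, and all parameters below are illustrative assumptions; with cost c(x) = x² and n agents, the unique symmetric equilibrium effort is sqrt((n-1)/(2n²)):

```python
import random

def best_response(s, cost, lo=0.0, hi=2.0, iters=100):
    """Maximise the Tullock payoff x/(x+s) - cost(x) over [lo, hi] by
    ternary search; the objective is concave for convex cost and s > 0."""
    payoff = lambda x: x / (x + s) - cost(x)
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3.0
        m2 = hi - (hi - lo) / 3.0
        if payoff(m1) < payoff(m2):
            lo = m1
        else:
            hi = m2
    return 0.5 * (lo + hi)

def br_dynamics(n=4, steps=3000, cost=lambda x: x * x, seed=0):
    """At each step a uniformly random agent best-responds to the others
    (the random agent-selection model of Ghosh and Goldberg, 2023)."""
    rng = random.Random(seed)
    x = [0.1] * n                         # initial efforts
    for _ in range(steps):
        i = rng.randrange(n)              # uniformly random mover
        x[i] = best_response(sum(x) - x[i], cost)
    return x

efforts = br_dynamics()
```

For n = 4 the efforts settle near sqrt(3/32) ≈ 0.306, the symmetric pure-strategy equilibrium of this toy instance.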
Improved guarantees for optimal Nash equilibrium seeking and bilevel variational inequalities
We consider a class of hierarchical variational inequality (VI) problems that
subsumes VI-constrained optimization and several other important problem
classes including the optimal solution selection problem, the optimal Nash
equilibrium (NE) seeking problem, and the generalized NE seeking problem. Our
main contributions are threefold. (i) We consider bilevel VIs with merely
monotone and Lipschitz continuous mappings and devise a single-timescale
iteratively regularized extragradient method (IR-EG). We improve the existing
iteration complexity results for addressing both bilevel VI and VI-constrained
convex optimization problems. (ii) Under the strong monotonicity of the outer
level mapping, we develop a variant of IR-EG, called R-EG, and derive
significantly faster guarantees than those in (i). These results appear to be
new for both bilevel VIs and VI-constrained optimization. (iii) To our
knowledge, complexity guarantees for computing the optimal NE in nonconvex
settings do not exist. Motivated by this lacuna, we consider VI-constrained
nonconvex optimization problems and devise an inexactly-projected gradient
method, called IPR-EG, where the projection onto the unknown set of equilibria
is performed using R-EG with prescribed adaptive termination criterion and
regularization parameters. We obtain new complexity guarantees in terms of a
residual map and an infeasibility metric for computing a stationary point. We
validate the theoretical findings using preliminary numerical experiments for
computing the best and the worst Nash equilibria.
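The extragradient scheme underlying the methods above can be illustrated in its basic (Korpelevich) form; this sketch is the generic method, not the paper's iteratively regularized IR-EG or R-EG variants, and the rotation operator is an illustrative example of a merely monotone mapping:

```python
import numpy as np

def extragradient(F, proj, x0, gamma=0.1, iters=2000):
    """Korpelevich's extragradient method for the monotone VI:
    find x* with <F(x*), x - x*> >= 0 for all feasible x."""
    x = np.asarray(x0, dtype=float)
    for _ in range(iters):
        y = proj(x - gamma * F(x))        # extrapolation (look-ahead) step
        x = proj(x - gamma * F(y))        # update step uses the look-ahead point
    return x

# a merely monotone (rotation) operator: plain forward steps x - gamma*F(x)
# spiral outward, while the extragradient iterates contract to x* = 0
A = np.array([[0.0, 1.0], [-1.0, 0.0]])
F = lambda z: A @ z
proj = lambda z: z                        # unconstrained problem
x_star = extragradient(F, proj, x0=[1.0, 1.0])
```

The look-ahead evaluation of F is exactly what rescues convergence for merely monotone operators, the setting the paper's regularized variants build on.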
Reinforcement learning in large state action spaces
Reinforcement learning (RL) is a promising framework for training intelligent agents which learn to optimize long-term utility by directly interacting with the environment. Creating RL methods which scale to large state-action spaces is a critical problem towards ensuring real-world deployment of RL systems. However, several challenges limit the applicability of RL to large-scale settings. These include difficulties with exploration, low sample efficiency, computational intractability, task constraints like decentralization, and a lack of guarantees about important properties like performance, generalization and robustness in potentially unseen scenarios.
This thesis is motivated towards bridging the aforementioned gap. We propose several principled algorithms and frameworks for studying and addressing the above challenges in RL. The proposed methods cover a wide range of RL settings (single- and multi-agent systems (MAS) with all the variations in the latter, prediction and control, model-based and model-free methods, value-based and policy-based methods). In this work we propose the first results on several different problems, e.g., tensorization of the Bellman equation, which allows exponential sample-efficiency gains (Chapter 4); provable suboptimality arising from structural constraints in MAS (Chapter 3); combinatorial generalization results in cooperative MAS (Chapter 5); generalization results on observation shifts (Chapter 7); and learning deterministic policies in a probabilistic RL framework (Chapter 6). Our algorithms exhibit provably enhanced performance and sample efficiency along with better scalability. Additionally, we shed light on generalization aspects of the agents under different frameworks. These properties have been driven by the use of several advanced tools (e.g., statistical machine learning, state abstraction, variational inference, tensor theory).
In summary, the contributions in this thesis significantly advance progress towards making RL agents ready for large-scale, real-world applications.