4,675 research outputs found
Statics and dynamics of selfish interactions in distributed service systems
We study a class of games which model the competition among agents to access
some service provided by distributed service units and which exhibit congestion
and frustration phenomena when service units have limited capacity. We propose
a technique, based on the cavity method of statistical physics, to characterize
the full spectrum of Nash equilibria of the game. The analysis reveals a large
variety of equilibria, with very different statistical properties. Natural
selfish dynamics, such as best-response, usually tend to large-utility
equilibria, even though those of smaller utility are exponentially more
numerous. Interestingly, the latter actually can be reached by selecting the
initial conditions of the best-response dynamics close to the saturation limit
of the service unit capacities. We also study a more realistic stochastic
variant of the game by means of a simple and effective approximation of the
average over the random parameters, showing that the properties of the
average-case Nash equilibria are qualitatively similar to the deterministic
ones.Comment: 30 pages, 10 figure
Deep Reinforcement Learning from Self-Play in Imperfect-Information Games
Many real-world applications can be described as large-scale games of
imperfect information. To deal with these challenging domains, prior work has
focused on computing Nash equilibria in a handcrafted abstraction of the
domain. In this paper we introduce the first scalable end-to-end approach to
learning approximate Nash equilibria without prior domain knowledge. Our method
combines fictitious self-play with deep reinforcement learning. When applied to
Leduc poker, Neural Fictitious Self-Play (NFSP) approached a Nash equilibrium,
whereas common reinforcement learning methods diverged. In Limit Texas Holdem,
a poker game of real-world scale, NFSP learnt a strategy that approached the
performance of state-of-the-art, superhuman algorithms based on significant
domain expertise.Comment: updated version, incorporating conference feedbac
A stochastic approximation algorithm for stochastic semidefinite programming
Motivated by applications to multi-antenna wireless networks, we propose a
distributed and asynchronous algorithm for stochastic semidefinite programming.
This algorithm is a stochastic approximation of a continous- time matrix
exponential scheme regularized by the addition of an entropy-like term to the
problem's objective function. We show that the resulting algorithm converges
almost surely to an -approximation of the optimal solution
requiring only an unbiased estimate of the gradient of the problem's stochastic
objective. When applied to throughput maximization in wireless multiple-input
and multiple-output (MIMO) systems, the proposed algorithm retains its
convergence properties under a wide array of mobility impediments such as user
update asynchronicities, random delays and/or ergodically changing channels.
Our theoretical analysis is complemented by extensive numerical simulations
which illustrate the robustness and scalability of the proposed method in
realistic network conditions.Comment: 25 pages, 4 figure
Stochastic Optimization of Service Provision with Selfish Users
We develop a computationally efficient technique to solve a fairly general
distributed service provision problem with selfish users and imperfect
information. In particular, in a context in which the service capacity of the
existing infrastructure can be partially adapted to the user load by activating
just some of the service units, we aim at finding the configuration of active
service units that achieves the best trade-off between maintenance (e.g.\
energetic) costs for the provider and user satisfaction. The core of our
technique resides in the implementation of a belief-propagation (BP) algorithm
to evaluate the cost configurations. Numerical results confirm the
effectiveness of our approach.Comment: paper presented at NETSTAT Workshop, Budapest - June 201
Uniform exponential convergence of sample average random functions under general sampling with applications in stochastic programming
AbstractSample average approximation (SAA) is one of the most popular methods for solving stochastic optimization and equilibrium problems. Research on SAA has been mostly focused on the case when sampling is independent and identically distributed (iid) with exceptions (Dai et al. (2000) [9], Homem-de-Mello (2008) [16]). In this paper we study SAA with general sampling (including iid sampling and non-iid sampling) for solving nonsmooth stochastic optimization problems, stochastic Nash equilibrium problems and stochastic generalized equations. To this end, we first derive the uniform exponential convergence of the sample average of a class of lower semicontinuous random functions and then apply it to a nonsmooth stochastic minimization problem. Exponential convergence of estimators of both optimal solutions and M-stationary points (characterized by Mordukhovich limiting subgradients (Mordukhovich (2006) [23], Rockafellar and Wets (1998) [32])) are established under mild conditions. We also use the unform convergence result to establish the exponential rate of convergence of statistical estimators of a stochastic Nash equilibrium problem and estimators of the solutions to a stochastic generalized equation problem
- …