345 research outputs found
Dynamic Non-Bayesian Decision Making
The model of a non-Bayesian agent who faces a repeated game with incomplete
information against Nature is an appropriate tool for modeling general
agent-environment interactions. In such a model the environment state
(controlled by Nature) may change arbitrarily, and the feedback/reward function
is initially unknown. The agent is not Bayesian, that is he does not form a
prior probability neither on the state selection strategy of Nature, nor on his
reward function. A policy for the agent is a function which assigns an action
to every history of observations and actions. Two basic feedback structures are
considered. In one of them -- the perfect monitoring case -- the agent is able
to observe the previous environment state as part of his feedback, while in the
other -- the imperfect monitoring case -- all that is available to the agent is
the reward obtained. Both of these settings refer to partially observable
processes, where the current environment state is unknown. Our main result
refers to the competitive ratio criterion in the perfect monitoring case. We
prove the existence of an efficient stochastic policy that ensures that the
competitive ratio is obtained at almost all stages with an arbitrarily high
probability, where efficiency is measured in terms of rate of convergence. It
is further shown that such an optimal policy does not exist in the imperfect
monitoring case. Moreover, it is proved that in the perfect monitoring case
there does not exist a deterministic policy that satisfies our long run
optimality criterion. In addition, we discuss the maxmin criterion and prove
that a deterministic efficient optimal strategy does exist in the imperfect
monitoring case under this criterion. Finally we show that our approach to
long-run optimality can be viewed as qualitative, which distinguishes it from
previous work in this area.Comment: See http://www.jair.org/ for any accompanying file
Smooth Inequalities and Equilibrium Inefficiency in Scheduling Games
We study coordination mechanisms for Scheduling Games (with unrelated
machines). In these games, each job represents a player, who needs to choose a
machine for its execution, and intends to complete earliest possible. Our goal
is to design scheduling policies that always admit a pure Nash equilibrium and
guarantee a small price of anarchy for the l_k-norm social cost --- the
objective balances overall quality of service and fairness. We consider
policies with different amount of knowledge about jobs: non-clairvoyant,
strongly-local and local. The analysis relies on the smooth argument together
with adequate inequalities, called smooth inequalities. With this unified
framework, we are able to prove the following results.
First, we study the inefficiency in l_k-norm social costs of a strongly-local
policy SPT and a non-clairvoyant policy EQUI. We show that the price of anarchy
of policy SPT is O(k). We also prove a lower bound of Omega(k/log k) for all
deterministic, non-preemptive, strongly-local and non-waiting policies
(non-waiting policies produce schedules without idle times). These results
ensure that SPT is close to optimal with respect to the class of l_k-norm
social costs. Moreover, we prove that the non-clairvoyant policy EQUI has price
of anarchy O(2^k).
Second, we consider the makespan (l_infty-norm) social cost by making
connection within the l_k-norm functions. We revisit some local policies and
provide simpler, unified proofs from the framework's point of view. With the
highlight of the approach, we derive a local policy Balance. This policy
guarantees a price of anarchy of O(log m), which makes it the currently best
known policy among the anonymous local policies that always admit a pure Nash
equilibrium.Comment: 25 pages, 1 figur
On the Approximation Performance of Fictitious Play in Finite Games
We study the performance of Fictitious Play, when used as a heuristic for
finding an approximate Nash equilibrium of a 2-player game. We exhibit a class
of 2-player games having payoffs in the range [0,1] that show that Fictitious
Play fails to find a solution having an additive approximation guarantee
significantly better than 1/2. Our construction shows that for n times n games,
in the worst case both players may perpetually have mixed strategies whose
payoffs fall short of the best response by an additive quantity 1/2 -
O(1/n^(1-delta)) for arbitrarily small delta. We also show an essentially
matching upper bound of 1/2 - O(1/n)
On the Structure of Equilibria in Basic Network Formation
We study network connection games where the nodes of a network perform edge
swaps in order to improve their communication costs. For the model proposed by
Alon et al. (2010), in which the selfish cost of a node is the sum of all
shortest path distances to the other nodes, we use the probabilistic method to
provide a new, structural characterization of equilibrium graphs. We show how
to use this characterization in order to prove upper bounds on the diameter of
equilibrium graphs in terms of the size of the largest -vicinity (defined as
the the set of vertices within distance from a vertex), for any
and in terms of the number of edges, thus settling positively a conjecture of
Alon et al. in the cases of graphs of large -vicinity size (including graphs
of large maximum degree) and of graphs which are dense enough.
Next, we present a new swap-based network creation game, in which selfish
costs depend on the immediate neighborhood of each node; in particular, the
profit of a node is defined as the sum of the degrees of its neighbors. We
prove that, in contrast to the previous model, this network creation game
admits an exact potential, and also that any equilibrium graph contains an
induced star. The existence of the potential function is exploited in order to
show that an equilibrium can be reached in expected polynomial time even in the
case where nodes can only acquire limited knowledge concerning non-neighboring
nodes.Comment: 11 pages, 4 figure
Metastability of Asymptotically Well-Behaved Potential Games
One of the main criticisms to game theory concerns the assumption of full
rationality. Logit dynamics is a decentralized algorithm in which a level of
irrationality (a.k.a. "noise") is introduced in players' behavior. In this
context, the solution concept of interest becomes the logit equilibrium, as
opposed to Nash equilibria. Logit equilibria are distributions over strategy
profiles that possess several nice properties, including existence and
uniqueness. However, there are games in which their computation may take time
exponential in the number of players. We therefore look at an approximate
version of logit equilibria, called metastable distributions, introduced by
Auletta et al. [SODA 2012]. These are distributions that remain stable (i.e.,
players do not go too far from it) for a super-polynomial number of steps
(rather than forever, as for logit equilibria). The hope is that these
distributions exist and can be reached quickly by logit dynamics.
We identify a class of potential games, called asymptotically well-behaved,
for which the behavior of the logit dynamics is not chaotic as the number of
players increases so to guarantee meaningful asymptotic results. We prove that
any such game admits distributions which are metastable no matter the level of
noise present in the system, and the starting profile of the dynamics. These
distributions can be quickly reached if the rationality level is not too big
when compared to the inverse of the maximum difference in potential. Our proofs
build on results which may be of independent interest, including some spectral
characterizations of the transition matrix defined by logit dynamics for
generic games and the relationship of several convergence measures for Markov
chains
Efficient Equilibria in Polymatrix Coordination Games
We consider polymatrix coordination games with individual preferences where
every player corresponds to a node in a graph who plays with each neighbor a
separate bimatrix game with non-negative symmetric payoffs. In this paper, we
study -approximate -equilibria of these games, i.e., outcomes where
no group of at most players can deviate such that each member increases his
payoff by at least a factor . We prove that for these
games have the finite coalitional improvement property (and thus
-approximate -equilibria exist), while for this
property does not hold. Further, we derive an almost tight bound of
on the price of anarchy, where is the number of
players; in particular, it scales from unbounded for pure Nash equilibria ( to for strong equilibria (). We also settle the complexity
of several problems related to the verification and existence of these
equilibria. Finally, we investigate natural means to reduce the inefficiency of
Nash equilibria. Most promisingly, we show that by fixing the strategies of
players the price of anarchy can be reduced to (and this bound is tight)
Budgeted personalized incentive approaches for smoothing congestion in resource networks
Abstract. Congestion occurs when there is competition for resources by selfish agents. In this paper, we are concerned with smoothing out congestion in a network of resources by using personalized well-timed incentives that are subject to budget constraints. To that end, we provide: (i) a mathematical formulation that computes equilibrium for the resource sharing congestion game with incentives and budget constraints; (ii) an integrated approach that scales to larger problems by exploiting the factored network structure and approximating the attained equilibrium; (iii) an iterative best response algorithm for solving the unconstrained version (no budget) of the resource sharing congestion game; and (iv) theoretical and empirical results (on an illustrative theme park problem) that demonstrate the usefulness of our approach.
A note on anti-coordination and social interactions
This note confirms a conjecture of [Bramoull\'{e}, Anti-coordination and
social interactions, Games and Economic Behavior, 58, 2007: 30-49]. The
problem, which we name the maximum independent cut problem, is a restricted
version of the MAX-CUT problem, requiring one side of the cut to be an
independent set. We show that the maximum independent cut problem does not
admit any polynomial time algorithm with approximation ratio better than
, where is the number of nodes, and arbitrarily
small, unless P=NP. For the rather special case where each node has a degree of
at most four, the problem is still MAXSNP-hard.Comment: 7 page
- …