Location of Repository

Limit Behavior of No-regret Dynamics

By Andriy Zapechelnyuk

Abstract

Consider a repeated game where all players follow no-regret strategies by reinforcing the actions that they regret not having played enough in the past. We show that a resulting no-regret dynamic approaches in the long run a best-response dynamic and leads to its invariant sets: rest points (Nash equilibria) or periodic orbits. The convergence results for best-response dynamics known in the literature immediately apply to no-regret dynamics. Thus, every no-regret dynamic leads to Nash equilibrium in zero-sum games, weighted potential and two-player ordinal potential games, supermodular games with diminishing returns, and some other special classes.Regret minimization, no-regret strategy, best-response dynamic, Nash equilibrium, Shapley polygon, curb set

OAI identifier:

Suggested articles

Preview

Citations

  1. (2001). A general class of adaptive procedures.
  2. (1999). A note on best response dynamics.
  3. (2000). A simple adaptive procedure leading to correlated equilibrium.
  4. (2003). A wide range no-regret theorem.
  5. (1991). Adaptive and sophisticated learning in normal form games.
  6. (1999). Adaptive game playing using multiplicative weights.
  7. (1951). An iterative method of solving a game.
  8. (2009). Approachability with bounded memory.
  9. (1957). Approximation to Bayes risk in repeated play. In
  10. (1998). Asymptotic calibration.
  11. (1992). Best response dynamics and socially stable strategies.
  12. (2003). Clever agents in adaptive learning.
  13. (1974). Conditional logit analysis of qualitative choice behavior.
  14. (2003). Continuous-time regret-based dynamics.
  15. (2003). Deterministic approximation of stochastic evolution in games.
  16. (1995). Evolutionary selection in normal-form games.
  17. (2000). Fictitious play in 23 games.
  18. (2005). Fictitious play in 2n games.
  19. (1996). Fictitious play property for games with identical interests.
  20. (1995). Fictitious play, Shapley polygons, and the replicator equation.
  21. (2008). games: The transition between periodic and chaotic behaviour.
  22. (1998). Individual Strategy and Social Structure.
  23. (1951). Iterative solutions of games by play. In
  24. (1995). Learning by forgetful players.
  25. (2009). Learning in games with unstable equilibria.
  26. (2005). Learning in perturbed asymmetric games.
  27. (1993). Learning mixed equilibria.
  28. (2002). On the global convergence of stochastic ctitious play.
  29. (1998). On the rate of convergence of continuous-time play.
  30. (1997). Population games. In
  31. (2003). Potential-based algorithms in on-line prediction and game theory.
  32. (2006). Prediction, Learning, and Games.
  33. (1999). Regret in the online decision problem.
  34. (1964). Some topics in two person games. In
  35. (2005). Stochastic approximations and dierential inclusions.
  36. (2006). Stochastic approximations and dierential inclusions. Part II: Applications.
  37. (2004). Strategic Learning and Its Limits.
  38. (1978). Strategically zero-sum games: the class of games whose completely mixed equilibria cannot be improved upon.
  39. (1991). Strategy subsets closed under rational behavior.
  40. (1993). The evolution of conventions.
  41. (1993). The statistical mechanics of strategic interaction.
  42. (1988). The Theory of Industrial Organization.
  43. (1994). The weighted majority algorithm.
  44. (2002). Two competing models of how people learn in games.
  45. (2007). Two more classes of games with the continuous-time play property.
  46. (1995). Universal consistency and cautious play.

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.