7 research outputs found

    Finding mixed-strategy equilibria of continuous-action games without gradients using randomized policy networks

    Full text link
    We study the problem of computing an approximate Nash equilibrium of continuous-action game without access to gradients. Such game access is common in reinforcement learning settings, where the environment is typically treated as a black box. To tackle this problem, we apply zeroth-order optimization techniques that combine smoothed gradient estimators with equilibrium-finding dynamics. We model players' strategies using artificial neural networks. In particular, we use randomized policy networks to model mixed strategies. These take noise in addition to an observation as input and can flexibly represent arbitrary observation-dependent, continuous-action distributions. Being able to model such mixed strategies is crucial for tackling continuous-action games that lack pure-strategy equilibria. We evaluate the performance of our method using an approximation of the Nash convergence metric from game theory, which measures how much players can benefit from unilaterally changing their strategy. We apply our method to continuous Colonel Blotto games, single-item and multi-item auctions, and a visibility game. The experiments show that our method can quickly find high-quality approximate equilibria. Furthermore, they show that the dimensionality of the input noise is crucial for performance. To our knowledge, this paper is the first to solve general continuous-action games with unrestricted mixed strategies and without any gradient information

    Focality and Asymmetry in Multi-battle Contests

    Get PDF
    This article examines behavior in two-person constant-sum Colonel Blotto games in which each player maximizes the expected total value of the battlefields won. A lottery contest success function is employed in each battlefield. Recent experimental research on such games provides only partial support for Nash equilibrium behavior. We hypothesize that the salience of battlefields affects strategic behavior (the salient target hypothesis). We present a controlled test of this hypothesis – against Nash predictions – when the sources of salience come from certain asymmetries in either battlefield values or labels (as in Schelling (1960)). In both cases, subjects over-allocate the resource to the salient battlefields relative to the Nash prediction. However, the effect is stronger with salient values. In the absence of salience, we replicate previous results in the literature supporting the Nash prediction

    Focality and Asymmetry in Multi-battle Contests

    Get PDF
    This article examines the influence of focality in Colonel Blotto games with a lottery contest success function (CSF), where the equilibrium is unique and in pure strategies. We hypothesise that the salience of battlefields affects strategic behaviour (the salient target hypothesis) and present a controlled test of this hypothesis against Nash predictions, checking the robustness of equilibrium play. When the sources of salience come from asymmetries in battlefield values or labels (as in Schelling, 1960), subjects over-allocate the resource to the salient battlefields relative to the Nash prediction. However, the effect is stronger with salient values. In the absence of salience, we find support for the Nash prediction

    Colonel Blotto games with a head start

    Full text link
    This paper studies Colonel Blotto games with two battlefields where one player has a head start in the form of additional troops on one of the battlefields. Such games arise naturally in marketing, electoral competition, and military conflict. Sion and Wolfe (1957) have shown that, if the strategy space is continuous, a mixed-strategy Nash equilibrium need not exist. Therefore, we consider a finite approximation. Using the iterated elimination of (weakly) dominated strategies, we identify an equilibrium for all parameter constellations and discuss its uniqueness properties. In equilibrium, resource decisions are typically not uniform but tend to concern units that roughly correspond in size to multiples of the head start. Moreover, competition takes the form of a hide-and-seek game, where the favorite tries to outguess the number of units that the underdog commits to the balanced battlefield. Somewhat unexpectedly, equilibrium payoffs of finite approximations of the Sion-Wolfe game accumulate around precisely three values. We also discuss the relation to the model with heterogeneous budgets but no head start

    How strength asymmetries shape multi-sided conflicts

    Get PDF
    Governments and multilateral organisations often attempt to influence multi-sided violent conflicts by supporting or undermining one of the conflicting parties. We investigate the (intended and unintended) consequences of strengthening or weakening an agent in a multi-sided conflict. Using a conflict network based on Franke and Öztürk (J Public Econ 126:104–113, 2015), we study how changing the strength of otherwise symmetric agents creates knock-on effects throughout the network. Increasing or decreasing an agent’s strength has the same unintended consequences. Changes in the strength of an agent induce a relocation of conflict investments: Distant conflicts are carried out more fiercely. In line with previous results, asymmetry reduces aggregate conflict investments. In the case of bipartite networks, with two conflicting tacit groups with aligned interests, agents in the group of the (now) strong or weak agent face more intense conflicts. Furthermore, in conflicts where the (now strong or weak) agent is not involved, the probabilities of winning remain unchanged compared to the symmetric case.</p

    Generalizations of the General Lotto and Colonel Blotto Games

    Get PDF
    In this paper, we generalize the General Lotto game (budget constraints satisfied in expectation) and the Colonel Blotto game (budget constraints hold with probability one) to allow for battlefield valuations that are heterogeneous across battlefields and asymmetric across players, and for the players to have asymmetric resource constraints. We completely characterize Nash equilibrium in the generalized version of the General Lotto game and then show how this characterization can be applied to identify equilibria in the Colonel Blotto version of the game. In both games, we find that there exist sets of non-pathological parameter configurations of positive Lebesgue measure with multiple payoff nonequivalent equilibria

    OR Forum—Blotto Politics

    No full text
    corecore