Search CORE

5,042 research outputs found

Log-Distributional Approach for Learning Covariate Shift Ratios

Author: Bernárdez Gil Guillermo
Publication venue: Universitat Politècnica de Catalunya
Publication date: 01/01/2019
Field of study

Distributional Reinforcement Learning theory suggests that distributional fixed points could play a fundamental role to learning non additive value functions. In particular, we propose a distributional approach for learning Covariate Shift Ratios, whose update rule is originally multiplicative

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Statistical equilibrium in simple exchange games I

Author: Angle
Angle
Angle
Bach
Bennati
Bennati
Costantini
Costantini
Dragulescu
E. Scalas
Foley
Föllmer
Hill
S. Donadio
Silver
U. Garibaldi
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

Simple stochastic exchange games are based on random allocation of finite resources. These games are Markov chains that can be studied either analytically or by Monte Carlo simulations. In particular, the equilibrium distribution can be derived either by direct diagonalization of the transition matrix, or using the detailed balance equation, or by Monte Carlo estimates. In this paper, these methods are introduced and applied to the Bennati-Dragulescu-Yakovenko (BDY) game. The exact analysis shows that the statistical-mechanical analogies used in the previous literature have to be revised.Comment: 11 pages, 3 figures, submitted to EPJ

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Research Papers in Economics

CERN Document Server

Archivio della ricerca- Università di Roma La Sapienza

Sussex Research Online

Archivio Istituzionale della Ricerca- Università del Piemonte Orientale

Winning quick and dirty: the greedy random walk

Author: Abramowitz M
Diaconis P
E Ben-Naim
Feller W
Fisher M E
Hill J M
Lifshitz I M
Redner S
Robinson D Vijay S
Rudnick J
S Redner
Weiss G H
Weiss G H
Publication venue: 'IOP Publishing'
Publication date: 01/01/2004
Field of study

As a strategy to complete games quickly, we investigate one-dimensional random walks where the step length increases deterministically upon each return to the origin. When the step length after the kth return equals k, the displacement of the walk x grows linearly in time. Asymptotically, the probability distribution of displacements is a purely exponentially decaying function of |x|/t. The probability E(t,L) for the walk to escape a bounded domain of size L at time t decays algebraically in the long time limit, E(t,L) ~ L/t^2. Consequently, the mean escape time ~ L ln L, while ~ L^{2n-1} for n>1. Corresponding results are derived when the step length after the kth return scales as k^alpha$ for alpha>0.Comment: 7 pages, 6 figures, 2-column revtext4 forma

arXiv.org e-Print Archive

CiteSeerX

Crossref

Cooperation in a resource extraction game

Author: Stähler Frank
Wagner Friedrich
Publication venue
Publication date
Field of study

An exhaustible stock of resources may be exploited by N players. An arbitrarily long duration of the game is only possible, if the utility function satisfies certain restrictions at small values R of extraction. We find that stability against unilateral defection occurs if the elasticity of the marginal utility turns out to be larger than (N - 1 )/N, however independent of the value of the discount factor. Hence we find that cooperation does not depend on the discount factor for a certain range of elasticities. Analogy to phase transitions in statistical physics is discussed.

Research Papers in Economics

Explicit lower and upper bounds on the entangled value of multiplayer XOR games

Author: A. Aspect
A. Aspect
B.S. Tsirelson
D. Pérez-García
D.L. Hanson
G. Pisier
J. Kempe
J.F. Clauser
J.S. Bell
M. Junge
M. Zukowski
N.D. Mermin
R. Latała
U. Haagerup
U. Haagerup
V.I. Paulsen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 19/03/2012
Field of study

XOR games are the simplest model in which the nonlocal properties of entanglement manifest themselves. When there are two players, it is well known that the bias --- the maximum advantage over random play --- of entangled players can be at most a constant times greater than that of classical players. Recently, P\'{e}rez-Garc\'{i}a et al. [Comm. Math. Phys. 279 (2), 2008] showed that no such bound holds when there are three or more players: the advantage of entangled players over classical players can become unbounded, and scale with the number of questions in the game. Their proof relies on non-trivial results from operator space theory, and gives a non-explicit existence proof, leading to a game with a very large number of questions and only a loose control over the local dimension of the players' shared entanglement. We give a new, simple and explicit (though still probabilistic) construction of a family of three-player XOR games which achieve a large quantum-classical gap (QC-gap). This QC-gap is exponentially larger than the one given by P\'{e}rez-Garc\'{i}a et. al. in terms of the size of the game, achieving a QC-gap of order

\sqrt{N}

with

N^2

questions per player. In terms of the dimension of the entangled state required, we achieve the same (optimal) QC-gap of

\sqrt{N}

for a state of local dimension

N

per player. Moreover, the optimal entangled strategy is very simple, involving observables defined by tensor products of the Pauli matrices. Additionally, we give the first upper bound on the maximal QC-gap in terms of the number of questions per player, showing that our construction is only quadratically off in that respect. Our results rely on probabilistic estimates on the norm of random matrices and higher-order tensors which may be of independent interest.Comment: Major improvements in presentation; results identica

arXiv.org e-Print Archive

DSpace@MIT

Crossref

CWI's Institutional Repository

A general methodology to price and hedge derivatives in incomplete markets

Author: Aurell E.
Baviera R.
Hammarlid O.
Serva M.
Vulpiani A.
Publication venue
Publication date: 09/04/1999
Field of study

We introduce and discuss a general criterion for the derivative pricing in the general situation of incomplete markets, we refer to it as the No Almost Sure Arbitrage Principle. This approach is based on the theory of optimal strategy in repeated multiplicative games originally introduced by Kelly. As particular cases we obtain the Cox-Ross-Rubinstein and Black-Scholes in the complete markets case and the Schweizer and Bouchaud-Sornette as a quadratic approximation of our prescription. Technical and numerical aspects for the practical option pricing, as large deviation theory approximation and Monte Carlo computation are discussed in detail.Comment: 24 pages, LaTeX, epsfig.sty, 5 eps figures, changes in the presentation of the method, submitted to International J. of Theoretical and Applied Financ

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Politecnico di Milano

Game theory, maximum entropy, minimum discrepancy and robust Bayesian decision theory

Author: Dawid A. Philip
Grunwald Peter D.
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2002
Field of study

We describe and develop a close relationship between two problems that have customarily been regarded as distinct: that of maximizing entropy, and that of minimizing worst-case expected loss. Using a formulation grounded in the equilibrium theory of zero-sum games between Decision Maker and Nature, these two problems are shown to be dual to each other, the solution to each providing that to the other. Although Tops\oe described this connection for the Shannon entropy over 20 years ago, it does not appear to be widely known even in that important special case. We here generalize this theory to apply to arbitrary decision problems and loss functions. We indicate how an appropriate generalized definition of entropy can be associated with such a problem, and we show that, subject to certain regularity conditions, the above-mentioned duality continues to apply in this extended context. This simultaneously provides a possible rationale for maximizing entropy and a tool for finding robust Bayes acts. We also describe the essential identity between the problem of maximizing entropy and that of minimizing a related discrepancy or divergence between distributions. This leads to an extension, to arbitrary discrepancies, of a well-known minimax theorem for the case of Kullback-Leibler divergence (the ``redundancy-capacity theorem'' of information theory). For the important case of families of distributions having certain mean values specified, we develop simple sufficient conditions and methods for identifying the desired solutions.Comment: Published by the Institute of Mathematical Statistics (http://www.imstat.org) in the Annals of Statistics (http://www.imstat.org/aos/) at http://dx.doi.org/10.1214/00905360400000055

arXiv.org e-Print Archive

CiteSeerX

Crossref

UCL Discovery

Growth Optimal Investment and Pricing of Derivatives

Author: Angelo Vulpiani
Avellaneda
Bachelier
Bernoulli
Black
Black
Black
Bouchaud
Bouchaud
Chance
Cox
Cox
Derman
Duffie
Dybvig
Efron
Erik Aurell
Follmer
Hakanson
Hammarlid
Harrison
Hull
Ingersoll
Kelly
Macbeth
Maurizio Serva
Merton
Merton
Ola Hammarlid
Press
Roberto Baviera
Rubinstein
Rubinstein
Samuelson
Samuelson
Schweizer
Schweizer
Schäl
Serva
Stutzer
Varadhan
Wolczyńska
Publication venue: 'Elsevier BV'
Publication date: 14/10/1999
Field of study

We introduce a criterion how to price derivatives in incomplete markets, based on the theory of growth optimal strategy in repeated multiplicative games. We present reasons why these growth-optimal strategies should be particularly relevant to the problem of pricing derivatives. We compare our result with other alternative pricing procedures in the literature, and discuss the limits of validity of the lognormal approximation. We also generalize the pricing method to a market with correlated stocks. The expected estimation error of the optimal investment fraction is derived in a closed form, and its validity is checked with a small-scale empirical test.Comment: 21 pages, 5 figure

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Politecnico di Milano

Crossref