Search CORE

307 research outputs found

Generalization properties of finite size polynomial Support Vector Machines

Author: A. Buhot
C. Cortes
C. Marangi
H. Yoon
M. B. Gordon
M. Opper
M. Opper
M. Opper
Mirta B. Gordon
R. Dietrich
R. Monasson
S. Risau-Gusman
Sebastian Risau-Gusman
T. Cover
V. Vapnik
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2000
Field of study

The learning properties of finite size polynomial Support Vector Machines are analyzed in the case of realizable classification tasks. The normalization of the high order features acts as a squeezing factor, introducing a strong anisotropy in the patterns distribution in feature space. As a function of the training set size, the corresponding generalization error presents a crossover, more or less abrupt depending on the distribution's anisotropy and on the task to be learned, between a fast-decreasing and a slowly decreasing regime. This behaviour corresponds to the stepwise decrease found by Dietrich et al.[Phys. Rev. Lett. 82 (1999) 2975-2978] in the thermodynamic limit. The theoretical results are in excellent agreement with the numerical simulations.Comment: 12 pages, 7 figure

arXiv.org e-Print Archive

Crossref

HAL-CEA

Retarded Learning: Rigorous Results from Statistical Mechanics

Author: A. Buhot
B. S. Clarke
B. Schottky
C. Van den Broeck
D. Haussler
D. Haussler
D. Herschkowitz
Didier Herschkowitz
H. S. Seung
H. Schwarze
J. O. Berger
J. Rissanen
M. B. Gordon
M. Biehl
M. Copelli
M. Mezard
M. Opper
M. Opper
M. Opper
M. Opper
Manfred Opper
N. Brunel
O. Kinouchi
P. Reimann
R. P. Feynman
S. Amari
T. Cover
T. L. H. Watkin
Publication venue: 'American Physical Society (APS)'
Publication date: 13/03/2001
Field of study

We study learning of probability distributions characterized by an unknown symmetry direction. Based on an entropic performance measure and the variational method of statistical mechanics we develop exact upper and lower bounds on the scaled critical number of examples below which learning of the direction is impossible. The asymptotic tightness of the bounds suggests an asymptotically optimal method for learning nonsmooth distributions.Comment: 8 pages, 1 figur

arXiv.org e-Print Archive

Crossref

Statistical mechanics of random two-player games

Author: A. Crisanti
B. von Stengel
C.E. Lemke
E.P. Wigner
H. Keiding
J. Berg
J. Berg
J. Berg
J.A. Hertz
J.F. Nash
J.R.L. de Almeida
M. Mézard
M. Mézard
M. Opper
M. Opper
R. Monasson
S. Diederich
Publication venue: 'American Physical Society (APS)'
Publication date: 22/10/1999
Field of study

Using methods from the statistical mechanics of disordered systems we analyze the properties of bimatrix games with random payoffs in the limit where the number of pure strategies of each player tends to infinity. We analytically calculate quantities such as the number of equilibrium points, the expected payoff, and the fraction of strategies played with non-zero probability as a function of the correlation between the payoff matrices of both players and compare the results with numerical simulations.Comment: 16 pages, 6 figures, for further information see http://itp.nat.uni-magdeburg.de/~jberg/games.htm

arXiv.org e-Print Archive

Crossref

Inferring hidden states in Langevin dynamics on large networks: Average case performance

Author: A. Engel
A. S. Cofiño
A. Vakili
B. Bravi
B. Cseke
C. Archambeau
C. M. Bishop
D. F. Anderson
D. V. Voiculescu
E. Domany
M. H. A. Davis
M. L. Mehta
M. Opper
M. Opper
P. Erdős
P. Sollich
R. S. Tsay
R. Speicher
W. Kinzel
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2017
Field of study

We present average performance results for dynamical inference problems in large networks, where a set of nodes is hidden while the time trajectories of the others are observed. Examples of this scenario can occur in signal transduction and gene regulation networks. We focus on the linear stochastic dynamics of continuous variables interacting via random Gaussian couplings of generic symmetry. We analyze the inference error, given by the variance of the posterior distribution over hidden paths, in the thermodynamic limit and as a function of the system parameters and the ratio {\alpha} between the number of hidden and observed nodes. By applying Kalman filter recursions we find that the posterior dynamics is governed by an "effective" drift that incorporates the effect of the observations. We present two approaches for characterizing the posterior variance that allow us to tackle, respectively, equilibrium and nonequilibrium dynamics. The first appeals to Random Matrix Theory and reveals average spectral properties of the inference error and typical posterior relaxation times, the second is based on dynamical functionals and yields the inference error as the solution of an algebraic equation.Comment: 20 pages, 5 figure

arXiv.org e-Print Archive

Crossref

Spiral - Imperial College Digital Repository

King's Research Portal

Entropy and typical properties of Nash equilibria in two-player games

Author: Hertz J.
J Berg
Keiding H.
M Weigt
Mézard M.
Mézard M.
Opper M.
Stengel B. V.
Wang Jianhua
Young A. P.
Publication venue: 'IOP Publishing'
Publication date: 01/01/1999
Field of study

We use techniques from the statistical mechanics of disordered systems to analyse the properties of Nash equilibria of bimatrix games with large random payoff matrices. By means of an annealed bound, we calculate their number and analyse the properties of typical Nash equilibria, which are exponentially dominant in number. We find that a randomly chosen equilibrium realizes almost always equal payoffs to either player. This value and the fraction of strategies played at an equilibrium point are calculated as a function of the correlation between the two payoff matrices. The picture is complemented by the calculation of the properties of Nash equilibria in pure strategies.Comment: 6 pages, was "Self averaging of Nash equilibria in two player games", main section rewritten, some new results, for additional information see http://itp.nat.uni-magdeburg.de/~jberg/games.htm

arXiv.org e-Print Archive

CiteSeerX

Crossref

Generalizing with perceptrons in case of structured phase- and pattern-spaces

Author: Anlauf J K
B Schottky
Berg J
Biehl M
Biehl M
Bishop C M
Bruce A D
Buhmann J
Cortes C
Dirscherl G
Engel A
G Dirscherl
Gardner E
Hertz J
MacKay D J C
MacKay D J C
Monasson R
Opper M
Opper M
Pöppel G
Schottky B
Schottky B
Tarkowski W
U Krey
Watkin T L H
Publication venue: 'IOP Publishing'
Publication date: 01/01/1997
Field of study

We investigate the influence of different kinds of structure on the learning behaviour of a perceptron performing a classification task defined by a teacher rule. The underlying pattern distribution is permitted to have spatial correlations. The prior distribution for the teacher coupling vectors itself is assumed to be nonuniform. Thus classification tasks of quite different difficulty are included. As learning algorithms we discuss Hebbian learning, Gibbs learning, and Bayesian learning with different priors, using methods from statistics and the replica formalism. We find that the Hebb rule is quite sensitive to the structure of the actual learning problem, failing asymptotically in most cases. Contrarily, the behaviour of the more sophisticated methods of Gibbs and Bayes learning is influenced by the spatial correlations only in an intermediate regime of

\alpha

, where

\alpha

specifies the size of the training set. Concerning the Bayesian case we show, how enhanced prior knowledge improves the performance.Comment: LaTeX, 32 pages with eps-figs, accepted by J Phys

arXiv.org e-Print Archive

CiteSeerX

Crossref

An information theoretic approach to statistical dependence: copula information

Author: Amari S.-I.
Caticha A.
Dotsenko V.
Efron B.
Jaynes E. T.
Ma J. Sun Z.
Mari D. D.
Nelsen R. B.
Opper M.
R. S. Calsaverini
R. Vicente
Shental O.
Publication venue: 'IOP Publishing'
Publication date: 01/01/2009
Field of study

We discuss the connection between information and copula theories by showing that a copula can be employed to decompose the information content of a multivariate distribution into marginal and dependence components, with the latter quantified by the mutual information. We define the information excess as a measure of deviation from a maximum entropy distribution. The idea of marginal invariant dependence measures is also discussed and used to show that empirical linear correlation underestimates the amplitude of the actual correlation in the case of non-Gaussian marginals. The mutual information is shown to provide an upper bound for the asymptotic empirical log-likelihood of a copula. An analytical expression for the information excess of T-copulas is provided, allowing for simple model identification within this family. We illustrate the framework in a financial data set.Comment: to appear in Europhysics Letter

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

RCAAP - Repositório Científico de Acesso Aberto de Portugal

Repositório da Produção USP (Univ. de São Paulo)

Storage of correlated patterns in a perceptron

Author: B Lopez
Bork A
Cover T M
Engel A
Fontanari J F
Gardner E
Gardner E
M Opper
M Schroder
Monasson R
Schröder M
Winkel J
Publication venue: 'IOP Publishing'
Publication date: 01/01/1995
Field of study

We calculate the storage capacity of a perceptron for correlated gaussian patterns. We find that the storage capacity

\alpha_c

can be less than 2 if similar patterns are mapped onto different outputs and vice versa. As long as the patterns are in general position we obtain, in contrast to previous works, that

\alpha_c \geq 1

in agreement with Cover's theorem. Numerical simulations confirm the results.Comment: 9 pages LaTeX ioplppt style, figures included using eps

arXiv.org e-Print Archive

CiteSeerX

Crossref

Perceptron capacity revisited: classification ability for correlated patterns

Author: Braunstein A
de Almeida J R L
Derrida B
Dotsenko V S
Engel A
Gardner E
Györgyi G
Kabashima Y
Kabashima Y
Kabashima Y
Kinzel W Kanter I
Krauth W
Krauth W
Marinari E
Minsky M
Montanari A Prabhakar B Tse D
Mézard M
Müller R R Guo D Moustakas A L
Neirotti J P
Opper M
Opper M
Opper M
Parisi G
Pearl J
Plefka T
Takashi Shinzato
Takeda K
Takeda K
Tanaka T
Tulino A M
Verdú S
Voiculescu D V
Yoshiyuki Kabashima
Publication venue: 'IOP Publishing'
Publication date: 25/12/2007
Field of study

In this paper, we address the problem of how many randomly labeled patterns can be correctly classified by a single-layer perceptron when the patterns are correlated with each other. In order to solve this problem, two analytical schemes are developed based on the replica method and Thouless-Anderson-Palmer (TAP) approach by utilizing an integral formula concerning random rectangular matrices. The validity and relevance of the developed methodologies are shown for one known result and two example problems. A message-passing algorithm to perform the TAP scheme is also presented

arXiv.org e-Print Archive

Crossref

Phase transitions in soft-committee machines

Author: Biehl M.
Biehl M.
Chauvin Y.
E Schlösser
Hertz J. A.
M Ahr
M Biehl
Opper M.
Saad D. (Editor)
Schottky B.
Schottky B.
Schwarze H.
Schwarze H.
Urbanczik R.
Urbanczik R.
Vicente R.
Publication venue: 'IOP Publishing'
Publication date: 01/01/1998
Field of study

Equilibrium statistical physics is applied to layered neural networks with differentiable activation functions. A first analysis of off-line learning in soft-committee machines with a finite number (K) of hidden units learning a perfectly matching rule is performed. Our results are exact in the limit of high training temperatures. For K=2 we find a second order phase transition from unspecialized to specialized student configurations at a critical size P of the training set, whereas for K > 2 the transition is first order. Monte Carlo simulations indicate that our results are also valid for moderately low temperatures qualitatively. The limit K to infinity can be performed analytically, the transition occurs after presenting on the order of N K examples. However, an unspecialized metastable state persists up to P= O (N K^2).Comment: 8 pages, 4 figure

arXiv.org e-Print Archive

CiteSeerX

Crossref

Proceedings - University of Groningen

EDP Sciences OAI-PMH repository (1.2.0)

University of Groningen

ARTS repository - University of Groningen

University of Groningen Digital Archive

Dissertations of the University of Groningen