Correlations between hidden units in multilayer neural networks and replica symmetry breaking
We consider feed-forward neural networks with one hidden layer, tree
architecture and a fixed hidden-to-output Boolean function. Focusing on the
saturation limit of the storage problem, the influence of replica symmetry
breaking on the distribution of local fields at the hidden units is
investigated. These field distributions determine the probability for finding a
specific activation pattern of the hidden units as well as the corresponding
correlation coefficients and therefore quantify the division of labor among the
hidden units. We find that although replica symmetry breaking markedly modifies
the storage capacity and the distribution of local fields, it has only a minor
effect on the correlation coefficients. Detailed numerical results are
provided for the PARITY, COMMITTEE and AND machines with K=3 hidden units and
nonoverlapping receptive fields.
Comment: 9 pages, 3 figures, RevTex, accepted for publication in Phys. Rev.
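The quantities this abstract studies can be estimated numerically: sample local fields at the hidden units of a tree-architecture machine, form the Boolean activation pattern, and compute pairwise correlation coefficients. The sketch below uses random (untrained) couplings and illustrative sizes, so the off-diagonal correlations come out near zero; the paper's point is how couplings trained to saturation, with or without replica symmetry breaking, change these numbers.

```python
import numpy as np

rng = np.random.default_rng(0)

K, n, P = 3, 50, 20000      # K hidden units, n inputs per receptive field, P patterns

# Tree architecture: one weight vector per hidden unit, nonoverlapping fields.
# Random (untrained) couplings here; the paper studies couplings at saturation.
W = rng.standard_normal((K, n))
X = rng.standard_normal((P, K, n))

fields = np.einsum('pkn,kn->pk', X, W) / np.sqrt(n)   # local fields at hidden units
tau = np.sign(fields)                                 # hidden activation pattern
sigma_parity = np.prod(tau, axis=1)                   # PARITY decoder: product of hidden outputs

C = np.corrcoef(tau.T)                                # pairwise correlation coefficients
print(np.round(C, 3))
```

Swapping `np.prod` for a majority vote gives the COMMITTEE decoder, and `np.all(tau > 0, axis=1)` the AND machine.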
Correlation of internal representations in feed-forward neural networks
Feed-forward multilayer neural networks implementing random input-output
mappings develop characteristic correlations between the activity of their
hidden nodes which are important for the understanding of the storage and
generalization performance of the network. It is shown how these correlations
can be calculated from the joint probability distribution of the aligning
fields at the hidden units for arbitrary decoder function between hidden layer
and output. Explicit results are given for the parity-, and-, and
committee-machines with an arbitrary number of hidden nodes near saturation.
Comment: 6 pages, latex, 1 figure
Statistical Mechanics of Learning: A Variational Approach for Real Data
Using a variational technique, we generalize the statistical physics approach
of learning from random examples to make it applicable to real data. We
demonstrate the validity and relevance of our method by computing approximate
estimators for generalization errors that are based on training data alone.
Comment: 4 pages, 2 figures
Storage capacity of correlated perceptrons
We consider an ensemble of single-layer perceptrons exposed to random
inputs and investigate the conditions under which the couplings of these
perceptrons can be chosen such that prescribed correlations between the outputs
occur. A general formalism is introduced using a multi-perceptron cost function
that allows one to determine the maximal number of random inputs as a function
of the desired values of the correlations. Replica-symmetric results are
compared with properties of two-layer networks of tree structure and
fixed Boolean function between hidden units and output. The results show which
correlations in the hidden layer of multi-layer neural networks are crucial for
the value of the storage capacity.
Comment: 16 pages, Latex2
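A basic ingredient behind prescribed output correlations is the relation between the overlap of two coupling vectors and the correlation of their sign outputs under Gaussian inputs: for unit-norm couplings with overlap q, the output correlation is (2/π)·arcsin(q). A small numerical check of that standard identity (all sizes here are illustrative, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(1)
N, P = 200, 50000
q = 0.6                                    # desired overlap between the two coupling vectors

# Construct two unit-norm coupling vectors with overlap exactly q
w1 = rng.standard_normal(N)
v = rng.standard_normal(N)
v -= (v @ w1) / (w1 @ w1) * w1             # orthogonalize v against w1
w1 /= np.linalg.norm(w1)
v /= np.linalg.norm(v)
w2 = q * w1 + np.sqrt(1.0 - q**2) * v

X = rng.standard_normal((P, N))
s1, s2 = np.sign(X @ w1), np.sign(X @ w2)
c_emp = np.mean(s1 * s2)                   # empirical output correlation
c_theory = 2.0 / np.pi * np.arcsin(q)      # Gaussian identity for sign outputs
print(c_emp, c_theory)
```

The storage-capacity question of the abstract is the harder inverse problem: for how many random patterns can couplings be chosen so that prescribed correlations like this occur.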
Statistical Mechanical Development of a Sparse Bayesian Classifier
The demand for extracting rules from high dimensional real world data is
increasing in various fields. However, the possible redundancy of such data
sometimes makes it difficult to obtain a good generalization ability for novel
samples. To resolve this problem, we provide a scheme that reduces the
effective dimensions of data by pruning redundant components for bicategorical
classification based on the Bayesian framework. First, the potential of the
proposed method is confirmed in ideal situations using the replica method.
Unfortunately, performing the scheme exactly is computationally difficult. We
therefore develop a tractable approximation algorithm, which turns out to offer
nearly optimal performance in ideal cases when the system size is large.
Finally, the efficacy of the developed classifier is experimentally examined
for a real world problem of colon cancer classification, which shows that the
developed method can be practically useful.
Comment: 13 pages, 6 figures
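The idea of reducing effective dimension by pruning redundant components can be illustrated on synthetic bicategorical data in which only a few coordinates carry signal. The sketch below is a crude stand-in (ridge fit plus hard thresholding), not the paper's Bayesian scheme or its approximation algorithm; all sizes and the threshold are illustrative.

```python
import numpy as np

rng = np.random.default_rng(2)
N, P, S = 100, 400, 5       # N input dimensions, P samples, S relevant dimensions

# Synthetic bicategorical data: only the first S components carry signal
w_true = np.zeros(N)
w_true[:S] = 3.0
X = rng.standard_normal((P, N))
y = np.sign(X @ w_true)

# Crude stand-in for Bayesian pruning: ridge fit, then drop components
# whose estimated weight is small (threshold 0.2 is ad hoc)
w_hat = np.linalg.solve(X.T @ X + 10.0 * np.eye(N), X.T @ y)
keep = np.abs(w_hat) > 0.2
print("kept components:", np.flatnonzero(keep))
```

Even this naive scheme recovers the relevant coordinates here; the abstract's contribution is doing the pruning within a Bayesian framework with near-optimal performance guarantees from the replica analysis.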
Training a perceptron in a discrete weight space
On-line and batch learning of a perceptron in a discrete weight space, where
each weight can take one of L different values, are examined analytically and
numerically. The learning algorithm is based on the training of the continuous
perceptron and prediction following the clipped weights. The learning is
described by a new set of order parameters, composed of the overlaps between
the teacher and the continuous/clipped students. Different scenarios are
examined among them on-line learning with discrete/continuous transfer
functions and off-line Hebb learning. The generalization error of the clipped
weights decays asymptotically as / in the case of on-line learning with
binary/continuous activation functions, respectively, where α is the number of
examples divided by N, the size of the input vector, and is a positive constant
that decays linearly with 1/L. For finite N and L, perfect agreement between
the discrete student and the teacher is obtained for . A crossover to the
generalization error , characterizing continuous weights with binary output, is
obtained for synaptic depth .
Comment: 10 pages, 5 figs., submitted to PR
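The "train continuous, predict with clipped weights" scheme is easy to simulate. The sketch below uses on-line Hebbian updates with a binary teacher and binary clipping (L = 2) at α = 20; it is an illustrative special case, not the paper's full analysis over algorithms, transfer functions, and synaptic depths. The two printed numbers are the order parameters of the abstract: the teacher's overlap with the continuous and with the clipped student.

```python
import numpy as np

rng = np.random.default_rng(3)
N, steps = 1000, 20000       # input dimension, number of on-line examples (alpha = 20)

teacher = np.sign(rng.standard_normal(N))   # binary teacher; L = 2 (binary clipping)
J = np.zeros(N)                             # continuous student

for _ in range(steps):
    x = rng.standard_normal(N)
    y = np.sign(teacher @ x)
    J += y * x / np.sqrt(N)                 # Hebbian update of the continuous weights

J_clip = np.sign(J)                          # prediction follows the clipped weights
R_cont = (teacher @ J) / (np.linalg.norm(teacher) * np.linalg.norm(J))
R_clip = (teacher @ J_clip) / N              # overlap of teacher with the clipped student
print(R_cont, R_clip)
```

At this α the clipped overlap is already very close to 1, consistent with the fast decay of the clipped generalization error described in the abstract.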
Replica theory for learning curves for Gaussian processes on random graphs
Statistical physics approaches can be used to derive accurate predictions for
the performance of inference methods learning from potentially noisy data, as
quantified by the learning curve defined as the average error versus number of
training examples. We analyse a challenging problem in the area of
non-parametric inference where an effectively infinite number of parameters has
to be learned, specifically Gaussian process regression. When the inputs are
vertices on a random graph and the outputs noisy function values, we show that
replica techniques can be used to obtain exact performance predictions in the
limit of large graphs. The covariance of the Gaussian process prior is defined
by a random walk kernel, the discrete analogue of squared exponential kernels
on continuous spaces. Conventionally this kernel is normalised only globally,
so that the prior variance can differ between vertices; as a more principled
alternative we consider local normalisation, where the prior variance is
uniform.
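The global-versus-local normalisation distinction is concrete: a random walk kernel built from the normalized graph Laplacian has a vertex-dependent diagonal, so one overall scale factor (global normalisation) leaves the prior variance varying across vertices, while rescaling each row and column by the square root of its own diagonal (local normalisation) makes the prior variance uniform. A sketch on an Erdős-Rényi graph with illustrative parameters:

```python
import numpy as np

rng = np.random.default_rng(4)
n, p_edge = 200, 0.03
a, steps = 2.0, 10           # random walk kernel (I - L/a)^steps, a >= 2

# Erdos-Renyi random graph
A = (rng.random((n, n)) < p_edge).astype(float)
A = np.triu(A, 1); A = A + A.T
d = np.maximum(A.sum(1), 1.0)                          # guard isolated vertices
L = np.eye(n) - A / np.sqrt(d[:, None] * d[None, :])   # normalized Laplacian

K = np.linalg.matrix_power(np.eye(n) - L / a, steps)   # random walk kernel

# Global normalisation: one overall scale; prior variance K_ii still varies
K_global = K / K.diagonal().mean()
# Local normalisation: rescale so every vertex has unit prior variance
s = 1.0 / np.sqrt(K.diagonal())
K_local = s[:, None] * K * s[None, :]
print(K_global.diagonal().std(), K_local.diagonal().std())
```

The first printed number is nonzero (variance differs between vertices under global normalisation); the second is zero up to rounding.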
Evidence for Alternative Hypotheses
Most researchers want evidence for the direction of an effect, not evidence against a point null hypothesis. Such evidence is ideally on a scale that is easily interpretable, with an accompanying standard error. Further, the evidence from identical experiments should be repeatable, and evidence from independent experiments should be easily combined, such as is required in meta-analysis. Such a measure of evidence exists and has been shown to be closely related to the Kullback-Leibler symmetrized distance between null and alternative hypotheses for exponential families. Here we provide more examples of the latter phenomenon, for distributions lying outside the class of exponential families, including the non-central chi-squared family with unknown non-centrality parameter.
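For the simplest exponential-family case, the Kullback-Leibler symmetrized distance has a closed form: between N(μ₁, σ²) and N(μ₀, σ²) it is (μ₁ − μ₀)²/σ², a squared standardized effect size, which is what makes it an interpretable evidence scale. A minimal computation of this case (the abstract's actual contribution concerns families outside this class, such as the non-central chi-squared):

```python
def kl_normal(mu1, mu0, sigma=1.0):
    """KL divergence KL(N(mu1, sigma^2) || N(mu0, sigma^2))."""
    return (mu1 - mu0) ** 2 / (2.0 * sigma ** 2)

def symmetrized_kl(mu1, mu0, sigma=1.0):
    """Kullback-Leibler symmetrized distance between the two normals."""
    return kl_normal(mu1, mu0, sigma) + kl_normal(mu0, mu1, sigma)

# Point null mu0 = 0 vs directional alternative mu1 = 0.5, unit variance:
# the symmetrized distance is (0.5)^2 = 0.25
print(symmetrized_kl(0.5, 0.0))   # 0.25
```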
Thiolutin is a zinc chelator that inhibits the Rpn11 and other JAMM metalloproteases
Thiolutin is a disulfide-containing antibiotic and anti-angiogenic compound produced by Streptomyces. Its biological targets are not known. We show that reduced thiolutin is a zinc chelator that inhibits the JAB1/MPN/Mov34 (JAMM) domain–containing metalloprotease Rpn11, a deubiquitinating enzyme of the 19S proteasome. Thiolutin also inhibits the JAMM metalloproteases Csn5, the deneddylase of the COP9 signalosome; AMSH, which regulates ubiquitin-dependent sorting of cell-surface receptors; and BRCC36, a K63-specific deubiquitinase of the BRCC36-containing isopeptidase complex and the BRCA1–BRCA2-containing complex. We provide evidence that other dithiolopyrrolones also function as inhibitors of JAMM metalloproteases
Climate change effects on phytoplankton depend on cell size and food web structure
We investigated the effects of warming on a natural phytoplankton community from the Baltic Sea, based on six mesocosm experiments conducted 2005–2009. We focused on differences in the dynamics of three phytoplankton size groups which are grazed to a variable extent by different zooplankton groups. While small-sized algae were mostly grazer-controlled, light and nutrient availability largely determined the growth of medium- and large-sized algae. Thus, the latter groups dominated at increased light levels. Warming increased mesozooplankton grazing on medium-sized algae, reducing their biomass. The biomass of small-sized algae was not affected by temperature, probably due to an interplay between indirect effects spreading through the food web. Thus, under the higher temperature and lower light levels anticipated for the next decades in the southern Baltic Sea, a higher share of smaller phytoplankton is expected. We conclude that considering the size structure of the phytoplankton community strongly improves the reliability of projections of climate change effects