13 research outputs found
Random matrix theory and the loss surfaces of neural networks
Neural network models are one of the most successful approaches to machine
learning, enjoying an enormous amount of development and research over recent
years and finding concrete real-world applications in almost any conceivable
area of science, engineering and modern life in general. The theoretical
understanding of neural networks trails significantly behind their practical
success and the engineering heuristics that have grown up around them. Random
matrix theory provides a rich framework of tools with which aspects of neural
network phenomenology can be explored theoretically. In this thesis, we
establish significant extensions of prior work using random matrix theory to
understand and describe the loss surfaces of large neural networks,
particularly generalising to different architectures. Informed by the
historical applications of random matrix theory in physics and elsewhere, we
establish the presence of local random matrix universality in real neural
networks and then utilise this as a modelling assumption to derive powerful and
novel results about the Hessians of neural network loss surfaces and their
spectra. In addition to these major contributions, we make use of random matrix
models for neural network loss surfaces to shed light on modern neural network
training approaches and even to derive a novel and effective variant of a
popular optimisation algorithm.
Overall, this thesis provides important contributions to cement the place of
random matrix theory in the theoretical study of modern neural networks,
reveals some of the limits of existing approaches and begins the study of an
entirely new role for random matrix theory in the theory of deep learning with
important experimental discoveries and novel theoretical results based on local
random matrix universality.
Comment: 320 pages, PhD thesis
A spin-glass model for the loss surfaces of generative adversarial networks
We present a novel mathematical model that seeks to capture the key design
feature of generative adversarial networks (GANs). Our model consists of two
interacting spin glasses, and we conduct an extensive theoretical analysis of
the complexity of the model's critical points using techniques from Random
Matrix Theory. The result is insights into the loss surfaces of large GANs that
build upon prior insights for simpler networks, but also reveal new structure
unique to this setting.
Comment: 26 pages, 9 figures
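The two-interacting-spin-glass idea can be caricatured numerically. The sketch below is not the paper's model (which analyses critical-point complexity for spherical spin glasses via Random Matrix Theory); it is only a minimal toy in which two players on spheres share a bilinear disorder term with opposite signs, mimicking the adversarial coupling of a GAN. All sizes, couplings, and normalisations here are illustrative choices of ours.

```python
import numpy as np

rng = np.random.default_rng(0)

def spherical_spins(n, rng):
    """A configuration drawn uniformly from the sphere of radius sqrt(n),
    the usual state space for spherical spin glasses."""
    x = rng.standard_normal(n)
    return np.sqrt(n) * x / np.linalg.norm(x)

n_d, n_g = 40, 60  # illustrative sizes for the two glasses (our choice)

# Independent Gaussian disorder: one coupling matrix per glass,
# plus a bilinear term coupling the two (normalisations are ours)
J_d = rng.standard_normal((n_d, n_d))
J_g = rng.standard_normal((n_g, n_g))
J_x = rng.standard_normal((n_d, n_g))

def energies(s_d, s_g):
    """Per-player energies for a toy two-glass game: each player feels its
    own disorder plus a shared interaction, entering with opposite signs."""
    h_d = s_d @ J_d @ s_d / n_d ** 1.5
    h_g = s_g @ J_g @ s_g / n_g ** 1.5
    h_x = s_d @ J_x @ s_g / (n_d * n_g) ** 0.75
    return h_d + h_x, h_g - h_x  # adversarial: one player's gain is the other's loss on h_x

s_d, s_g = spherical_spins(n_d, rng), spherical_spins(n_g, rng)
e_d, e_g = energies(s_d, s_g)
```

The opposite signs on the interaction term are what make the toy "adversarial": neither player can minimise its energy without affecting the other's.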
Universal characteristics of deep neural network loss surfaces from random matrix theory
This paper considers several aspects of random matrix universality in deep
neural networks. Motivated by recent experimental work, we use universal
properties of random matrices related to local statistics to derive practical
implications for deep neural networks based on a realistic model of their
Hessians. In particular we derive universal aspects of outliers in the spectra
of deep neural networks and demonstrate the important role of random matrix
local laws in popular pre-conditioning gradient descent algorithms. We also
present insights into deep neural network loss surfaces from quite general
arguments based on tools from statistical physics and random matrix theory.
Comment: 42 pages
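The spectral outliers mentioned above have a classic random-matrix illustration: adding a rank-one "signal" spike to a GOE noise matrix produces an eigenvalue that detaches from the bulk once the spike strength crosses a threshold (the BBP transition). The following is a generic demonstration of that phenomenon, not a reconstruction of the paper's Hessian model.

```python
import numpy as np

rng = np.random.default_rng(7)

def goe(n, rng):
    """Sample an n x n GOE matrix, normalised so the bulk spectrum fills [-2, 2]."""
    a = rng.standard_normal((n, n))
    return (a + a.T) / np.sqrt(2 * n)

n, theta = 1000, 3.0          # theta: strength of the rank-one spike
v = np.ones(n) / np.sqrt(n)   # unit vector carrying the spike

# Deformed ensemble: bulk noise plus a rank-one perturbation
h = goe(n, rng) + theta * np.outer(v, v)

eigs = np.linalg.eigvalsh(h)  # ascending order
lam_max = eigs[-1]
# BBP-type prediction: for theta > 1 an outlier detaches from the bulk
# and sits near theta + 1/theta (here 3 + 1/3 ≈ 3.33), while the rest
# of the spectrum stays confined to roughly [-2, 2]
print(lam_max)
```

For theta below 1 the spike is swallowed by the bulk and no outlier appears, which is the transition the "universal aspects of outliers" language refers to.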
Appearance of Random Matrix Theory in Deep Learning
We investigate the local spectral statistics of the loss surface Hessians of
artificial neural networks, where we discover excellent agreement with Gaussian
Orthogonal Ensemble statistics across several network architectures and
datasets. These results shed new light on the applicability of Random Matrix
Theory to modelling neural networks and suggest a previously unrecognised role
for it in the study of loss surfaces in deep learning. Inspired by these
observations, we propose a novel model for the true loss surfaces of neural
networks, consistent with our observations, which allows for Hessian spectral
densities with rank degeneracy and outliers, extensively observed in practice,
and predicts a growing independence of loss gradients as a function of distance
in weight-space. We further investigate the importance of the true loss surface
in neural networks and find, in contrast to previous work, that the exponential
hardness of locating the global minimum has practical consequences for
achieving state-of-the-art performance.
Comment: 33 pages, 14 figures
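A standard way to test for the Gaussian Orthogonal Ensemble local statistics described above is the adjacent-gap ratio, which is scale-free and so needs no unfolding of the spectrum. The sketch below applies it to a freshly sampled GOE matrix, standing in for a network Hessian; the GOE prediction for the mean ratio is approximately 0.5307.

```python
import numpy as np

rng = np.random.default_rng(42)

def goe_matrix(n, rng):
    """Sample an n x n matrix from the Gaussian Orthogonal Ensemble."""
    a = rng.standard_normal((n, n))
    return (a + a.T) / np.sqrt(2 * n)

def mean_gap_ratio(eigs):
    """Mean adjacent-gap ratio r_i = min(s_i, s_{i+1}) / max(s_i, s_{i+1}),
    where s_i are consecutive level spacings; a standard probe of local
    spectral statistics that is insensitive to the global density."""
    s = np.diff(np.sort(eigs))
    r = np.minimum(s[:-1], s[1:]) / np.maximum(s[:-1], s[1:])
    return r.mean()

eigs = np.linalg.eigvalsh(goe_matrix(1000, rng))
print(mean_gap_ratio(eigs))  # GOE prediction is ~0.5307
```

Applied to empirical Hessian eigenvalues instead of the sampled matrix, agreement with ~0.5307 (rather than the ~0.386 of uncorrelated Poisson levels) is the kind of evidence of GOE local statistics the abstract describes.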
Type, typography and the typographer
This chapter considers the changing role of typography and the evolving practices of the typographer across six centuries. It charts the effect of technology, trade, and training on the profession as typographic responsibility passed from printer to compositor, to designer, and finally to Everyman. This chapter also considers the changing visual appearance of typographic books, their journey to free themselves from the conventions of the manuscript book, and their influence on the e‐book.
What drives adoption of a computerised, multifaceted quality improvement intervention for cardiovascular disease management in primary healthcare settings? A mixed methods analysis using normalisation process theory
Rethinking penal modernism from the global South: The case of convict transportation to Australia
Criminological accounts of penal modernization have generally overlooked the experience of convict transportation to, and in, the global South, an effect of the general tendency of metropolitan theory to embed particular experiences and perspectives, and present them as universal. In consequence, the implications for our understanding of crime and punishment of this momentous penal project, spanning more than 80 years in the case of Australia, have received limited attention. The article reflects on this lacuna in contemporary penal thought and considers some of the historical, conceptual and policy lessons that might be drawn from an effort to incorporate convict transportation into an account of modern penal development.