Search CORE

8 research outputs found

Random matrix theory and the loss surfaces of neural networks

Author: Baskerville Nick P
Publication venue
Publication date: 22/06/2023
Field of study

Appearence of Random Matrix Theory in Deep Learning

Author: Baskerville Nick P
Granziol Diego
Keating Jonathan P
Publication venue: 'Elsevier BV'
Publication date: 11/12/2021
Field of study

We investigate the local spectral statistics of the loss surface Hessians of artificial neural networks, where we discover excellent agreement with Gaussian Orthogonal Ensemble statistics across several network architectures and datasets. These results shed new light on the applicability of Random Matrix Theory to modelling neural networks and suggest a previously unrecognised role for it in the study of loss surfaces in deep learning. Inspired by these observations, we propose a novel model for the true loss surfaces of neural networks, consistent with our observations, which allows for Hessian spectral densities with rank degeneracy and outliers, extensively observed in practice, and predicts a growing independence of loss gradients as a function of distance in weight-space. We further investigate the importance of the true loss surface in neural networks and find, in contrast to previous work, that the exponential hardness of locating the global minimum has practical consequences for achieving state of the art performance.Comment: 33 pages, 14 figure

arXiv.org e-Print Archive

Oxford University Research Archive

Explore Bristol Research

Iterative Averaging in the Quest for Best Test Error

Author: Albane Samuel
Baskerville Nick P
Granziol Diego
Roberts Stephen
Wan Xingchen
Publication venue
Publication date: 31/10/2021
Field of study

We analyse and explain the increased generalisation performance of iterate averaging using a Gaussian process perturbation model between the true and batch risk surface on the high dimensional quadratic. We derive three phenomena \latestEdits{from our theoretical results:} (1) The importance of combining iterate averaging (IA) with large learning rates and regularisation for improved regularisation. (2) Justification for less frequent averaging. (3) That we expect adaptive gradient methods to work equally well, or better, with iterate averaging than their non-adaptive counterparts. Inspired by these results\latestEdits{, together with} empirical investigations of the importance of appropriate regularisation for the solution diversity of the iterates, we propose two adaptive algorithms with iterate averaging. These give significantly better results compared to stochastic gradient descent (SGD), require less tuning and do not require early stopping or validation set monitoring. We showcase the efficacy of our approach on the CIFAR-10/100, ImageNet and Penn Treebank datasets on a variety of modern and classical network architectures

arXiv.org e-Print Archive

Explore Bristol Research

Microprocessor mediates transcriptional termination of long noncoding RNA transcripts hosting microRNAs

Author: A El Hage
A Kozomara
A Wagschal
AE Almada
AG Rondón
Ashish Dhir
B Langmead
C Esau
Catherine L Jopling
CH Chien
CL Jopling
CR Mandel
D Baillat
D O'Reilly
E Mogilyansky
E Ntini
EJ Steinmetz
EM Prescott
G Ghazal
GM Sundaram
GS Pall
H Sperber
H Tilgner
IH Greger
IX Wang
J Bracht
J Chang
J Elmén
J Kawauchi
J Krol
J Ribas
JE Wilusz
JG Ruby
JT Arigo
K Skourti-Stathaki
L Vasiljeva
L Yang
M Ballarino
M Ha
M Kim
M Kim
M Morlando
M Xie
MJ Dye
MJ Dye
N Gromak
NG Kolev
Nick J Proudfoot
NJ Proudfoot
P Flicek
RC Friedman
S Baskerville
S Lamble
S West
S West
Somdutta Dhir
T Conrad
T Derrien
VC Auyeung
WJ Kent
X Cai
Y Lee
YK Kim
Z Zhang
Z Zhang
ZY Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

MicroRNA (miRNA) play a major role in the post-transcriptional regulation of gene expression. Mammalian miRNA biogenesis begins with co-transcriptional cleavage of RNA polymerase II (Pol II) transcripts by the Microprocessor complex. While most miRNA are located within introns of protein coding genes, a substantial minority of miRNA originate from long non coding (lnc) RNA where transcript processing is largely uncharacterized. Here, by detailed characterization of liver-specific lnc-pri-miR-122 and genome-wide analysis, we show that most lnc-pri-miRNA do not use the canonical cleavage and polyadenylation (CPA) pathway but instead use Microprocessor cleavage to terminate transcription. Microprocessor inactivation leads to extensive transcriptional readthrough of lnc-pri-miRNA and transcriptional interference with downstream genes. Consequently we define a novel RNase III-mediated, polyadenylation-independent mechanism of Pol II transcription termination in mammalian cells

Nottingham ePrints

Nottingham eTheses

Crossref

Repository@Nottingham

PubMed Central

Oxford University Research Archive

A Random Matrix Theory Approach to Damping in Deep Learning

Author: Baskerville Nick P
Granziol Diego
Publication venue
Publication date: 16/03/2022
Field of study

Explore Bristol Research

Mass media promotion of a smartphone smoking cessation app: modelled health and cost-saving impacts

Author: BM Iacoviello
C Clayforth
C Guerriero
Christine Cleghorn
DB Buller
E Atusingwize
FS Deen van der
GBD Risk Factor Collaborators
HK Ubhi
L Marsh
M Bruno
ML Ybarra
N Nghiem
N Wilson
NB Baskerville
NF BinDhim
Nhung Nghiem
Nick Wilson
P Bockerman
Quitline
R Patel
S Merry
T Blakely
T Blakely
Tony Blakely
William Leung
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Primary microRNA transcripts are processed co-transcriptionally.

microRNAs (miRNAs) are generated from long primary (pri-) RNA polymerase II (Pol II)-derived transcripts by two RNase III processing reactions: Drosha cleavage of nuclear pri-miRNAs and Dicer cleavage of cytoplasmic pre-miRNAs. Here we show that Drosha cleavage occurs during transcription acting on both independently transcribed and intron-encoded miRNAs. We also show that both 5'-3' and 3'-5' exonucleases associate with the sites where co-transcriptional Drosha cleavage occurs, promoting intron degradation before splicing. We finally demonstrate that miRNAs can also derive from 3' flanking transcripts of Pol II genes. Our results demonstrate that multiple miRNA-containing transcripts are co-transcriptionally cleaved during their synthesis and suggest that exonucleolytic degradation from Drosha cleavage sites in pre-mRNAs may influence the splicing and maturation of numerous mRNAs

Crossref

Oxford University Research Archive

Archivio della ricerca- Università di Roma La Sapienza

Primary microRNA transcripts are processed co-transcriptionally

Author: A Rodriguez
A Shiohama
AM Denli
D Haussecker
D Kampa
DS Schwarz
E Basyuk
E Lund
EC Forsberg
EJ Wagner
F Fazi
Francesca Pagano
GJ Hannon
H Zhou
HK Saini
Irene Bozzoni
J Han
J Kluiver
J Wuarin
JM Pawlicki
JM Thomson
K Glover-Cutter
K Ryman
KH Yeom
LH Qu
M Ballarino
M Danin-Kreiselman
M Kim
M Lagos-Quintana
M Lagos-Quintana
M Megraw
MA Valencia-Sanchez
Mariangela Morlando
MJ Dye
MJ Dye
MJ Dye
MJ Dye
Monica Ballarino
N Gromak
N Gromak
Natalia Gromak
Nick J Proudfoot
NJ Proudfoot
NR Smalheiser
P Richard
PK Yang
RI Gregory
S Baskerville
S Griffiths-Jones
S Guil
S West
S West
S West
T Hirose
T Kiss
TA Clark
V de Turris
VN Kim
WJ Kent
Y Lee
Y Lee
YK Kim
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref