Search CORE

1,642 research outputs found

$L_0$ -ARM: Network Sparsification via Stochastic Binary Optimization

Author: D Silver
RJ Williams
Y Lecun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/09/2019
Field of study

We consider network sparsification as an

L_0

-norm regularized binary optimization problem, where each unit of a neural network (e.g., weight, neuron, or channel, etc.) is attached with a stochastic binary gate, whose parameters are jointly optimized with original network parameters. The Augment-Reinforce-Merge (ARM), a recently proposed unbiased gradient estimator, is investigated for this binary optimization problem. Compared to the hard concrete gradient estimator from Louizos et al., ARM demonstrates superior performance of pruning network architectures while retaining almost the same accuracies of baseline methods. Similar to the hard concrete estimator, ARM also enables conditional computation during model training but with improved effectiveness due to the exact binary stochasticity. Thanks to the flexibility of ARM, many smooth or non-smooth parametric functions, such as scaled sigmoid or hard sigmoid, can be used to parameterize this binary optimization problem and the unbiasness of the ARM estimator is retained, while the hard concrete estimator has to rely on the hard sigmoid function to achieve conditional computation and thus accelerated training. Extensive experiments on multiple public datasets demonstrate state-of-the-art pruning rates with almost the same accuracies of baseline methods. The resulting algorithm

L_0

-ARM sparsifies the Wide-ResNet models on CIFAR-10 and CIFAR-100 while the hard concrete estimator cannot. The code is public available at https://github.com/leo-yangli/l0-arm.Comment: Published as a conference paper at ECML 201

arXiv.org e-Print Archive

Crossref

Sample-Efficient Model-Free Reinforcement Learning with Off-Policy Critics

Author: AG Barto
C Watkins
D Silver
LJ Lin
R Bellman
RJ Williams
VR Konda
WR Thompson
Publication venue
Publication date: 12/06/2019
Field of study

Value-based reinforcement-learning algorithms provide state-of-the-art results in model-free discrete-action settings, and tend to outperform actor-critic algorithms. We argue that actor-critic algorithms are limited by their need for an on-policy critic. We propose Bootstrapped Dual Policy Iteration (BDPI), a novel model-free reinforcement-learning algorithm for continuous states and discrete actions, with an actor and several off-policy critics. Off-policy critics are compatible with experience replay, ensuring high sample-efficiency, without the need for off-policy corrections. The actor, by slowly imitating the average greedy policy of the critics, leads to high-quality and state-specific exploration, which we compare to Thompson sampling. Because the actor and critics are fully decoupled, BDPI is remarkably stable, and unusually robust to its hyper-parameters. BDPI is significantly more sample-efficient than Bootstrapped DQN, PPO, and ACKTR, on discrete, continuous and pixel-based tasks. Source code: https://github.com/vub-ai-lab/bdpi.Comment: Accepted at the European Conference on Machine Learning 2019 (ECML

arXiv.org e-Print Archive

VU Research Portal

Crossref

Visual Rationalizations in Deep Reinforcement Learning for Atari Games

Author: D Silver
LA Hendricks
RJ Williams
RJ Williams
RL Teach
V Mnih
Publication venue
Publication date: 01/01/2018
Field of study

Due to the capability of deep learning to perform well in high dimensional problems, deep reinforcement learning agents perform well in challenging tasks such as Atari 2600 games. However, clearly explaining why a certain action is taken by the agent can be as important as the decision itself. Deep reinforcement learning models, as other deep learning models, tend to be opaque in their decision-making process. In this work, we propose to make deep reinforcement learning more transparent by visualizing the evidence on which the agent bases its decision. In this work, we emphasize the importance of producing a justification for an observed action, which could be applied to a black-box decision agent.Comment: presented as oral talk at BNAIC 201

arXiv.org e-Print Archive

Crossref

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Zero inventory and firm performance: a management paradigm revisited

Author: Andreas Donhauser
Balakrishnan R
Blinder AS
Hall RW
Hayes RH
Hirsch AA
Hopp WJ
Monden Y
Monden Y
Nakane J
Robert Obermaier
Schonberger RJ
Silver EA
Womack JP
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

An Application of Molecular Genotyping in Mice

Author: A Gropp
AG Searle
AG Searle
AG Searle
AG Searle
BM Cattanach
BM Cattanach
CA Redi
CM Williamson
E Capanna
H Hamada
L Silver
LAM Underkoffler
P Liang
RJ Oakey
T Maniatis
W Dietrich
Publication venue: Biological Procedures Online
Publication date: 01/01/2003
Field of study

Microsatellite markers are simple sequence repeats within the mammalian genome that can be used for identifying disease loci, mapping genes of interest as well as studying segregation patterns related to meiotic nondisjunction. Different strains of mice have variable CA repeat lengths and PCR based methods can be used to identify them, thus allowing for specific genotypes to be assigned. Molecular genotyping offers such identification at any developmental stage, which allows for a broad range of anomalies to be studied. We studied chromosomal segregation in relation to nondisjunction in early-gestation mouse embryos using molecular genotyping. Information on the parental origin as well as the number of chromosomes a given progeny carried was obtained in our analysis

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

King's Research Portal

Taking gradients through experiments: LSTMs and memory proximal policy optimization for black-box quantum control

Author: AM Souza
D Silver
G Quiroz
H Robbins
J Biamonte
JJ Sakurai
L Viola
M August
N Khaneja
P Doria
P Palittapongarnpim
PB Wigley
RJ Williams
S Hochreiter
T Caneva
U Schollwöck
V Mnih
Publication venue: ISC High Performance 2018: High Performance Computing
Publication date: 01/01/2018
Field of study

In this work we introduce the application of black-box quantum control as an interesting rein- forcement learning problem to the machine learning community. We analyze the structure of the reinforcement learning problems arising in quantum physics and argue that agents parameterized by long short-term memory (LSTM) networks trained via stochastic policy gradients yield a general method to solving them. In this context we introduce a variant of the proximal policy optimization (PPO) algorithm called the memory proximal policy optimization (MPPO) which is based on this analysis. We then show how it can be applied to specific learning tasks and present results of nu- merical experiments showing that our method achieves state-of-the-art results for several learning tasks in quantum control with discrete and continouous control parameters

arXiv.org e-Print Archive

Crossref

Apollo (Cambridge)

Importance sampling in reinforcement learning with an estimated behavior policy

Author: A Dvoretzky
B Delyon
C Gelada
CD Manning
CJ Oates
D Silver
E Greensmith
K Hirano
M Henmi
ML Puterman
PC Austin
PR Rosenbaum
Q Liu
R Bellman
RJ Williams
RS Sutton
RY Rubinstein
S Mahadevan
SP Singh
V Mnih
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/06/2021
Field of study

Crossref

Edinburgh Research Explorer

Dysregulation of DGCR6 and DGCR6L: psychopathological outcomes in chromosome 22q11.2 deletion syndrome

Author: A Petronis
A Swillen
A Swillen
AS Bassett
B Crespi
B Glaser
B Tycko
BA Cornblatt
BP Hierck
C Feinstein
CE Bearden
D Dolinoy
D Zunner
DA Driscoll
F da Silva Alves
GJ Chelune
H Liu
H Silver
J Walter
K Lewandowski
K Livak
KC Murphy
L Edelmann
L Erlenmeyer-Kimling
M Dernovsek
M Ehrich
M Gerdes
M Karayiorgou
M Woodin
MW Otto
PP Luedi
PP Luedi
R Das
R Hartmer
RJ Shprintzen
RJ Shprintzen
RJ Shprintzen
RJ Shprintzen
RS Kern
S Cassidy
S Choufani
S Eliez
TM Maynard
TM Maynard
V Shashi
V Shashi
VK Rakyan
Publication venue: Nature Publishing Group
Publication date: 01/01/2012
Field of study

Chromosome 22q11.2 deletion syndrome (22q11DS) is the most common microdeletion syndrome in humans. It is typified by highly variable symptoms, which might be explained by epigenetic regulation of genes in the interval. Using computational algorithms, our laboratory previously predicted that DiGeorge critical region 6 (DGCR6), which lies within the deletion interval, is imprinted in humans. Expression and epigenetic regulation of this gene have not, however, been examined in 22q11DS subjects. The purpose of this study was to determine if the expression levels of DGCR6 and its duplicate copy DGCR6L in 22q11DS subjects are associated with the parent-of-origin of the deletion and childhood psychopathologies. Our investigation showed no evidence of parent-of-origin-related differences in expression of both DGCR6 and DGCR6L. However, we found that the variability in DGCR6 expression was significantly greater in 22q11DS children than in age and gender-matched control individuals. Children with 22q11DS who had anxiety disorders had significantly lower DGCR6 expression, especially in subjects with the deletion on the maternal chromosome, despite the lack of imprinting. Our findings indicate that epigenetic mechanisms other than imprinting contribute to the dysregulation of these genes and the associated childhood psychopathologies observed in individuals with 22q11DS. Further studies are now needed to test the usefulness of DGCR6 and DGCR6L expression and alterations in the epigenome at these loci in predicting childhood anxiety and associated adult-onset pathologies in 22q11DS subjects

Crossref

Harvard University - DASH

PubMed Central

Carolina Digital Repository

Quality and Safety Aspects of Infant Nutrition

Quality and safety aspects of infant nutrition are of key importance for child health, but oftentimes they do not get much attention by health care professionals whose interest tends to focus on functional benefits of early nutrition. Unbalanced diets and harmful food components induce particularly high risks for untoward effects in infants because of their rapid growth, high nutrient needs, and their typical dependence on only one or few foods during the first months of life. The concepts, standards and practices that relate to infant food quality and safety were discussed at a scientific workshop organized by the Child Health Foundation and the Early Nutrition Academy jointly with the European Society for Paediatric Gastroenterology, Hepatology and Nutrition, and a summary is provided here. The participants reviewed past and current issues on quality and safety, the role of different stakeholders, and recommendations to avert future issues. It was concluded that a high level of quality and safety is currently achieved, but this is no reason for complacency. The food industry carries the primary responsibility for the safety and suitability of their products, including the quality of composition, raw materials and production processes. Introduction of new or modified products should be preceded by a thorough science based review of suitability and safety by an independent authority. Food safety events should be managed on an international basis. Global collaboration of food producers, food-safety authorities, paediatricians and scientists is needed to efficiently exchange information and to best protect public health. Copyright (C) 2012 S. Karger AG, Base

Crossref

Open Access LMU

Transplantation of canine olfactory ensheathing cells producing chondroitinase ABC promotes chondroitin sulphate proteoglycan digestion and axonal sprouting following spinal cord injury

Author: A Mackay-Sim
A Mackay-Sim
A Ramón-Cueto
AW Barritt
BJ Dlouhy
C Deng
C Kachramanoglou
CH Tator
CM Galtrey
CM Galtrey
CM Galtrey
CP Hofstetter
D Carwardine
D Choi
D Choi
D Ito
D Reginensi
D Wang
Darren Carwardine
E Neafsey
EJ Bradbury
EJ Bradbury
EJ Bradbury
Elizabeth M. Muir
EM Muir
ER Burnside
F Feron
F Roloff
G Garcia-Alias
G Garcia-Alias
H Lee
HH Nash
J Silver
JA Ekberg
Jacob Neeves
James Uney
JC Kwok
JL Kueh
JM Massey
Jonathan Prager
K Bartus
K Oprych
KC Roet
L Moon
L Naldini
LC Weaver
Liang-Fong Wong
LL Jones
M Gossen
M Ruitenberg
MB Bunge
MH Grider
MI Chuah
MI Romero
Michael G Fehlings
ML Starkey
N Granger
N Jeffery
NA Silva
ND James
Nicolas Granger
NJ Tester
NJ Tester
P Brundin
P Tabakow
P Tabakow
PH Laverty
R Watzlawick
R-R Zhao
RJ Franklin
RR Khankan
S Karimi-Abdolrezaee
S Li
S Thuret
SA Moore
SC Jefferson
SJ Davies
T Hagg
T Imaizumi
T Kawahara
U Baron
WJ Alilain
WTJ Morgan
Y Li
Y Li
Y Wang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 11/12/2017
Field of study

Olfactory ensheathing cell (OEC) transplantation is a promising strategy for treating spinal cord injury (SCI), as has been demonstrated in experimental SCI models and naturally occurring SCI in dogs. However, the presence of chondroitin sulphate proteoglycans within the extracellular matrix of the glial scar can inhibit efficient axonal repair and limit the therapeutic potential of OECs. Here we have used lentiviral vectors to genetically modify canine OECs to continuously deliver mammalian chondroitinase ABC at the lesion site in order to degrade the inhibitory chondroitin sulphate proteoglycans in a rodent model of spinal cord injury. We demonstrate that these chondroitinase producing canine OECs survived at 4 weeks following transplantation into the spinal cord lesion and effectively digested chondroitin sulphate proteoglycans at the site of injury. There was evidence of sprouting within the corticospinal tract rostral to the lesion and an increase in the number of corticospinal axons caudal to the lesion, suggestive of axonal regeneration. Our results indicate that delivery of the chondroitinase enzyme can be achieved with the genetically modified OECs to increase axon growth following SCI. The combination of these two promising approaches is a potential strategy for promoting neural regeneration following SCI in veterinary practice and human patients

Crossref

Directory of Open Access Journals

Explore Bristol Research

FigShare