Search CORE

285 research outputs found

Temporal Consistency Objectives Regularize the Learning of Disentangled Representations

Author: A Chartsias
C Qin
JN Wood
O Bernard
O Ronneberger
W Bai
X Mao
Y Bengio
Y Bengio
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 29/08/2019
Field of study

There has been an increasing focus in learning interpretable feature representations, particularly in applications such as medical image analysis that require explainability, whilst relying less on annotated data (since annotations can be tedious and costly). Here we build on recent innovations in style-content representations to learn anatomy, imaging characteristics (appearance) and temporal correlations. By introducing a self-supervised objective of predicting future cardiac phases we improve disentanglement. We propose a temporal transformer architecture that given an image conditioned on phase difference, it predicts a future frame. This forces the anatomical decomposition to be consistent with the temporal cardiac contraction in cine MRI and to have semantic meaning with less need for annotations. We demonstrate that using this regularization, we achieve competitive results and improve semi-supervised segmentation, especially when very few labelled data are available. Specifically, we show Dice increase of up to 19\% and 7\% compared to supervised and semi-supervised approaches respectively on the ACDC dataset. Code is available at: https://github.com/gvalvano/sdtnet .Comment: 9 pages, 4 figures (1 .gif), 1 tabl

arXiv.org e-Print Archive

Crossref

Edinburgh Research Explorer

Bio-Inspired Multi-Layer Spiking Neural Network Extracts Discriminative Features from Speech Signals

Author: A Tavanaei
G Hinton
GR Doddington
JJ Wade
LR Rabiner
M Beyeler
N Kasabov
O Abdel-Hamid
S Ghosh-Dastidar
SG Wysoski
SR Kheradpisheh
T Masquelier
W Maass
Y Bengio
Y Bengio
Y Cao
Y Dan
Y LeCun
Y LeCun
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/06/2017
Field of study

Spiking neural networks (SNNs) enable power-efficient implementations due to their sparse, spike-based coding scheme. This paper develops a bio-inspired SNN that uses unsupervised learning to extract discriminative features from speech signals, which can subsequently be used in a classifier. The architecture consists of a spiking convolutional/pooling layer followed by a fully connected spiking layer for feature discovery. The convolutional layer of leaky, integrate-and-fire (LIF) neurons represents primary acoustic features. The fully connected layer is equipped with a probabilistic spike-timing-dependent plasticity learning rule. This layer represents the discriminative features through probabilistic, LIF neurons. To assess the discriminative power of the learned features, they are used in a hidden Markov model (HMM) for spoken digit recognition. The experimental results show performance above 96% that compares favorably with popular statistical feature extraction methods. Our results provide a novel demonstration of unsupervised feature acquisition in an SNN

arXiv.org e-Print Archive

Crossref

Collaborative Layer-wise Discriminative Learning in Deep Neural Networks

Author: C Cortes
C Farabet
C Xu
DR Cox
GE Hinton
J Nickolls
MD Zeiler
N Srivastava
O Russakovsky
P Viola
RO Duda
SS Haykin
Y Bengio
Y Wei
Publication venue
Publication date: 19/07/2016
Field of study

Intermediate features at different layers of a deep neural network are known to be discriminative for visual patterns of different complexities. However, most existing works ignore such cross-layer heterogeneities when classifying samples of different complexities. For example, if a training sample has already been correctly classified at a specific layer with high confidence, we argue that it is unnecessary to enforce rest layers to classify this sample correctly and a better strategy is to encourage those layers to focus on other samples. In this paper, we propose a layer-wise discriminative learning method to enhance the discriminative capability of a deep network by allowing its layers to work collaboratively for classification. Towards this target, we introduce multiple classifiers on top of multiple layers. Each classifier not only tries to correctly classify the features from its input layer, but also coordinates with other classifiers to jointly maximize the final classification performance. Guided by the other companion classifiers, each classifier learns to concentrate on certain training examples and boosts the overall performance. Allowing for end-to-end training, our method can be conveniently embedded into state-of-the-art deep networks. Experiments with multiple popular deep networks, including Network in Network, GoogLeNet and VGGNet, on scale-various object classification benchmarks, including CIFAR100, MNIST and ImageNet, and scene classification benchmarks, including MIT67, SUN397 and Places205, demonstrate the effectiveness of our method. In addition, we also analyze the relationship between the proposed method and classical conditional random fields models.Comment: To appear in ECCV 2016. Maybe subject to minor changes before camera-ready versio

arXiv.org e-Print Archive

Crossref

Convolutional LSTM Networks for Subcellular Localization of Proteins

Author: A Graves
A Höglund
A Prlić
C Magnan
G Dahl
HY Xiong
LJP Maaten Van Der
M Schuster
MCF Thomsen
O Emanuelsson
P Baldi
P Lena Di
S Briesemeister
S Henikoff
S Hochreiter
SF Altschul
T Blum
T Goldberg
T Petersen
Y Bengio
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Machine learning is widely used to analyze biological sequence data. Non-sequential models such as SVMs or feed-forward neural networks are often used although they have no natural way of handling sequences of varying length. Recurrent neural networks such as the long short term memory (LSTM) model on the other hand are designed to handle sequences. In this study we demonstrate that LSTM networks predict the subcellular location of proteins given only the protein sequence with high accuracy (0.902) outperforming current state of the art algorithms. We further improve the performance by introducing convolutional filters and experiment with an attention mechanism which lets the LSTM focus on specific parts of the protein. Lastly we introduce new visualizations of both the convolutional filters and the attention mechanisms and show how they can be used to extract biological relevant knowledge from the LSTM networks

arXiv.org e-Print Archive

Crossref

Copenhagen University Research Information System

Online Research Database In Technology

RoboCup 2D Soccer Simulation League: Evaluation Challenges

Author: A Bai
A Bai
D Budden
D Polani
DM Budden
H Akiyama
H Hamann
H Kitano
H Sayama
H Zhang
HD Burkhard
HS Greenwald
I Noda
J Schmidhuber
JT Lizier
K Ghazi-Zahedi
M Prokopenko
M Prokopenko
M Prokopenko
N Ay
N Tishby
O Obst
O Obst
OM Cliff
OM Cliff
OM Cliff
P Stone
R Der
R Der
R Pfeifer
S Nallaperuma
T Gabel
Y Bengio
Publication venue
Publication date: 14/06/2017
Field of study

We summarise the results of RoboCup 2D Soccer Simulation League in 2016 (Leipzig), including the main competition and the evaluation round. The evaluation round held in Leipzig confirmed the strength of RoboCup-2015 champion (WrightEagle, i.e. WE2015) in the League, with only eventual finalists of 2016 competition capable of defeating WE2015. An extended, post-Leipzig, round-robin tournament which included the top 8 teams of 2016, as well as WE2015, with over 1000 games played for each pair, placed WE2015 third behind the champion team (Gliders2016) and the runner-up (HELIOS2016). This establishes WE2015 as a stable benchmark for the 2D Simulation League. We then contrast two ranking methods and suggest two options for future evaluation challenges. The first one, "The Champions Simulation League", is proposed to include 6 previous champions, directly competing against each other in a round-robin tournament, with the view to systematically trace the advancements in the League. The second proposal, "The Global Challenge", is aimed to increase the realism of the environmental conditions during the simulated games, by simulating specific features of different participating countries.Comment: 12 pages, RoboCup-2017, Nagoya, Japan, July 201

arXiv.org e-Print Archive

Crossref

LEED: Label-Free Expression Editing via Disentanglement

Author: A Mollahosseini
C Cao
H Li
I Higgins
J Geng
J Johnson
K Nagano
L van der Maaten
O Langner
S Du
Y Bengio
Y Chang
Z Wang
Publication venue
Publication date: 01/01/2020
Field of study

Recent studies on facial expression editing have obtained very promising progress. On the other hand, existing methods face the constraint of requiring a large amount of expression labels which are often expensive and time-consuming to collect. This paper presents an innovative label-free expression editing via disentanglement (LEED) framework that is capable of editing the expression of both frontal and profile facial images without requiring any expression label. The idea is to disentangle the identity and expression of a facial image in the expression manifold, where the neutral face captures the identity attribute and the displacement between the neutral image and the expressive image captures the expression attribute. Two novel losses are designed for optimal expression disentanglement and consistent synthesis, including a mutual expression information loss that aims to extract pure expression-related features and a siamese loss that aims to enhance the expression similarity between the synthesized image and the reference image. Extensive experiments over two public facial expression datasets show that LEED achieves superior facial expression editing qualitatively and quantitatively.Comment: Accepted to ECCV 202

arXiv.org e-Print Archive

Crossref

DR-NTU (Digital Repository of NTU)

An ensemble of epoch-wise empirical Bayes for few-shot learning

Author: C Ju
DM Blei
F Hutter
F Li
J Bergstra
JH Friedman
L Breiman
L Li
LI Kuncheva
MD Hoffman
O Russakovsky
P Smyth
S Thrun
T Mitchell
Y Bengio
Y Freund
Z Chen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Ministry of Education, Singapore under its Academic Research Funding Tier

arXiv.org e-Print Archive

Crossref

Institutional Knowledge at Singapore Management University

MPG.PuRe

A Theory of Cheap Control in Embodied Systems

Author: B Sallans
C Paul
CE Shannon
CM Bishop
D Sol
DD Clark
DFB Haeufle
EA Rückert
G Montúfar
G Montúfar
G Montúfar
GE Hinton
GE Hinton
Guido Montúfar
GW Taylor
H Hauser
J Martens
J Pearl
JE Auerbach
Josh C. Bongard
K Aström
K Zahedi
K Zahedi
K Zahedi
Keyan Ghazi-Zahedi
L Berthouze
L Sokoloff
M Lungarella
N Ay
N Ay
N Le Roux
Nihat Ay
O Rivoire
R Pfeifer
R Pfeifer
R Pfeifer
R Pfeifer
RA Brooks
RA Brooks
T McGeer
V Braitenberg
VN Vapnik
W Bialek
Y Bengio
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 15/11/2014
Field of study

We present a framework for designing cheap control architectures for embodied agents. Our derivation is guided by the classical problem of universal approximation, whereby we explore the possibility of exploiting the agent's embodiment for a new and more efficient universal approximation of behaviors generated by sensorimotor control. This embodied universal approximation is compared with the classical non-embodied universal approximation. To exemplify our approach, we present a detailed quantitative case study for policy models defined in terms of conditional restricted Boltzmann machines. In contrast to non-embodied universal approximation, which requires an exponential number of parameters, in the embodied setting we are able to generate all possible behaviors with a drastically smaller model, thus obtaining cheap universal approximation. We test and corroborate the theory experimentally with a six-legged walking machine. The experiments show that the sufficient controller complexity predicted by our theory is tight, which means that the theory has direct practical implications. Keywords: cheap design, embodiment, sensorimotor loop, universal approximation, conditional restricted Boltzmann machineComment: 27 pages, 10 figure

arXiv.org e-Print Archive

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

FigShare

Automatically Segmenting the Left Atrium from Cardiac Images Using Successive 3D U-Nets and a Contour Loss

Author: C McGann
CA Morillo
CN Cecco De
F Isensee
G Litjens
J Liu
M Zoni-Berisso
Matthew D. Zeiler
O Oktay
O Ronneberger
X Li
Y Bengio
Ö Çiçek
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/09/2018
Field of study

International audienceRadiological imaging offers effective measurement of anatomy, which is useful in disease diagnosis and assessment. Previous study has shown that the left atrial wall remodeling can provide information to predict treatment outcome in atrial fibrillation. Nevertheless, the segmentation of the left atrial structures from medical images is still very time-consuming. Current advances in neural network may help creating automatic segmentation models that reduce the workload for clinicians. In this preliminary study, we propose automated, two-stage, three-dimensional U-Nets with convolutional neural network, for the challenging task of left atrial segmentation. Unlike previous two-dimensional image segmentation methods, we use 3D U-Nets to obtain the heart cavity directly in 3D. The dual 3D U-Net structure consists of, a first U-Net to coarsely segment and locate the left atrium, and a second U-Net to accurately segment the left atrium under higher resolution. In addition, we introduce a Contour loss based on additional distance information to adjust the final segmentation. We randomly split the data into training datasets (80 subjects) and validation datasets (20 subjects) to train multiple models, with different augmentation setting. Experiments show that the average Dice coefficients for validation datasets are around 0.91 - 0.92, the sensitivity around 0.90-0.94 and the specificity 0.99. Compared with traditional Dice loss, models trained with Contour loss in general offer smaller Hausdorff distance with similar Dice coefficient, and have less connected components in predictions. Finally, we integrate several trained models in an ensemble prediction to segment testing datasets

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

Born to learn: The inspiration, progress, and future of evolved plastic artificial neural networks

Author: Abbott
Abraham
Abraham
Alexander
Allis
Alpaydin
Anderson
Andrea Soltoggio
Angeline
Arifovic
Arnold
Arnold
Ay
Bailey
Baldwin
Baxter
Baxter
Bear
Beer
Bengio
Bengio
Bentley
Best
Bienenstock
Birmingham
Blynel
Boers
Bourlard
Brown
Bullinaria
Bullinaria
Bullinaria
Bullinaria
Bullinaria
Bullinaria
Bullinaria
Butz
Butz
Bäck
Cabessa
Carew
Carlson
Carver
Cervier
Chklovskii
Clark
Cliff
Clutton-Brock
Coleman
Cooper
Damasio
Darwin
Dawkins
Deary
Deng
de Vladar
Di Paolo
Di Paolo
Dobzhansky
Doidge
Downing
Downing
Doya
Doya
Draganski
Dudai
Durr
Edelman
Eiben
Ellefsen
Ellefsen
Eskridge
Fellous
Fernando
Fernando
Finnie
Floreano
Floreano
Floreano
Floreano
Floreano
Floreano
Fogel
Fontanari
Friston
Fujii
Funahashi
Gerstner
Gerstner
Goldberg
Goodfellow
Graves
Greve
Grossberg
Gruau
Gu
Gustafsson
Hansen
Happel
Harrington
Harris-Warrick
Harvey
Hasselmo
Hasselmo
Hasselmo
Hawkins
Hebb
Hensch
Hinton
Hinton
Hinton
Hochreiter
Hoinville
Holland
Holland
Hopkins
Hornby
Howard
Howard
Howard
Hull
Husbands
Izhikevich
Izhikevich
Jaeger
Jo
Kaas
Kandel
Kandel
Katz
Katz
Katz
Kenneth O. Stanley
Khan
Khan
Khan
Khan
Kirkpatrick
Kiyota
Klug
Kohonen
Kohonen
Kolb
Kolb
Kondo
Koutník
Koza
Krichmar
Krizhevsky
Kumaran
Kupfermann
Køppe
Lake
Lalejini
Lamprecht
Langton
Lanzi
LeCun
LeDoux
Lee
Lehman
Lehman
Lehman
Levy
Lüders
Lüders
Maass
Maniadakis
Marder
Marder
Markram
Mattiussi
Mayley
McClelland
McQuesten
Meng
Merzenich
Michalewicz
Michalski
Miconi
Miller
Miller
Millington
Monroe
Mouret
Murre
Niv
Nogueira
Nogueira
Nolfi
Nolfi
Nolfi
Nolfi
Norouzzadeh
Offerman
Oja
Oquab
Orchard
Ormrod
Pan
Pascual-Leone
Pavlov
Pehlevan
Pigliucci
Rauschecker
Rawal
Rescorla
Risi
Risi
Risi
Risi
Risi
Roberts
Robins
Roff
Rolls
Rumelhart
Russell
Russo
Sanchez
Schmidhuber
Schmidhuber
Schmidhuber
Schmidhuber
Schreiweis
Schroll
Schultz
Schultz
Schultz
Sebastian Risi
Silva
Silva
Silver
Sims
Sims
Sipper
Skinner
Skinner
Smith
Smith
Smith
Soltoggio
Soltoggio
Soltoggio
Soltoggio
Soltoggio
Soltoggio
Soltoggio
Soltoggio
Sporns
Staddon
Stanley
Stanley
Stanley
Stanley
Stanley
Steels
Stone
Suri
Suri
Sutton
Taylor
Tessier-Lavigne
Thorndike
Thrun
Thrun
Thrun
Tonelli
Tonelli
Urzelai
Varela
Venkatesh
Vitay
Wagner
Walters
Widrow
Willshaw
Yamauchi
Yao
Yao
Yoder
Young
Ziemke
Publication venue: 'Elsevier BV'
Publication date: 01/01/2018
Field of study

Biological plastic neural networks are systems of extraordinary computational capabilities shaped by evolution, development, and lifetime learning. The interplay of these elements leads to the emergence of adaptive behavior and intelligence. Inspired by such intricate natural phenomena, Evolved Plastic Artificial Neural Networks (EPANNs) use simulated evolution in-silico to breed plastic neural networks with a large variety of dynamics, architectures, and plasticity rules: these artificial systems are composed of inputs, outputs, and plastic components that change in response to experiences in an environment. These systems may autonomously discover novel adaptive algorithms, and lead to hypotheses on the emergence of biological adaptation. EPANNs have seen considerable progress over the last two decades. Current scientific and technological advances in artificial neural networks are now setting the conditions for radically new approaches and results. In particular, the limitations of hand-designed networks could be overcome by more flexible and innovative solutions. This paper brings together a variety of inspiring ideas that define the field of EPANNs. The main methods and results are reviewed. Finally, new opportunities and developments are presented

arXiv.org e-Print Archive

Loughborough University Institutional Repository

Crossref

The IT University of Copenhagen's Repository

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)