
    Interpretable Structure-Evolving LSTM

    This paper develops a general framework for learning interpretable data representations via Long Short-Term Memory (LSTM) recurrent neural networks over hierarchical graph structures. Instead of learning LSTM models over pre-fixed structures, we propose to further learn the intermediate interpretable multi-level graph structures in a progressive and stochastic way from data during the LSTM network optimization. We thus call this model the structure-evolving LSTM. In particular, starting with an initial element-level graph representation where each node is a small data element, the structure-evolving LSTM gradually evolves the multi-level graph representations by stochastically merging graph nodes with high compatibilities along the stacked LSTM layers. In each LSTM layer, we estimate the compatibility of two connected nodes from their corresponding LSTM gate outputs, which is used to generate a merging probability. Candidate graph structures are accordingly generated, with the nodes grouped into cliques according to their merging probabilities. We then produce the new graph structure with a Metropolis-Hastings algorithm, which alleviates the risk of getting stuck in local optima through stochastic sampling with an acceptance probability. Once a graph structure is accepted, a higher-level graph is constructed by taking the partitioned cliques as its nodes. During the evolving process, the representation becomes more abstract at higher levels, where redundant information is filtered out, allowing more efficient propagation of long-range data dependencies. We evaluate the effectiveness of the structure-evolving LSTM on semantic object parsing and demonstrate its advantage over state-of-the-art LSTM models on standard benchmarks.
    Comment: To appear in CVPR 2017 as a spotlight paper.
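    The core evolution step, stochastically merging compatible nodes and then accepting or rejecting the resulting structure Metropolis-Hastings style, can be pictured with a minimal sketch. The energy function, temperature, and helper names below are illustrative assumptions; the paper's exact acceptance probability is not reproduced here.

```python
import math
import random

def mh_accept(candidate_energy, current_energy, temperature=1.0):
    """Metropolis-Hastings acceptance test (energy and temperature are
    assumed quantities, not the paper's exact formulation)."""
    delta = candidate_energy - current_energy
    if delta <= 0:
        return True  # always accept a structure that lowers the energy
    # Otherwise accept with probability exp(-delta / T); this occasional
    # uphill move is what lets the search escape local optima.
    return random.random() < math.exp(-delta / temperature)

def propose_merges(edges, merge_prob):
    """Stochastically collapse connected node pairs: each edge (u, v) is
    merged with its estimated merging probability. Hypothetical helper;
    the paper derives merge_prob from the LSTM gate outputs."""
    return [(u, v) for (u, v) in edges if random.random() < merge_prob[(u, v)]]
```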

    Continuous Learning in a Hierarchical Multiscale Neural Network

    We reformulate the problem of encoding a multi-scale representation of a sequence in a language model by casting it in a continuous learning framework. We propose a hierarchical multi-scale language model in which short time-scale dependencies are encoded in the hidden state of a lower-level recurrent neural network, while longer time-scale dependencies are encoded in the dynamics of the lower-level network by having a meta-learner update its weights in an online meta-learning fashion. We use elastic weight consolidation as a higher-level mechanism to prevent catastrophic forgetting in our continuous learning framework.
    Comment: 5 pages, 2 figures, accepted as a short paper at ACL 2018.
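    Elastic weight consolidation anchors parameters deemed important for earlier data with a quadratic penalty. A generic PyTorch sketch is shown below; how it is paired with the meta-learner, and the hyperparameter values, are assumptions rather than the paper's exact setup.

```python
import torch

def ewc_penalty(model, anchor_params, fisher, lam=0.4):
    """Quadratic cost for drifting away from a parameter snapshot,
    weighted by the (diagonal) Fisher information. anchor_params and
    fisher are dicts keyed by parameter name, both assumed to have been
    recorded after the previous task."""
    loss = torch.zeros(())
    for name, p in model.named_parameters():
        loss = loss + (fisher[name] * (p - anchor_params[name]) ** 2).sum()
    return 0.5 * lam * loss

# Hypothetical usage: total_loss = task_loss + ewc_penalty(lm, snap, fisher)
```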

    Semantic Object Parsing with Graph LSTM

    By taking the semantic object parsing task as an exemplar application scenario, we propose the Graph Long Short-Term Memory (Graph LSTM) network, which generalizes LSTM from sequential or multi-dimensional data to general graph-structured data. In particular, instead of evenly and rigidly dividing an image into pixels or patches, as in existing multi-dimensional LSTM structures (e.g., Row, Grid and Diagonal LSTMs), we take each arbitrary-shaped superpixel as a semantically consistent node and adaptively construct an undirected graph for each image, where the spatial relations of the superpixels naturally serve as edges. Constructed on such an adaptive graph topology, the Graph LSTM is more naturally aligned with the visual patterns in the image (e.g., object boundaries or appearance similarities) and provides a more economical information propagation route. Furthermore, for each optimization step over the Graph LSTM, we propose a confidence-driven scheme that updates the hidden and memory states of nodes progressively until all nodes are updated. In addition, for each node, the forget gates are adaptively learned to capture different degrees of semantic correlation with neighboring nodes. Comprehensive evaluations on four diverse semantic object parsing datasets demonstrate the significant superiority of our Graph LSTM over other state-of-the-art solutions.
    Comment: 18 pages.
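    The confidence-driven update can be sketched as a single sweep that visits nodes from most to least confident, feeding each node its aggregated neighbor states. This is a forward-only simplification built on a stock torch.nn.LSTMCell; the real model learns per-neighbor forget gates, which the stock cell does not expose, and the confidence scores are assumed to come from a separate predictor.

```python
import torch

@torch.no_grad()  # forward-only sketch; in-place state writes below
def graph_lstm_sweep(h, c, x, adj, cell, confidence):
    """h, c: [num_nodes, hidden] states; x: [num_nodes, feat] superpixel
    features; adj: dict node -> neighbor list; cell: nn.LSTMCell with
    input size feat + hidden; confidence: [num_nodes] scores."""
    # Visit nodes from most to least confident, so reliable nodes
    # propagate their states to uncertain neighbors first.
    for i in torch.argsort(confidence, descending=True).tolist():
        neigh = adj[i]
        # Aggregate neighbor hidden states over the adaptive graph
        # (mean pooling here; the paper's adaptive gating is richer).
        h_n = h[neigh].mean(dim=0) if neigh else torch.zeros_like(h[i])
        inp = torch.cat([x[i], h_n]).unsqueeze(0)
        h_i, c_i = cell(inp, (h[i].unsqueeze(0), c[i].unsqueeze(0)))
        h[i], c[i] = h_i.squeeze(0), c_i.squeeze(0)
    return h, c
```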

    Algorithms for identification and categorization

    The main features of a family of efficient algorithms for recognition and classification of complex patterns are briefly reviewed. They are inspired by the observation that fast synaptic noise is essential for some of the processing of information in the brain.
    Comment: 6 pages, 5 figures.

    Brain enhancement through cognitive training: A new insight from brain connectome

    Owing to recent advances in neurotechnology and progress in understanding brain cognitive functions, improving cognitive performance or accelerating learning with brain enhancement systems is no longer out of reach; on the contrary, it is a tangible target of contemporary research. Although a variety of approaches have been proposed, we focus mainly on cognitive training interventions, in which learners repeatedly perform cognitive tasks to improve their cognitive abilities. In this review article, we propose that the learning process during cognitive training can be facilitated by an assistive system that monitors cognitive workload using electroencephalography (EEG) biomarkers, and that the brain connectome approach can provide additional valuable biomarkers for facilitating learners' progress. To this end, we introduce studies on cognitive training interventions, EEG biomarkers of cognitive workload, and the human brain connectome. Because cognitive overload and mental fatigue can reduce or even eliminate the gains of cognitive training interventions, real-time monitoring of cognitive workload can facilitate learning by flexibly adjusting the difficulty level of the training task. Moreover, cognitive training interventions act on brain sub-networks rather than on a single brain region, and graph-theoretical network metrics quantifying the topological architecture of the brain network can differentiate individual cognitive states as well as different individuals' cognitive abilities, suggesting that the connectome is a valuable approach for tracking learning progress. Although only a few studies have so far exploited the connectome approach to study alterations of the brain network induced by cognitive training interventions, we believe it will be a useful technique for capturing improvements of cognitive function.
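    As an illustration of the graph-theoretical metrics mentioned above, the sketch below derives a few standard summaries from a channel-by-channel connectivity matrix (e.g., EEG coherence between electrode pairs). The proportional thresholding scheme and the particular metrics are assumptions chosen for illustration, not a protocol from the review.

```python
import numpy as np
import networkx as nx

def connectome_metrics(connectivity, density=0.2):
    """Graph-theoretical summary of a brain network.
    connectivity: square symmetric array of channel-to-channel coupling."""
    n = connectivity.shape[0]
    # Keep only the strongest edges so every subject's graph has the
    # same edge density before metrics are compared (assumed convention).
    triu = connectivity[np.triu_indices(n, k=1)]
    thresh = np.quantile(triu, 1.0 - density)
    G = nx.from_numpy_array((connectivity >= thresh).astype(int))
    G.remove_edges_from(nx.selfloop_edges(G))
    return {
        "clustering": nx.average_clustering(G),   # local segregation
        "efficiency": nx.global_efficiency(G),    # global integration
        "degree_mean": np.mean([d for _, d in G.degree()]),
    }
```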

    Unstable Dynamics, Nonequilibrium Phases and Criticality in Networked Excitable Media

    Here we numerically study a model of excitable media, namely a network with occasionally quiet nodes and connection weights that vary with activity on a short time scale. Even in the absence of stimuli, this exhibits unstable dynamics and nonequilibrium phases, including one in which the global activity wanders irregularly among attractors, and 1/f noise as the system falls into its most irregular behavior. A net result is resilience, which yields an efficient search of the model's attractor space and can explain the origin of certain phenomenology in neural, genetic, and ill-condensed matter systems. By extensive computer simulation we also address a previously conjectured relation between observed power-law distributions and the occurrence of a "critical state" during functionality of (e.g.) cortical networks, and we describe the precise nature of such criticality in the model.
    Comment: 18 pages, 9 figures.
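    A toy version of such dynamics, binary excitable units driven through rapidly depressing synapses, with nodes occasionally falling quiet, can be sketched as follows. The update rules and parameter values are assumptions for illustration; the paper's exact model is not reproduced here.

```python
import numpy as np

def simulate(W, steps=2000, theta=0.5, phi=0.8, rec=0.1, quiet=0.05, seed=0):
    """W: static synaptic weight matrix; theta: firing threshold;
    phi: fraction by which a used synapse is depressed; rec: per-step
    recovery of the depression variable; quiet: silencing probability."""
    rng = np.random.default_rng(seed)
    n = W.shape[0]
    s = (rng.random(n) < 0.5).astype(float)  # binary node activities
    x = np.ones_like(W)                      # short-time depression factors
    activity = []
    for _ in range(steps):
        drive = (W * x) @ s                  # input through depressed weights
        s = (drive > theta).astype(float)    # fire when input exceeds threshold
        s *= rng.random(n) >= quiet          # occasionally quiet nodes
        x *= 1.0 - phi * s[np.newaxis, :]    # depress synapses of firing nodes
        x += rec * (1.0 - x)                 # slow recovery toward full strength
        activity.append(s.mean())
    return np.array(activity)                # global activity time series
```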