Search CORE

18,555 research outputs found

The Computational Power of Interactive Recurrent Neural Networks

Author: Hava T. Siegelmann
Jérémie Cabessa
Minsky M. L.
Turing A. M.
van Leeuwen J.
Publication venue: 'MIT Press - Journals'
Publication date
Field of study

Deep Markov Random Field for Image Modeling

Author: A Dempster
Alex Graves
C Dong
E Ising
GE Hinton
GR Cross
H Rue
J Duchi
J Pearl
J Portilla
J Shotton
J Wright
James Hays
Jean-François Lalonde
M Schuster
PJ Werbos
R Timofte
S Geman
SC Zhu
SZ Li
WT Freeman
Publication venue
Publication date: 07/09/2016
Field of study

Markov Random Fields (MRFs), a formulation widely used in generative image modeling, have long been plagued by the lack of expressive power. This issue is primarily due to the fact that conventional MRFs formulations tend to use simplistic factors to capture local patterns. In this paper, we move beyond such limitations, and propose a novel MRF model that uses fully-connected neurons to express the complex interactions among pixels. Through theoretical analysis, we reveal an inherent connection between this model and recurrent neural networks, and thereon derive an approximated feed-forward network that couples multiple RNNs along opposite directions. This formulation combines the expressive power of deep neural networks and the cyclic dependency structure of MRF in a unified model, bringing the modeling capability to a new level. The feed-forward approximation also allows it to be efficiently learned from data. Experimental results on a variety of low-level vision tasks show notable improvement over state-of-the-arts.Comment: Accepted at ECCV 201

arXiv.org e-Print Archive

Crossref

Conversion of Artificial Recurrent Neural Networks to Spiking Neural Networks for Low-power Neuromorphic Hardware

Author: Cassidy Andrew
Diehl Peter U
IEEE
Neftci Emre
Pedroni Bruno U
Zarrella Guido
Publication venue
Publication date: 01/01/2016
Field of study

In recent years the field of neuromorphic low-power systems that consume orders of magnitude less power gained significant momentum. However, their wider use is still hindered by the lack of algorithms that can harness the strengths of such architectures. While neuromorphic adaptations of representation learning algorithms are now emerging, efficient processing of temporal sequences or variable length-inputs remain difficult. Recurrent neural networks (RNN) are widely used in machine learning to solve a variety of sequence learning tasks. In this work we present a train-and-constrain methodology that enables the mapping of machine learned (Elman) RNNs on a substrate of spiking neurons, while being compatible with the capabilities of current and near-future neuromorphic systems. This "train-and-constrain" method consists of first training RNNs using backpropagation through time, then discretizing the weights and finally converting them to spiking RNNs by matching the responses of artificial neurons with those of the spiking neurons. We demonstrate our approach by mapping a natural language processing task (question classification), where we demonstrate the entire mapping process of the recurrent layer of the network on IBM's Neurosynaptic System "TrueNorth", a spike-based digital neuromorphic hardware architecture. TrueNorth imposes specific constraints on connectivity, neural and synaptic parameters. To satisfy these constraints, it was necessary to discretize the synaptic weights and neural activities to 16 levels, and to limit fan-in to 64 inputs. We find that short synaptic delays are sufficient to implement the dynamical (temporal) aspect of the RNN in the question classification task. The hardware-constrained model achieved 74% accuracy in question classification while using less than 0.025% of the cores on one TrueNorth chip, resulting in an estimated power consumption of ~17 uW

arXiv.org e-Print Archive

Crossref

eScholarship - University of California

ZORA

Attentional Encoder Network for Targeted Sentiment Classification

Author: Jiang Tao
Liu Zhiyue
Rao Yanghui
Song Youwei
Wang Jiahai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2019
Field of study

Targeted sentiment classification aims at determining the sentimental tendency towards specific targets. Most of the previous approaches model context and target words with RNN and attention. However, RNNs are difficult to parallelize and truncated backpropagation through time brings difficulty in remembering long-term patterns. To address this issue, this paper proposes an Attentional Encoder Network (AEN) which eschews recurrence and employs attention based encoders for the modeling between context and target. We raise the label unreliability issue and introduce label smoothing regularization. We also apply pre-trained BERT to this task and obtain new state-of-the-art results. Experiments and analysis demonstrate the effectiveness and lightweight of our model.Comment: 7 page

arXiv.org e-Print Archive

Crossref

Event Representations for Automated Story Generation with Deep Neural Nets

Author: Ammanabrolu Prithviraj
Hancock William
Harrison Brent
Martin Lara J.
Riedl Mark O.
Singh Shruti
Wang Xinyu
Publication venue
Publication date: 12/09/2017
Field of study

Automated story generation is the problem of automatically selecting a sequence of events, actions, or words that can be told as a story. We seek to develop a system that can generate stories by learning everything it needs to know from textual story corpora. To date, recurrent neural networks that learn language models at character, word, or sentence levels have had little success generating coherent stories. We explore the question of event representations that provide a mid-level of abstraction between words and sentences in order to retain the semantic information of the original data while minimizing event sparsity. We present a technique for preprocessing textual story data into event sequences. We then present a technique for automated story generation whereby we decompose the problem into the generation of successive events (event2event) and the generation of natural language sentences from events (event2sentence). We give empirical results comparing different event representations and their effects on event successor generation and the translation of events to natural language.Comment: Submitted to AAAI'1

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications