Search CORE

2,695 research outputs found

A comparative study of the D0 neural-network analysis of the top quark non-leptonic decay channel

Author: Odorico R.
Odorico R.
R Odorico
Rumelhart D. E.
Publication venue: 'IOP Publishing'
Publication date: 07/04/2000
Field of study

A simpler neural-network approach is presented for the analysis of the top quark non-leptonic decay channel in events of the D0 Collaboration. Results for the top quark signal are comparable to those found by the D0 Collaboration by a more elaborate handling of the event information used as input to the neural network.Comment: 5 pages, 1 figur

arXiv.org e-Print Archive

Crossref

CERN Document Server

Rhythmic inhibition allows neural networks to search for maximally consistent states

Author: Giacomo Indiveri
Hesham Mostafa
Hopfield J.
Koch C.
Lorenz K. Müller
Minsky M.
Mostafa H.
Rumelhart D.
von Helmholtz H.
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2015
Field of study

Gamma-band rhythmic inhibition is a ubiquitous phenomenon in neural circuits yet its computational role still remains elusive. We show that a model of Gamma-band rhythmic inhibition allows networks of coupled cortical circuit motifs to search for network configurations that best reconcile external inputs with an internal consistency model encoded in the network connectivity. We show that Hebbian plasticity allows the networks to learn the consistency model by example. The search dynamics driven by rhythmic inhibition enable the described networks to solve difficult constraint satisfaction problems without making assumptions about the form of stochastic fluctuations in the network. We show that the search dynamics are well approximated by a stochastic sampling process. We use the described networks to reproduce perceptual multi-stability phenomena with switching times that are a good match to experimental data and show that they provide a general neural framework which can be used to model other 'perceptual inference' phenomena

arXiv.org e-Print Archive

Crossref

ZORA

Supervised Learning in Multilayer Spiking Neural Networks

Author: André Grüning
Bohte S.
Elias J. G.
Gerstner W.
Hebb D. O.
Ioana Sporea
Maass W.
Rostro-Gonzalez H.
Rumelhart D. E.
Thorpe S. T.
Xin J.
Publication venue: 'MIT Press - Journals'
Publication date: 10/02/2012
Field of study

The current article introduces a supervised learning algorithm for multilayer spiking neural networks. The algorithm presented here overcomes some limitations of existing learning algorithms as it can be applied to neurons firing multiple spikes and it can in principle be applied to any linearisable neuron model. The algorithm is applied successfully to various benchmarks, such as the XOR problem and the Iris data set, as well as complex classifications problems. The simulations also show the flexibility of this supervised learning algorithm which permits different encodings of the spike timing patterns, including precise spike trains encoding.Comment: 38 pages, 4 figure

arXiv.org e-Print Archive

Crossref

Surrey Research Insight

Learned-Norm Pooling for Deep Feedforward and Recurrent Neural Networks

Author: A. Hyvärinen
D. Ciresan
D. Hubel
D.E. Rumelhart
J. Bergstra
J. Bergstra
K. Fukushima
M. Ranzato
M. Trebar
Y. LeCun
Publication venue
Publication date: 01/01/2014
Field of study

In this paper we propose and investigate a novel nonlinear unit, called

L_p

unit, for deep neural networks. The proposed

L_p

unit receives signals from several projections of a subset of units in the layer below and computes a normalized

L_p

norm. We notice two interesting interpretations of the

L_p

unit. First, the proposed unit can be understood as a generalization of a number of conventional pooling operators such as average, root-mean-square and max pooling widely used in, for instance, convolutional neural networks (CNN), HMAX models and neocognitrons. Furthermore, the

L_p

unit is, to a certain degree, similar to the recently proposed maxout unit (Goodfellow et al., 2013) which achieved the state-of-the-art object recognition results on a number of benchmark datasets. Secondly, we provide a geometrical interpretation of the activation function based on which we argue that the

L_p

unit is more efficient at representing complex, nonlinear separating boundaries. Each

L_p

unit defines a superelliptic boundary, with its exact shape defined by the order

p

. We claim that this makes it possible to model arbitrarily shaped, curved boundaries more efficiently by combining a few

L_p

units of different orders. This insight justifies the need for learning different orders for each unit in the model. We empirically evaluate the proposed

L_p

units on a number of datasets and show that multilayer perceptrons (MLP) consisting of the

L_p

units achieve the state-of-the-art results on a number of benchmark datasets. Furthermore, we evaluate the proposed

L_p

unit on the recently proposed deep recurrent neural networks (RNN).Comment: ECML/PKDD 201

arXiv.org e-Print Archive

CiteSeerX

Crossref

Recurrent Latent Variable Networks for Session-Based Recommendation

Author: Bastien Frédéric
Bengio Y.
Chatzis Sotirios P.
Duchi John
Glorot X.
Hidasi B.
Kingma D.
Kingma D. P.
Porteous Ian
Rendle S.
Rumelhart D.E.
Salakhutdinov Ruslan
Zhang Y.
Publication venue
Publication date: 13/06/2017
Field of study

In this work, we attempt to ameliorate the impact of data sparsity in the context of session-based recommendation. Specifically, we seek to devise a machine learning mechanism capable of extracting subtle and complex underlying temporal dynamics in the observed session data, so as to inform the recommendation algorithm. To this end, we improve upon systems that utilize deep learning techniques with recurrently connected units; we do so by adopting concepts from the field of Bayesian statistics, namely variational inference. Our proposed approach consists in treating the network recurrent units as stochastic latent variables with a prior distribution imposed over them. On this basis, we proceed to infer corresponding posteriors; these can be used for prediction and recommendation generation, in a way that accounts for the uncertainty in the available sparse training data. To allow for our approach to easily scale to large real-world datasets, we perform inference under an approximate amortized variational inference (AVI) setup, whereby the learned posteriors are parameterized via (conventional) neural networks. We perform an extensive experimental evaluation of our approach using challenging benchmark datasets, and illustrate its superiority over existing state-of-the-art techniques

arXiv.org e-Print Archive

Crossref

Ktisis

Can a connectionist model explain the processing of regularly and irregularly inflected words in German as L1 and L2?

Author: Birdsong D.
Francis W. N.
Maratsos M.
Meier H.
Pfeffer J.
Pinker S.
Rumelhart D.
Smolka E.
Tilo Strobach
Ute Schönpflug
Publication venue: 'SAGE Publications'
Publication date: 01/01/2011
Field of study

The connectionist model is a prevailing model of the structure and functioning of the cognitive system of the processing of morphology. According to this model, the morphology of regularly and irregularly inflected words (e.g., verb participles and noun plurals) is processed in the same cognitive network. A validation of the connectionist model of the processing of morphology in German as L2 has yet to be achieved. To investigate L2-specific aspects, we compared a group of L1 speakers of German with speakers of German as L2. L2 and L1 speakers of German were assigned to their respective group by their reaction times in picture naming prior to the central task. The reaction times in the lexical decision task of verb participles and noun plurals were largely consistent with the assumption of the connectionist model. Interestingly, speakers of German as L2 showed a specific advantage for irregular compared with regular verb participles

Institutional Repository of the Freie Universität Berlin

Crossref

Open Access LMU ( Ludwig-Maximilians-Univ. München)

Audio Event Detection using Weakly Labeled Data

Author: Gencoglu O.
J. F.
Kons Z.
Kumar A.
Mandel M. I.
Pancoast S.
Pikrakis A.
Rumelhart D. E.
Stowell D.
Wang F.
Wang J.
Werbos P. J.
Zhou Z.-H.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 06/07/2016
Field of study

Acoustic event detection is essential for content analysis and description of multimedia recordings. The majority of current literature on the topic learns the detectors through fully-supervised techniques employing strongly labeled data. However, the labels available for majority of multimedia data are generally weak and do not provide sufficient detail for such methods to be employed. In this paper we propose a framework for learning acoustic event detectors using only weakly labeled data. We first show that audio event detection using weak labels can be formulated as an Multiple Instance Learning problem. We then suggest two frameworks for solving multiple-instance learning, one based on support vector machines, and the other on neural networks. The proposed methods can help in removing the time consuming and expensive process of manually annotating data to facilitate fully supervised learning. Moreover, it can not only detect events in a recording but can also provide temporal locations of events in the recording. This helps in obtaining a complete description of the recording and is notable since temporal information was never known in the first place in weakly labeled data.Comment: ACM Multimedia 201

arXiv.org e-Print Archive

Crossref

A Comparison of the Use of Binary Decision Trees and Neural Networks in Top Quark Detection

Author: D. E. Rumelhart
David Bowser-Chao
Debra L. Dzialo
H. Baer
H. Baer
H.-U. Bengtsson
J. L. Bentley
S. M. Omohundro
W. H. Press
Publication venue: 'American Physical Society (APS)'
Publication date: 04/09/1992
Field of study

The use of neural networks for signal vs.~background discrimination in high-energy physics experiment has been investigated and has compared favorably with the efficiency of traditional kinematic cuts. Recent work in top quark identification produced a neural network that, for a given top quark mass, yielded a higher signal to background ratio in Monte Carlo simulation than a corresponding set of conventional cuts. In this article we discuss another pattern-recognition algorithm, the binary decision tree. We have applied a binary decision tree to top quark identification at the Tevatron and found it to be comparable in performance to the neural network. Furthermore, reservations about the "black box" nature of neural network discriminators do not apply to binary decision trees; a binary decision tree may be reduced to a set of kinematic cuts subject to conventional error analysis.Comment: 14pp. Plain TeX + mtexsis.tex (latter available through 'get mtexsis.tex'.) Two postscript files avail. by emai

arXiv.org e-Print Archive

CiteSeerX

Crossref

Recommended from our members

An Overview of the Use of Neural Networks for Data Mining Tasks

Author: Alberts B
Alpaydin E
Ando T
Blake CL
Bramer MA
Castanheira LG
Han J
Lu H
Mitchell M
Ni X
Quinlan RJ
Rumelhart DE
Shafer JC
Shendure J
Simić D
Stahl F
Steinwart I
Surjandari I
Wei JS
Widrow B
Witten IH
Zaslavsky B
Zhang D
Publication venue: 'Wiley'
Publication date: 01/01/2012
Field of study

In the recent years the area of data mining has experienced a considerable demand for technologies that extract knowledge from large and complex data sources. There is a substantial commercial interest as well as research investigations in the area that aim to develop new and improved approaches for extracting information, relationships, and patterns from datasets. Artificial Neural Networks (NN) are popular biologically inspired intelligent methodologies, whose classification, prediction and pattern recognition capabilities have been utilised successfully in many areas, including science, engineering, medicine, business, banking, telecommunication, and many other fields. This paper highlights from a data mining perspective the implementation of NN, using supervised and unsupervised learning, for pattern recognition, classification, prediction and cluster analysis, and focuses the discussion on their usage in bioinformatics and financial data analysis tasks

Central Archive at the University of Reading

Crossref

Portsmouth University Research Portal (Pure)

Bournemouth University Research Online

Deep Big Simple Neural Nets Excel on Handwritten Digit Recognition

Author: Bengio Y.
Dan Claudiu Cireşan
Hochreiter S.
Jürgen Schmidhuber
LeCun Y.
Luca Maria Gambardella
Nair V.
Ranzato M.
Ruetsch G.
Rumelhart D. E.
Russell S.
Salakhutdinov R.
Steinkraus D.
Ueli Meier
Publication venue: 'MIT Press - Journals'
Publication date: 01/03/2010
Field of study

Good old on-line back-propagation for plain multi-layer perceptrons yields a very low 0.35% error rate on the famous MNIST handwritten digits benchmark. All we need to achieve this best result so far are many hidden layers, many neurons per layer, numerous deformed training images, and graphics cards to greatly speed up learning.Comment: 14 pages, 2 figures, 4 listing

arXiv.org e-Print Archive

Crossref