Search CORE

2,140 research outputs found

Recommended from our members

The Emergence of Cognitive Phenomena from Sub-symbolic Processes

Author: Rumelhart David E.
Publication venue: eScholarship, University of California
Publication date: 01/01/1984
Field of study

eScholarship - University of California

A comparative study of the D0 neural-network analysis of the top quark non-leptonic decay channel

Author: Odorico R.
Odorico R.
R Odorico
Rumelhart D. E.
Publication venue: 'IOP Publishing'
Publication date: 07/04/2000
Field of study

A simpler neural-network approach is presented for the analysis of the top quark non-leptonic decay channel in events of the D0 Collaboration. Results for the top quark signal are comparable to those found by the D0 Collaboration by a more elaborate handling of the event information used as input to the neural network.Comment: 5 pages, 1 figur

arXiv.org e-Print Archive

Crossref

CERN Document Server

Neural NILM: Deep Neural Networks Applied to Energy Disaggregation

Author: Atlas L. E.
Bastien F.
Graves A.
Hart G. W.
Kolter J. Z.
Krizhevsky A.
Leeb S. B.
Lowe D. G.
Rumelhart D. E.
Sutskever I.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 28/09/2015
Field of study

Energy disaggregation estimates appliance-by-appliance electricity consumption from a single meter that measures the whole home's electricity demand. Recently, deep neural networks have driven remarkable improvements in classification performance in neighbouring machine learning fields such as image classification and automatic speech recognition. In this paper, we adapt three deep neural network architectures to energy disaggregation: 1) a form of recurrent neural network called `long short-term memory' (LSTM); 2) denoising autoencoders; and 3) a network which regresses the start time, end time and average power demand of each appliance activation. We use seven metrics to test the performance of these algorithms on real aggregate power data from five appliances. Tests are performed against a house not seen during training and against houses seen during training. We find that all three neural nets achieve better F1 scores (averaged over all five appliances) than either combinatorial optimisation or factorial hidden Markov models and that our neural net algorithms generalise well to an unseen house.Comment: To appear in ACM BuildSys'15, November 4--5, 2015, Seou

arXiv.org e-Print Archive

CiteSeerX

Crossref

Spiral - Imperial College Digital Repository

User Loss -- A Forced-Choice-Inspired Approach to Train Neural Networks directly by User Interaction

Author: David E. Rumelhart
Florian Luisier
Georg Petschnigg
Weilin Fu
Yann LeCun
Publication venue
Publication date: 21/12/2018
Field of study

In this paper, we investigate whether is it possible to train a neural network directly from user inputs. We consider this approach to be highly relevant for applications in which the point of optimality is not well-defined and user-dependent. Our application is medical image denoising which is essential in fluoroscopy imaging. In this field every user, i.e. physician, has a different flavor and image quality needs to be tailored towards each individual. To address this important problem, we propose to construct a loss function derived from a forced-choice experiment. In order to make the learning problem feasible, we operate in the domain of precision learning, i.e., we inspire the network architecture by traditional signal processing methods in order to reduce the number of trainable parameters. The algorithm that was used for this is a Laplacian pyramid with only six trainable parameters. In the experimental results, we demonstrate that two image experts who prefer different filter characteristics between sharpness and de-noising can be created using our approach. Also models trained for a specific user perform best on this users test data. This approach opens the way towards implementation of direct user feedback in deep learning and is applicable for a wide range of application.Comment: Accepted on BVM 2019; Extended ArXiv Version with additional figures and detail

arXiv.org e-Print Archive

Crossref

Can a connectionist model explain the processing of regularly and irregularly inflected words in German as L1 and L2?

Author: Birdsong D.
Francis W. N.
Maratsos M.
Meier H.
Pfeffer J.
Pinker S.
Rumelhart D.
Smolka E.
Tilo Strobach
Ute Schönpflug
Publication venue: 'SAGE Publications'
Publication date: 01/01/2011
Field of study

The connectionist model is a prevailing model of the structure and functioning of the cognitive system of the processing of morphology. According to this model, the morphology of regularly and irregularly inflected words (e.g., verb participles and noun plurals) is processed in the same cognitive network. A validation of the connectionist model of the processing of morphology in German as L2 has yet to be achieved. To investigate L2-specific aspects, we compared a group of L1 speakers of German with speakers of German as L2. L2 and L1 speakers of German were assigned to their respective group by their reaction times in picture naming prior to the central task. The reaction times in the lexical decision task of verb participles and noun plurals were largely consistent with the assumption of the connectionist model. Interestingly, speakers of German as L2 showed a specific advantage for irregular compared with regular verb participles

Institutional Repository of the Freie Universität Berlin

Crossref

Open Access LMU ( Ludwig-Maximilians-Univ. München)

A Comparison of the Use of Binary Decision Trees and Neural Networks in Top Quark Detection

Author: D. E. Rumelhart
David Bowser-Chao
Debra L. Dzialo
H. Baer
H. Baer
H.-U. Bengtsson
J. L. Bentley
S. M. Omohundro
W. H. Press
Publication venue: 'American Physical Society (APS)'
Publication date: 04/09/1992
Field of study

The use of neural networks for signal vs.~background discrimination in high-energy physics experiment has been investigated and has compared favorably with the efficiency of traditional kinematic cuts. Recent work in top quark identification produced a neural network that, for a given top quark mass, yielded a higher signal to background ratio in Monte Carlo simulation than a corresponding set of conventional cuts. In this article we discuss another pattern-recognition algorithm, the binary decision tree. We have applied a binary decision tree to top quark identification at the Tevatron and found it to be comparable in performance to the neural network. Furthermore, reservations about the "black box" nature of neural network discriminators do not apply to binary decision trees; a binary decision tree may be reduced to a set of kinematic cuts subject to conventional error analysis.Comment: 14pp. Plain TeX + mtexsis.tex (latter available through 'get mtexsis.tex'.) Two postscript files avail. by emai

arXiv.org e-Print Archive

CiteSeerX

Crossref

A Neural Networks Committee for the Contextual Bandit Problem

Author: D.E. Rumelhart
E. Kaufmann
G. Tesauro
K. Hornik
L. Bottou
L. Kocsis
P. Auer
P. Auer
P. Auer
R. Feraud
S.M. Kakade
T.L. Lai
W. Thompson
Publication venue
Publication date: 01/01/2014
Field of study

This paper presents a new contextual bandit algorithm, NeuralBandit, which does not need hypothesis on stationarity of contexts and rewards. Several neural networks are trained to modelize the value of rewards knowing the context. Two variants, based on multi-experts approach, are proposed to choose online the parameters of multi-layer perceptrons. The proposed algorithms are successfully tested on a large dataset with and without stationarity of rewards.Comment: 21st International Conference on Neural Information Processin

arXiv.org e-Print Archive

HAL-CentraleSupelec

Crossref

INRIA a CCSD electronic archive server

HAL-Rennes 1

Learning network event sequences using long short-term memory and second-order statistic loss

Author: Du N.
Frank J.
Lewis E. A.
Rumelhart David E.
Publication venue: 'Wiley'
Publication date: 01/02/2021
Field of study

Modeling temporal event sequences on the vertices of a network is an important problem with widespread applications; examples include modeling influences in social networks, preventing crimes by modeling their space–time occurrences, and forecasting earthquakes. Existing solutions for this problem use a parametric approach, whose applicability is limited to event sequences following some well-known distributions, which is not true for many real life event datasets. To overcome this limitation, in this work, we propose a composite recurrent neural network model for learning events occurring in the vertices of a network over time. Our proposed model combines two long short-term memory units to capture base intensity and conditional intensity of an event sequence. We also introduce a second-order statistic loss that penalizes higher divergence between the generated and the target sequence's distribution of hop count distance of consecutive events. Given a sequence of vertices of a network in which an event has occurred, the proposed model predicts the vertex where the next event would most likely occur. Experimental results on synthetic and real-world datasets validate the superiority of our proposed model in comparison to various baseline methods

Crossref

IUPUIScholarWorks

Supervised Learning in Multilayer Spiking Neural Networks

Author: André Grüning
Bohte S.
Elias J. G.
Gerstner W.
Hebb D. O.
Ioana Sporea
Maass W.
Rostro-Gonzalez H.
Rumelhart D. E.
Thorpe S. T.
Xin J.
Publication venue: 'MIT Press - Journals'
Publication date: 10/02/2012
Field of study

The current article introduces a supervised learning algorithm for multilayer spiking neural networks. The algorithm presented here overcomes some limitations of existing learning algorithms as it can be applied to neurons firing multiple spikes and it can in principle be applied to any linearisable neuron model. The algorithm is applied successfully to various benchmarks, such as the XOR problem and the Iris data set, as well as complex classifications problems. The simulations also show the flexibility of this supervised learning algorithm which permits different encodings of the spike timing patterns, including precise spike trains encoding.Comment: 38 pages, 4 figure

arXiv.org e-Print Archive

Crossref

Surrey Research Insight

Concepts and Their Dynamics: A Quantum-Theoretic Modeling of Human Thought

Author: Aerts D.
Aerts D.
Bruza P. D.
Dirac P. A. M.
Feynman R.
Gabora L.
Jauch J. M.
Kolmogorov A. N.
Laplace P. S.
Mackey G.
Nelson D. L.
Nosofsky R.
Piwowarski B.
Rosch E.
Rosch E.
Rumelhart D. E.
Widdows D.
Widdows D.
Wittgenstein L.
Publication venue: 'Wiley'
Publication date: 01/01/2013
Field of study

We analyze different aspects of our quantum modeling approach of human concepts, and more specifically focus on the quantum effects of contextuality, interference, entanglement and emergence, illustrating how each of them makes its appearance in specific situations of the dynamics of human concepts and their combinations. We point out the relation of our approach, which is based on an ontology of a concept as an entity in a state changing under influence of a context, with the main traditional concept theories, i.e. prototype theory, exemplar theory and theory theory. We ponder about the question why quantum theory performs so well in its modeling of human concepts, and shed light on this question by analyzing the role of complex amplitudes, showing how they allow to describe interference in the statistics of measurement outcomes, while in the traditional theories statistics of outcomes originates in classical probability weights, without the possibility of interference. The relevance of complex numbers, the appearance of entanglement, and the role of Fock space in explaining contextual emergence, all as unique features of the quantum modeling, are explicitly revealed in this paper by analyzing human concepts and their dynamics.Comment: 31 pages, 5 figure

arXiv.org e-Print Archive

CiteSeerX

Crossref

Archivio istituzionale della ricerca - Università degli Studi di Udine

Leicester Research Archive