Search CORE

9,561 research outputs found

Learning to Rank Question Answer Pairs with Holographic Dual LSTM Architecture

Author: Chang Ming-Wei
Heilman Michael
Hu Baotian
Luu Anh Tuan
Mikolov Tomas
Nickel Maximilian
Plate Tony
Qiu Xipeng
Robertson Stephen E.
Wang Di
Wang Mengqiu
Yao Xuchen
Zhou Guangyou
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 20/07/2017
Field of study

We describe a new deep learning architecture for learning to rank question answer pairs. Our approach extends the long short-term memory (LSTM) network with holographic composition to model the relationship between question and answer representations. As opposed to the neural tensor layer that has been adopted recently, the holographic composition provides the benefits of scalable and rich representational learning approach without incurring huge parameter costs. Overall, we present Holographic Dual LSTM (HD-LSTM), a unified architecture for both deep sentence modeling and semantic matching. Essentially, our model is trained end-to-end whereby the parameters of the LSTM are optimized in a way that best explains the correlation between question and answer representations. In addition, our proposed deep learning architecture requires no extensive feature engineering. Via extensive experiments, we show that HD-LSTM outperforms many other neural architectures on two popular benchmark QA datasets. Empirical studies confirm the effectiveness of holographic composition over the neural tensor layer.Comment: SIGIR 2017 Full Pape

arXiv.org e-Print Archive

Crossref

An Integrated Neural Network-Event-Related Potentials Model of Temporal and Probability Context Effects on Event Categorization

Author: Banquet Jean-Paul
Contreras-Vidal José L.
Publication venue: Boston University Center for Adaptive Systems and Department of Cognitive and Neural Systems
Publication date: 01/02/1992
Field of study

We present a neural network that adapts and integrates several preexisting or new modules to categorize events in short term memory (STM), encode temporal order in working memory, evaluate timing and probability context in medium and long term memory. The model shows how processed contextual information modulates event recognition and categorization, focal attention and incentive motivation. The model is based on a compendium of Event Related Potentials (ERPs) and behavioral results either collected by the authors or compiled from the classical ERP literature. Its hallmark is, at the functional level, the interplay of memory registers endowed with widely different dynamical ranges, and at the structural level, the attempt to relate the different modules to known anatomical structures.INSERM; NATO; DGA/DRET (911470/A000/DRET/DS/DR

Boston University Institutional Repository (OpenBU)

Plural morphology in compounding is not good evidence to support the dual mechanism model

Author: Davey N.
Hayes J.
Murphy V.
Peters L.
Smith Pamela
Publication venue
Publication date: 01/01/2001
Field of study

The compounding phenomena is considered to be good evidence to support the dual mechanism model of morphological processing (Pinker & Prince, 1992). However evidence from initial neural net modeling has shown that a single route associative memory based account might provide an equally, if not more valid explanation of the treatment of plurals in compounds. Further neural net modeling and empirical work is proposed to test this single route accoun

University of Hertfordshire Research Archive

The propositional nature of human associative learning

Author: De Houwer Jan
Lovibond PF
Mitchell CJ
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/01/2009
Field of study

The past 50 years have seen an accumulation of evidence suggesting that associative learning depends oil high-level cognitive processes that give rise to propositional knowledge. Yet, many learning theorists maintain a belief in a learning mechanism in which links between mental representations are formed automatically. We characterize and highlight the differences between the propositional and link approaches, and review the relevant empirical evidence. We conclude that learning is the consequence of propositional reasoning processes that cooperate with the unconscious processes involved in memory retrieval and perception. We argue that this new conceptual framework allows many of the important recent advances in associative learning research to be retained, but recast in a model that provides a firmer foundation for both immediate application and future research

CiteSeerX

Ghent University Academic Bibliography

From Parallel Sequence Representations to Calligraphic Control: A Conspiracy of Neural Circuits

Author: Bullock Daniel
Publication venue: Boston University Center for Adaptive Systems and Department of Cognitive and Neural Systems
Publication date: 01/05/2004
Field of study

Calligraphic writing presents a rich set of challenges to the human movement control system. These challenges include: initial learning, and recall from memory, of prescribed stroke sequences; critical timing of stroke onsets and durations; fine control of grip and contact forces; and letter-form invariance under voluntary size scaling, which entails fine control of stroke direction and amplitude during recruitment and derecruitment of musculoskeletal degrees of freedom. Experimental and computational studies in behavioral neuroscience have made rapid progress toward explaining the learning, planning and contTOl exercised in tasks that share features with calligraphic writing and drawing. This article summarizes computational neuroscience models and related neurobiological data that reveal critical operations spanning from parallel sequence representations to fine force control. Part one addresses stroke sequencing. It treats competitive queuing (CQ) models of sequence representation, performance, learning, and recall. Part two addresses letter size scaling and motor equivalence. It treats cursive handwriting models together with models in which sensory-motor tmnsformations are performed by circuits that learn inverse differential kinematic mappings. Part three addresses fine-grained control of timing and transient forces, by treating circuit models that learn to solve inverse dynamics problems.National Institutes of Health (R01 DC02852

Boston University Institutional Repository (OpenBU)

Event Timing in Associative Learning

Author: Herz Andreas V. M.
Nehrkorn Johannes
Tanimoto Hiromu
Yarali Ayse
Publication venue: Ludwig-Maximilians-Universität München
Publication date: 01/01/2012
Field of study

Associative learning relies on event timing. Fruit flies for example, once trained with an odour that precedes electric shock, subsequently avoid this odour (punishment learning); if, on the other hand the odour follows the shock during training, it is approached later on (relief learning). During training, an odour-induced Ca++ signal and a shock-induced dopaminergic signal converge in the Kenyon cells, synergistically activating a Ca++-calmodulin-sensitive adenylate cyclase, which likely leads to the synaptic plasticity underlying the conditioned avoidance of the odour. In Aplysia, the effect of serotonin on the corresponding adenylate cyclase is bi-directionally modulated by Ca++, depending on the relative timing of the two inputs. Using a computational approach, we quantitatively explore this biochemical property of the adenylate cyclase and show that it can generate the effect of event timing on associative learning. We overcome the shortage of behavioural data in Aplysia and biochemical data in Drosophila by combining findings from both systems

Open Access LMU

Brain mechanisms of successful recognition through retrieval of semantic context

Author: Flegal Kristin E.
Marín-Gutiérrez Alejandro
Ragland J. Daniel
Ranganath Charan
Publication venue: 'MIT Press - Journals'
Publication date: 01/08/2014
Field of study

Episodic memory is associated with the encoding and retrieval of context information and with a subjective sense of reexperiencing past events. The neural correlates of episodic retrieval have been extensively studied using fMRI, leading to the identification of a "general recollection network" including medial temporal, parietal, and prefrontal regions. However, in these studies, it is difficult to disentangle the effects of context retrieval from recollection. In this study, we used fMRI to determine the extent to which the recruitment of regions in the recollection network is contingent on context reinstatement. Participants were scanned during a cued recognition test for target words from encoded sentences. Studied target words were preceded by either a cue word studied in the same sentence (thus congruent with encoding context) or a cue word studied in a different sentence (thus incongruent with encoding context). Converging fMRI results from independently defined ROIs and whole-brain analysis showed regional specificity in the recollection network. Activity in hippocampus and parahippocampal cortex was specifically increased during successful retrieval following congruent context cues, whereas parietal and prefrontal components of the general recollection network were associated with confident retrieval irrespective of contextual congruency. Our findings implicate medial temporal regions in the retrieval of semantic context, contributing to, but dissociable from, recollective experience

eScholarship - University of California

Enlighten

The Role of Consciousness in Memory

Author: Baars BJ
Franklin Stan
Ramamurthy Uma
Ventura M
Publication venue
Publication date: 01/01/2005
Field of study

Conscious events interact with memory systems in learning, rehearsal and retrieval (Ebbinghaus 1885/1964; Tulving 1985). Here we present hypotheses that arise from the IDA computional model (Franklin, Kelemen and McCauley 1998; Franklin 2001b) of global workspace theory (Baars 1988, 2002). Our primary tool for this exploration is a flexible cognitive cycle employed by the IDA computational model and hypothesized to be a basic element of human cognitive processing. Since cognitive cycles are hypothesized to occur five to ten times a second and include interaction between conscious contents and several of the memory systems, they provide the means for an exceptionally fine-grained analysis of various cognitive tasks. We apply this tool to the small effect size of subliminal learning compared to supraliminal learning, to process dissociation, to implicit learning, to recognition vs. recall, and to the availability heuristic in recall. The IDA model elucidates the role of consciousness in the updating of perceptual memory, transient episodic memory, and procedural memory. In most cases, memory is hypothesized to interact with conscious events for its normal functioning. The methodology of the paper is unusual in that the hypotheses and explanations presented are derived from an empirically based, but broad and qualitative computational model of human cognition

CogPrints Cognitive Sciences Eprint Archive

Digital Peer Publishing

Lifelong Learning of Spatiotemporal Representations with Dual-Memory Recurrent Self-Organization

Author: Parisi German I.
Tani Jun
Weber Cornelius
Wermter Stefan
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2018
Field of study

Artificial autonomous agents and robots interacting in complex environments are required to continually acquire and fine-tune knowledge over sustained periods of time. The ability to learn from continuous streams of information is referred to as lifelong learning and represents a long-standing challenge for neural network models due to catastrophic forgetting. Computational models of lifelong learning typically alleviate catastrophic forgetting in experimental scenarios with given datasets of static images and limited complexity, thereby differing significantly from the conditions artificial agents are exposed to. In more natural settings, sequential information may become progressively available over time and access to previous experience may be restricted. In this paper, we propose a dual-memory self-organizing architecture for lifelong learning scenarios. The architecture comprises two growing recurrent networks with the complementary tasks of learning object instances (episodic memory) and categories (semantic memory). Both growing networks can expand in response to novel sensory experience: the episodic memory learns fine-grained spatiotemporal representations of object instances in an unsupervised fashion while the semantic memory uses task-relevant signals to regulate structural plasticity levels and develop more compact representations from episodic experience. For the consolidation of knowledge in the absence of external sensory input, the episodic memory periodically replays trajectories of neural reactivations. We evaluate the proposed model on the CORe50 benchmark dataset for continuous object recognition, showing that we significantly outperform current methods of lifelong learning in three different incremental learning scenario

arXiv.org e-Print Archive

OIST Institutional Repository

Directory of Open Access Journals

Frontiers - Publisher Connector