1,894 research outputs found
Cognitively-Inspired Model for Incremental Learning Using a Few Examples
Incremental learning attempts to develop a classifier which learns
continuously from a stream of data segregated into different classes. Deep
learning approaches suffer from catastrophic forgetting when learning classes
incrementally, while most incremental learning approaches require a large
amount of training data per class. We examine the problem of incremental
learning using only a few training examples, referred to as Few-Shot
Incremental Learning (FSIL). To solve this problem, we propose a novel approach
inspired by the concept learning model of the hippocampus and the neocortex
that represents each image class as centroids and does not suffer from
catastrophic forgetting. We evaluate our approach on three class-incremental
learning benchmarks: Caltech-101, CUBS-200-2011 and CIFAR-100 for incremental
and few-shot incremental learning and show that our approach achieves
state-of-the-art results in terms of classification accuracy over all learned
classes.Comment: Added link to the code in the pape
Improved Coreference Resolution Using Cognitive Insights
Coreference resolution is the task of extracting referential expressions, or mentions, in text and clustering these by the entity or concept they refer to. The sustained research interest in the task reflects the richness of reference expression usage in natural language and the difficulty in encoding insights from linguistic and cognitive theories effectively. In this thesis, we design and implement LIMERIC, a state-of-the-art coreference resolution engine. LIMERIC naturally incorporates both non-local decoding and entity-level modelling to achieve the highly competitive benchmark performance of 64.22% and 59.99% on the CoNLL-2012 benchmark with a simple model and a baseline feature set. As well as strong performance, a key contribution of this work is a reconceptualisation of the coreference task. We draw an analogy between shift-reduce parsing and coreference resolution to develop an algorithm which naturally mimics cognitive models of human discourse processing. In our feature development work, we leverage insights from cognitive theories to improve our modelling. Each contribution achieves statistically significant improvements and sum to gains of 1.65% and 1.66% on the CoNLL-2012 benchmark, yielding performance values of 65.76% and 61.27%. For each novel feature we propose, we contribute an accompanying analysis so as to better understand how cognitive theories apply to real language data. LIMERIC is at once a platform for exploring cognitive insights into coreference and a viable alternative to current systems. We are excited by the promise of incorporating our and further cognitive insights into more complex frameworks since this has the potential to both improve the performance of computational models, as well as our understanding of the mechanisms underpinning human reference resolution
Recommended from our members
Spatiotemporal EEG Dynamics of Prospective Memory in Ageing and Mild Cognitive Impairment
Prospective memory (PM, the memory of future intentions) is one of the first complaints of those that develop dementia-related disease. Little is known about the neurophysiology of PM in ageing and those with mild cognitive impairment (MCI). By using a novel artificial neural network to investigate the spatial and temporal features of PM related brain activity, new insights can be uncovered. Young adults (nâ=â30), healthy older adults (nâ=â39) and older adults with MCI (nâ=â27) completed a working memory and two PM (perceptual, conceptual) tasks. Time-locked electroencephalographic potentials (ERPs) from 128-electrodes were analysed using a brain-inspired spiking neural network (SNN) architecture. Local and global connectivity from the SNNs was then evaluated. SNNs outperformed other machine learning methods in classification of brain activity between younger, older and older adults with MCI. SNNs trained using PM related brain activity had better classification accuracy than working memory related brain activity. In general, younger adults exhibited greater local cluster connectivity compared to both older adult groups. Older adults with MCI demonstrated decreased global connectivity in response to working memory and perceptual PM tasks but increased connectivity in the conceptual PM models relative to younger and healthy older adults. SNNs can provide a useful method for differentiating between those with and without MCI. Using brain activity related to PM in combination with SNNs may provide a sensitive biomarker for detecting cognitive decline. Cognitively demanding tasks may increase the amount connectivity in older adults with MCI as a means of compensation
The Paradigm Discovery Problem
This work treats the paradigm discovery problem (PDP), the task of learning
an inflectional morphological system from unannotated sentences. We formalize
the PDP and develop evaluation metrics for judging systems. Using currently
available resources, we construct datasets for the task. We also devise a
heuristic benchmark for the PDP and report empirical results on five diverse
languages. Our benchmark system first makes use of word embeddings and string
similarity to cluster forms by cell and by paradigm. Then, we bootstrap a
neural transducer on top of the clustered data to predict words to realize the
empty paradigm slots. An error analysis of our system suggests clustering by
cell across different inflection classes is the most pressing challenge for
future work. Our code and data are available for public use.Comment: Forthcoming at ACL 202
Can humain association norm evaluate latent semantic analysis?
This paper presents the comparison of word association norm created by a psycholinguistic experiment to association lists generated by algorithms operating on text corpora. We compare lists generated by Church and Hanks algorithm and lists generated by LSA algorithm. An argument is presented on how those automatically generated lists reflect real semantic relations
A probabilistic approach for cluster based polyrepresentative information retrieval
A thesis submitted to the University of Bedfordshire in
partial ful lment of the requirements for the degree of
Doctor of PhilosophyDocument clustering in information retrieval (IR) is considered an alternative to rank-based retrieval approaches, because of its potential to support user interactions
beyond just typing in queries. Similarly, the Principle of Polyrepresentation (multi-evidence: combining multiple cognitively and/or functionally diff erent information need or information object representations for improving
an IR system's performance) is an established approach in cognitive IR with plausible applicability in the domain of information seeking and retrieval. The combination of these two approaches can assimilate their respective individual
strengths in order to further improve the performance of IR systems.
The main goal of this study is to combine cognitive and cluster-based IR approaches for improving the eff ectiveness of (interactive) information retrieval systems. In order to achieve this goal, polyrepresentative information retrieval
strategies for cluster browsing and retrieval have been designed, focusing on the evaluation aspect of such strategies.
This thesis addresses the challenge of designing and evaluating an Optimum Clustering Framework (OCF) based model, implementing probabilistic document clustering for interactive IR. Thus, polyrepresentative cluster browsing
strategies have been devised. With these strategies a simulated user based method has been adopted for evaluating the polyrepresentative cluster browsing
and searching strategies.
The proposed approaches are evaluated for information need based polyrepresentative clustering as well as document based polyrepresentation and the combination thereof. For document-based polyrepresentation, the notion of citation
context is exploited, which has special applications in scientometrics and bibliometrics for science literature modelling. The information need polyrepresentation,
on the other hand, utilizes the various aspects of user information need, which is crucial for enhancing the retrieval performance.
Besides describing a probabilistic framework for polyrepresentative document clustering, one of the main fi ndings of this work is that the proposed combination
of the Principle of Polyrepresentation with document clustering has the potential of enhancing the user interactions with an IR system, provided that the various representations of information need and information objects are utilized.
The thesis also explores interactive IR approaches in the context of polyrepresentative interactive information retrieval when it is combined with document clustering methods. Experiments suggest there is a potential in the proposed
cluster-based polyrepresentation approach, since statistically signifi cant improvements were found when comparing the approach to a BM25-based baseline in an ideal scenario. Further marginal improvements were observed when cluster-based re-ranking and cluster-ranking based comparisons were made.
The performance of the approach depends on the underlying information object and information need representations used, which confi rms fi ndings of previous studies where the Principle of Polyrepresentation was applied in diff erent ways
Grounding semantic cognition using computational modelling and network analysis
The overarching objective of this thesis is to further the field of grounded semantics using a range of computational and empirical studies. Over the past thirty years, there have been many algorithmic advances in the
modelling of semantic cognition. A commonality across these cognitive models is a reliance on hand-engineering âtoy-modelsâ. Despite incorporating newer
techniques (e.g. Long short-term memory), the model inputs remain unchanged. We argue that the inputs to these traditional semantic models have little resemblance with real human experiences. In this dissertation, we ground our neural network models by training them with real-world visual scenes using naturalistic photographs. Our approach is an alternative to both hand-coded
features and embodied raw sensorimotor signals.
We conceptually replicate the mutually reinforcing nature of hybrid (feature-based and grounded) representations using silhouettes of concrete concepts as model inputs. We next gradually develop a novel grounded cognitive semantic representation which we call scene2vec, starting with object co-occurrences and then adding emotions and language-based tags. Limitations of our scene-based representation are identified for more abstract concepts (e.g. freedom). We further present a large-scale human semantics study, which reveals small-world semantic network topologies are context-dependent and
that scenes are the most dominant cognitive dimension. This finding leads us to conclude that there is no meaning without context. Lastly, scene2vec shows
promising human-like context-sensitive stereotypes (e.g. gender role bias), and we explore how such stereotypes are reduced by targeted debiasing. In conclusion, this thesis provides support for a novel computational
viewpoint on investigating meaning - scene-based grounded semantics. Future research scaling scene-based semantic models to human-levels through virtual grounding has the potential to unearth new insights into the human mind and
concurrently lead to advancements in artificial general intelligence by enabling robots, embodied or otherwise, to acquire and represent meaning directly from the environment
- âŚ