5,954 research outputs found
Low-Shot Learning with Imprinted Weights
Human vision is able to immediately recognize novel visual categories after
seeing just one or a few training examples. We describe how to add a similar
capability to ConvNet classifiers by directly setting the final layer weights
from novel training examples during low-shot learning. We call this process
weight imprinting as it directly sets weights for a new category based on an
appropriately scaled copy of the embedding layer activations for that training
example. The imprinting process provides a valuable complement to training with
stochastic gradient descent, as it provides immediate good classification
performance and an initialization for any further fine-tuning in the future. We
show how this imprinting process is related to proxy-based embeddings. However,
it differs in that only a single imprinted weight vector is learned for each
novel category, rather than relying on a nearest-neighbor distance to training
instances as typically used with embedding methods. Our experiments show that
using averaging of imprinted weights provides better generalization than using
nearest-neighbor instance embeddings.Comment: CVPR 201
The Recommendation Architecture: Lessons from Large-Scale Electronic Systems Applied to Cognition
A fundamental approach of cognitive science is to understand cognitive systems by separating them into modules. Theoretical reasons are described which force any system which learns to perform a complex combination of real time functions into a modular architecture. Constraints on the way modules divide up functionality are also described. The architecture of such systems, including biological systems, is constrained into a form called the recommendation architecture, with a primary separation between clustering and competition. Clustering is a modular hierarchy which manages the interactions between functions on the basis of detection of functionally ambiguous repetition. Change to previously detected repetitions is limited in order to maintain a meaningful, although partially ambiguous context for all modules which make use of the previously defined repetitions. Competition interprets the repetition conditions detected by clustering as a range of alternative behavioural recommendations, and uses consequence feedback to learn to select the most appropriate recommendation. The requirements imposed by functional complexity result in very specific structures and processes which resemble those of brains. The design of an implemented electronic version of the recommendation architecture is described, and it is demonstrated that the system can heuristically define its own functionality, and learn without disrupting earlier learning. The recommendation architecture is compared with a range of alternative cognitive architectural proposals, and the conclusion reached that it has substantial potential both for understanding brains and for designing systems to perform cognitive functions
A learning rule for place fields in a cortical model: theta phase precession as a network effect
We show that a model of the hippocampus introduced recently by Scarpetta,
Zhaoping & Hertz ([2002] Neural Computation 14(10):2371-96), explains the theta
phase precession phenomena. In our model, the theta phase precession comes out
as a consequence of the associative-memory-like network dynamics, i.e. the
network's ability to imprint and recall oscillatory patterns, coded both by
phases and amplitudes of oscillation. The learning rule used to imprint the
oscillatory states is a natural generalization of that used for static patterns
in the Hopfield model, and is based on the spike time dependent synaptic
plasticity (STDP), experimentally observed. In agreement with experimental
findings, the place cell's activity appears at consistently earlier phases of
subsequent cycles of the ongoing theta rhythm during a pass through the place
field, while the oscillation amplitude of the place cell's firing rate
increases as the animal approaches the center of the place field and decreases
as the animal leaves the center. The total phase precession of the place cell
is lower than 360 degrees, in agreement with experiments. As the animal enters
a receptive field the place cell's activity comes slightly less than 180
degrees after the phase of maximal pyramidal cell population activity, in
agreement with the findings of Skaggs et al (1996). Our model predicts that the
theta phase is much better correlated with location than with time spent in the
receptive field. Finally, in agreement with the recent experimental findings of
Zugaro et al (2005), our model predicts that theta phase precession persists
after transient intra-hippocampal perturbation.Comment: 10 pages, 7 figures, to be published in Hippocampu
A Functional Architecture Approach to Neural Systems
The technology for the design of systems to perform extremely complex combinations of real-time functionality has developed over a long period. This technology is based on the use of a hardware architecture with a physical separation into memory and processing, and a software architecture which divides functionality into a disciplined hierarchy of software components which exchange unambiguous information. This technology experiences difficulty in design of systems to perform parallel processing, and extreme difficulty in design of systems which can heuristically change their own functionality. These limitations derive from the approach to information exchange between functional components. A design approach in which functional components can exchange ambiguous information leads to systems with the recommendation architecture which are less subject to these limitations. Biological brains have been constrained by natural pressures to adopt functional architectures with this different information exchange approach. Neural networks have not made a complete shift to use of ambiguous information, and do not address adequate management of context for ambiguous information exchange between modules. As a result such networks cannot be scaled to complex functionality. Simulations of systems with the recommendation architecture demonstrate the capability to heuristically organize to perform complex functionality
COVID-19 Detection from Chest X-Ray Images Using Deep Convolutional Neural Networks with Weights Imprinting Approach
COVID-19 pandemic has drastically changed our lives. Chest radiographyhas been used to detect COVID-19. However, the numberof publicly available COVID-19 x-ray images is extremely limited,resulting in a highly imbalanced dataset. This is a challenge whenusing deep learning for classification and detection. In this work, wepropose the use of pre-trained deep Convolutional Neural Networks(CNN) and integrate them with a few-shot learning approach namedimprinted weights. The integrated model is fine tuned to enhancethe capability of detecting COVID-19. The proposed solution thencombines the fine-tuned models using a weighted average ensemblefor achieving an optimal 82% sensitivity to COVID-19. To thebest of authors’ knowledge, the proposed solution is one of the firstto utilize imprinted weights model with weighted average ensemblefor enhancing the model sensitivity to COVID-19
- …