Speaker Normalization Using Cortical Strip Maps: A Neural Model for Steady-State Vowel Categorization
Auditory signals of speech are speaker-dependent, but representations of language meaning are speaker-independent. The transformation from speaker-dependent to speaker-independent language representations enables speech to be learned and understood from different speakers. A neural model is presented that performs speaker normalization to generate a pitch-independent representation of speech sounds, while also preserving information about speaker identity. This speaker-invariant representation is categorized into unitized speech items, which input to sequential working memories whose distributed patterns can be categorized, or chunked, into syllable and word representations. The proposed model fits into an emerging model of auditory streaming and speech categorization. The auditory streaming and speaker normalization parts of the model both use multiple strip representations and asymmetric competitive circuits, thereby suggesting that these two circuits arose from similar neural designs. The normalized speech items are rapidly categorized and stably remembered by Adaptive Resonance Theory circuits. Simulations use synthesized steady-state vowels from the Peterson and Barney [J. Acoust. Soc. Am. 24, 175-184 (1952)] vowel database and achieve accuracy rates similar to those achieved by human listeners. These results are compared to behavioral data and other speaker normalization models. National Science Foundation (SBE-0354378); Office of Naval Research (N00014-01-1-0624).
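The core idea of a speaker-independent representation can be illustrated, far more crudely than the cortical strip-map circuit described above, by normalizing vowel formants in log-frequency space so that a uniform vocal-tract scaling between speakers cancels out. The formant values and scale factor below are made up for illustration; they are not drawn from the Peterson and Barney database.

```python
import math

def normalize_formants(formants_hz):
    """Speaker-normalize (F1, F2) formant pairs by subtracting the
    speaker's mean log-formant value.  A uniform scaling of all
    formants (a crude model of vocal-tract length differences) adds
    a constant in log space, so it is removed by this step."""
    logs = [(math.log(f1), math.log(f2)) for f1, f2 in formants_hz]
    mean = sum(a + b for a, b in logs) / (2 * len(logs))
    return [(a - mean, b - mean) for a, b in logs]

# Two hypothetical "speakers" whose formants differ by a uniform scale
speaker_a = [(270.0, 2290.0), (660.0, 1720.0)]        # e.g. /i/ and /ae/
speaker_b = [(f1 * 1.2, f2 * 1.2) for f1, f2 in speaker_a]

norm_a = normalize_formants(speaker_a)
norm_b = normalize_formants(speaker_b)
# After normalization the two speakers' vowel patterns coincide,
# so a single categorizer can serve both speakers.
```

This is only the invariance step; the model in the abstract additionally preserves speaker identity and categorizes the normalized items with ART circuits.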
A survey of visual preprocessing and shape representation techniques
Many recent theories and methods proposed for visual preprocessing and shape representation are summarized. The survey brings together research from the fields of biology, psychology, computer science, electrical engineering, and most recently, neural networks. It was motivated by the need to preprocess images for a sparse distributed memory (SDM), but the techniques presented may also prove useful for applying other associative memories to visual pattern recognition. The material of this survey is divided into three sections: an overview of biological visual processing; methods of preprocessing (extracting parts of shape, texture, motion, and depth); and shape representation and recognition (form invariance, primitives and structural descriptions, and theories of attention).
Change blindness: eradication of gestalt strategies
Arrays of eight texture-defined rectangles were used as stimuli in a one-shot change blindness (CB) task where there was a 50% chance that one rectangle would change orientation between two successive presentations separated by an interval. CB was eliminated by cueing the target rectangle in the first stimulus, reduced by cueing in the interval, and unaffected by cueing in the second presentation. This supports the idea that a representation was formed that persisted through the interval before being 'overwritten' by the second presentation (Landman et al., 2003, Vision Research 43, 149-164). Another possibility is that participants used some kind of grouping or Gestalt strategy. To test this we changed the spatial position of the rectangles in the second presentation by shifting them along imaginary spokes (by ±1 degree) emanating from the central fixation point. There was no significant difference in performance between this and the standard task [F(1,4)=2.565, p=0.185]. This suggests two things: (i) Gestalt grouping is not used as a strategy in these tasks, and (ii) it lends further weight to the argument that objects may be stored in and retrieved from a pre-attentional store during this task.
Face perception: An integrative review of the role of spatial frequencies
The aim of this article is to reinterpret the results obtained from the research analyzing the role played by spatial frequencies in face perception. Two main lines of work have been explored in this body of research: the critical bandwidth of spatial frequencies that allows face recognition to take place (the masking approach), and the role played by different spatial frequencies while the visual percept is being developed (the microgenetic approach). However, results obtained to date are not satisfactory in that no single explanation accounts for all the data obtained from each of the approaches. We propose that the key to understanding the role of spatial frequencies in face perception is the interaction between the demands of the task and the information in the image (the diagnostic recognition approach). Using this new framework, we review the most significant research carried out since the early 1970s to provide a reinterpretation of the data obtained.
Recent Progress in Image Deblurring
This paper comprehensively reviews the recent development of image deblurring, including non-blind/blind and spatially invariant/variant deblurring techniques. These techniques share the same objective of inferring a latent sharp image from one or several corresponding blurry images, while blind deblurring techniques must also derive an accurate blur kernel. Given the critical role of image restoration in modern imaging systems, which must provide high-quality images under complex conditions such as motion, undesirable lighting, and imperfect system components, image deblurring has attracted growing attention in recent years. From the viewpoint of how to handle the ill-posedness that is a crucial issue in deblurring tasks, existing methods can be grouped into five categories: Bayesian inference frameworks, variational methods, sparse representation-based methods, homography-based modeling, and region-based methods. Despite this progress, image deblurring, especially the blind case, is limited by complex application conditions that make the blur kernel hard to estimate and often spatially variant. This review provides a holistic understanding of and deep insight into image deblurring, together with an analysis of the empirical evidence for representative methods, practical issues, and a discussion of promising future directions.
Comment: 53 pages, 17 figures
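The variational handling of ill-posedness can be sketched in miniature: non-blind deblurring amounts to minimizing a regularized least-squares objective. The following 1-D toy example (an illustrative Tikhonov-regularized gradient descent, not any specific method from the survey) recovers a sharp signal from a known blur kernel:

```python
def convolve(x, k):
    """Full 1-D convolution of signal x with kernel k (zero-padded)."""
    y = [0.0] * (len(x) + len(k) - 1)
    for i in range(len(x)):
        for j in range(len(k)):
            y[i + j] += x[i] * k[j]
    return y

def deblur(y, k, lam=0.01, lr=0.1, steps=2000):
    """Non-blind deblurring: recover x from y = k * x by minimizing
    ||k*x - y||^2 + lam*||x||^2 with gradient descent.  The lam term
    (Tikhonov regularization) tames the ill-posedness of inverting
    the blur."""
    n = len(y) - len(k) + 1
    x = [0.0] * n
    for _ in range(steps):
        r = [a - b for a, b in zip(convolve(x, k), y)]   # residual
        # Gradient of the data term: correlate the residual with k.
        g = [sum(r[i + j] * k[j] for j in range(len(k))) for i in range(n)]
        x = [xi - lr * (2 * gi + 2 * lam * xi) for xi, gi in zip(x, g)]
    return x

blur_kernel = [0.25, 0.5, 0.25]                  # known (non-blind) kernel
sharp = [0.0, 0.0, 1.0, 1.0, 0.0, 0.0]           # latent "edge" signal
blurry = convolve(sharp, blur_kernel)
recovered = deblur(blurry, blur_kernel)          # close to `sharp`
```

The blind case discussed above is harder precisely because `blur_kernel` would itself be unknown and would have to be estimated jointly with the sharp signal.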
Pattern classification of valence in depression
Copyright @ The authors, 2013. This is an open access article available under Creative Commons Licence CC-BY-NC-ND 3.0. Neuroimaging biomarkers of depression have the potential to aid diagnosis, identify individuals at risk, and predict treatment response or course of illness. Nevertheless, none have been identified so far, potentially because no single brain parameter captures the complexity of the pathophysiology of depression. Multi-voxel pattern analysis (MVPA) may overcome this issue, as it can identify patterns of voxels that are spatially distributed across the brain. Here we present the results of an MVPA investigating the neuronal patterns underlying passive viewing of positive, negative and neutral pictures in depressed patients. A linear support vector machine (SVM) was trained to discriminate different valence conditions based on the functional magnetic resonance imaging (fMRI) data of nine unipolar depressed patients. A similar dataset obtained from nine healthy individuals was included to conduct a group classification analysis via linear discriminant analysis (LDA). Accuracy scores of 86% or higher were obtained for each valence contrast via patterns that included limbic areas such as the amygdala and frontal areas such as the ventrolateral prefrontal cortex. The LDA identified two areas (the dorsomedial prefrontal cortex and caudate nucleus) that allowed group classification with 72.2% accuracy. Our preliminary findings suggest that MVPA can identify stable valence patterns, with more sensitivity than univariate analysis, in depressed participants, and that it may be possible to discriminate between healthy and depressed individuals based on differences in the brain's response to emotional cues. This work was supported by a PhD studentship to I.H. from the National Institute for Social Care and Health Research (NISCHR) HS/10/25 and MRC grant G 1100629.
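The linear-SVM step of such an MVPA can be sketched in miniature. The 3-voxel "response patterns" below are hypothetical toy data (the study used real fMRI voxel patterns, presumably with a standard SVM package); the sketch trains a soft-margin linear SVM by full-batch subgradient descent:

```python
def train_linear_svm(X, y, lam=0.01, lr=0.1, steps=2000):
    """Soft-margin linear SVM via subgradient descent on
    lam/2 * ||w||^2 + mean_i max(0, 1 - y_i * (w . x_i + b))."""
    d, n = len(X[0]), len(X)
    w, b = [0.0] * d, 0.0
    for _ in range(steps):
        gw = [lam * wj for wj in w]      # gradient of the regularizer
        gb = 0.0
        for xi, yi in zip(X, y):
            # Only margin violators contribute a hinge subgradient.
            if yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b) < 1:
                for j in range(d):
                    gw[j] -= yi * xi[j] / n
                gb -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, gw)]
        b -= lr * gb
    return w, b

def predict(w, b, x):
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1

# Hypothetical 3-voxel response patterns for two valence conditions
X = [[1.0, 0.2, 0.1], [0.9, 0.3, 0.0], [0.1, 1.0, 0.9], [0.0, 0.8, 1.0]]
y = [1, 1, -1, -1]                       # condition labels
w, b = train_linear_svm(X, y)
accuracy = sum(predict(w, b, xi) == yi for xi, yi in zip(X, y)) / len(y)
```

Because the classifier weights one voxel against another, it can exploit spatially distributed patterns that a voxel-by-voxel univariate test would miss, which is the sensitivity advantage the abstract refers to. (Training-set accuracy as computed here is only a sanity check; the study's 86% figures would come from held-out data.)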
Using film cutting in interface design
It has been suggested that computer interfaces could be made more usable if their designers utilized cinematography techniques, which have evolved to guide
the viewer through a narrative despite frequent discontinuities in the presented scene (i.e., cuts between shots). Because of differences between the domains of
film and interface design, it is not straightforward to understand how such techniques can be transferred. May and Barnard (1995) argued that a psychological
model of watching film could support such a transference. This article presents an extended account of this model, which allows identification of the practice of collocation
of objects of interest in the same screen position before and after a cut. To verify that filmmakers do, in fact, use such techniques successfully, eye movements
were measured while participants watched the entirety of a commerciall