6,300 research outputs found

    A frequency-selective feedback model of auditory efferent suppression and its implications for the recognition of speech in noise

    Get PDF
    The potential contribution of the peripheral auditory efferent system to our understanding of speech in a background of competing noise was studied using a computer model of the auditory periphery and assessed using an automatic speech recognition system. A previous study had shown that a fixed efferent attenuation applied to all channels of a multi-channel model could improve the recognition of connected digit triplets in noise [G. J. Brown, R. T. Ferry, and R. Meddis, J. Acoust. Soc. Am. 127, 943–954 (2010)]. In the current study, an anatomically justified feedback loop was used to automatically regulate separate attenuation values for each auditory channel. This arrangement resulted in a further enhancement of speech recognition over fixed-attenuation conditions. Comparisons between multi-talker babble and pink noise interference conditions suggest that the benefit originates from the model's ability to modify the amount of suppression in each channel separately according to the spectral shape of the interfering sounds.
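    The per-channel feedback regulation described above can be sketched as a simple rule in which each channel's attenuation tracks its own output energy; the threshold, gain, and attenuation cap below are illustrative assumptions, not parameters from the study.

```python
def efferent_feedback(channel_energies, threshold=1.0, gain=0.5, max_atten_db=30.0):
    """Toy per-channel efferent feedback: each auditory channel receives an
    attenuation (in dB) proportional to how far its own output energy exceeds
    a threshold, capped at a maximum suppression. All parameters are
    hypothetical, chosen only to illustrate the feedback idea."""
    attens = []
    for energy in channel_energies:
        excess = max(0.0, energy - threshold)           # only suppress active channels
        attens.append(min(max_atten_db, gain * excess))
    return attens

# Channels dominated by strong (e.g., noisy) input receive more suppression.
print(efferent_feedback([0.2, 1.5, 4.0]))  # [0.0, 0.25, 1.5]
```

    Because each channel is regulated independently, the suppression pattern follows the spectral shape of the interference, which is the property the abstract credits for the recognition benefit.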

    Articulating: the neural mechanisms of speech production

    Full text link
    Speech production is a highly complex sensorimotor task involving tightly coordinated processing across large expanses of the cerebral cortex. Historically, the study of the neural underpinnings of speech suffered from the lack of an animal model. The development of non-invasive structural and functional neuroimaging techniques in the late 20th century has dramatically improved our understanding of the speech network. Techniques for measuring regional cerebral blood flow have illuminated the neural regions involved in various aspects of speech, including feedforward and feedback control mechanisms. In parallel, we have designed, experimentally tested, and refined a neural network model detailing the neural computations performed by specific neuroanatomical regions during speech. Computer simulations of the model account for a wide range of experimental findings, including data on articulatory kinematics and brain activity during normal and perturbed speech. Furthermore, the model is being used to investigate a wide range of communication disorders.
    R01 DC002852 - NIDCD NIH HHS; R01 DC007683 - NIDCD NIH HHS; R01 DC016270 - NIDCD NIH HHS
    Accepted manuscript

    A computer model of auditory efferent suppression: Implications for the recognition of speech in noise

    Get PDF
    The neural mechanisms underlying the ability of human listeners to recognize speech in the presence of background noise are still imperfectly understood. However, there is mounting evidence that the medial olivocochlear system plays an important role, via efferents that exert a suppressive effect on the response of the basilar membrane. The current paper presents a computer modeling study that investigates the possible role of this activity in speech intelligibility in noise. A model of auditory efferent processing [Ferry, R. T., and Meddis, R. (2007). J. Acoust. Soc. Am. 122, 3519–3526] is used to provide acoustic features for a statistical automatic speech recognition system, thus allowing the effects of efferent activity on speech intelligibility to be quantified. Performance of the "basic" model (without efferent activity) on a connected digit recognition task is good when the speech is uncorrupted by noise but falls when noise is present. However, recognition performance is much improved when efferent activity is applied. Furthermore, optimal performance is obtained when the amount of efferent activity is proportional to the noise level. The results obtained are consistent with the suggestion that efferent suppression causes a "release from adaptation" in the auditory-nerve response to noisy speech, which enhances its intelligibility.
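    The finding that optimal performance occurs when efferent activity scales with the noise level can be sketched as a noise-tracking attenuation rule; the slope and ceiling values below are hypothetical, not fitted values from the paper.

```python
def efferent_attenuation_db(noise_db_spl, slope=0.4, ceiling_db=30.0):
    """Toy rule: efferent attenuation grows linearly with the background
    noise level, clipped to a plausible ceiling. `slope` and `ceiling_db`
    are illustrative assumptions, chosen only to show the scaling."""
    return max(0.0, min(ceiling_db, slope * noise_db_spl))

# Louder backgrounds call for more suppression, up to the ceiling.
for noise in (0, 40, 80):
    print(noise, "dB SPL ->", efferent_attenuation_db(noise), "dB attenuation")
```

    In the model's terms, scaling the suppression with the noise level keeps the auditory-nerve response out of deep adaptation, which is the "release from adaptation" the abstract proposes.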

    Quantitative Analysis Linking Inner Hair Cell Voltage Changes and Postsynaptic Conductance Change: A Modelling Study

    Get PDF
    This paper presents a computational model which estimates the postsynaptic conductance change of the mammalian Type I afferent peripheral process when airborne acoustic waves impact on the tympanic membrane. A model of the human auditory periphery is used to estimate the inner hair cell potential change in response to airborne sound. A generic and tunable topology of the mammalian synaptic ribbon is generated, and the voltage dependence of its substructures is used to calculate discrete and probabilistic neurotransmitter vesicle release. Results suggest an almost linear relationship between increasing sound level (in dB SPL) and the postsynaptic conductance for frequencies considered too high for neurons to phase-lock to (i.e., a few kHz). Furthermore, coordinated vesicle release is shown for up to 300–400 Hz, and a mechanism of phase shifting the subharmonic content of a stimulating signal is suggested. Model outputs suggest that the strong onset response and highly synchronised multivesicular release rely on compound fusion of ribbon-tethered vesicles.
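    The discrete, probabilistic vesicle release driven by inner-hair-cell voltage can be sketched with a sigmoidal (Boltzmann-style) release probability and a finite readily releasable pool; all parameter values here are illustrative assumptions rather than the paper's fitted values.

```python
import math
import random

def release_probability(v_ihc_mv, v_half=-40.0, slope_mv=5.0):
    """Boltzmann-style voltage dependence: release probability rises
    sigmoidally as the inner hair cell depolarises past v_half.
    v_half and slope_mv are hypothetical illustrative values."""
    return 1.0 / (1.0 + math.exp(-(v_ihc_mv - v_half) / slope_mv))

def vesicle_releases(v_trace_mv, pool_size=20, seed=0):
    """Draw discrete vesicle release events for each voltage sample from a
    finite readily releasable pool (no replenishment in this toy version)."""
    rng = random.Random(seed)
    pool = pool_size
    events = []
    for v in v_trace_mv:
        p = release_probability(v)
        released = sum(1 for _ in range(pool) if rng.random() < p)
        pool -= released                    # pool depletes as vesicles fuse
        events.append(released)
    return events
```

    A strong depolarising onset releases a large fraction of the pool at once, which is one way to picture the synchronised multivesicular onset response the abstract describes.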

    The evolution of auditory contrast

    Get PDF
    This paper reconciles the standpoint that language users do not aim at improving their sound systems with the observation that languages seem to improve their sound systems. Computer simulations of inventories of sibilants show that Optimality-Theoretic learners who optimize their perception grammars automatically introduce a so-called prototype effect, i.e. the phenomenon that the learner’s preferred auditory realization of a certain phonological category is more peripheral than the average auditory realization of this category in her language environment. In production, however, this prototype effect is counteracted by an articulatory effect that limits the auditory form to something that is not too difficult to pronounce. If the prototype effect and the articulatory effect are of a different size, the learner must end up with an auditorily different sound system from that of her language environment. The computer simulations show that, independently of the initial auditory sound system, a stable equilibrium is reached within a small number of generations. In this stable state, the dispersion of the sibilants of the language strikes an optimal balance between articulatory ease and auditory contrast. The important point is that this is derived within a model without any goal-oriented elements such as dispersion constraints.

    A physiologically inspired model for solving the cocktail party problem.

    Get PDF
    At a cocktail party, we can broadly monitor the entire acoustic scene to detect important cues (e.g., our names being called, or the fire alarm going off), or selectively listen to a target sound source (e.g., a conversation partner). It has recently been observed that individual neurons in the avian field L (analog to the mammalian auditory cortex) can display broad spatial tuning to single targets and selective tuning to a target embedded in spatially distributed sound mixtures. Here, we describe a model inspired by these experimental observations and apply it to process mixtures of human speech sentences. This processing is realized in the neural spiking domain. It converts binaural acoustic inputs into cortical spike trains using a multi-stage model composed of a cochlear filter-bank, a midbrain spatial-localization network, and a cortical network. The output spike trains of the cortical network are then converted back into an acoustic waveform, using a stimulus reconstruction technique. The intelligibility of the reconstructed output is quantified using an objective measure of speech intelligibility. We apply the algorithm to single and multi-talker speech to demonstrate that the physiologically inspired algorithm is able to achieve intelligible reconstruction of an "attended" target sentence embedded in two other non-attended masker sentences. The algorithm is also robust to masker level and displays performance trends comparable to humans. The ideas from this work may help improve the performance of hearing assistive devices (e.g., hearing aids and cochlear implants), speech-recognition technology, and computational algorithms for processing natural scenes cluttered with spatially distributed acoustic objects.
    R01 DC000100 - NIDCD NIH HHS
    Published version

    A Neural Network for Synthesizing the Pitch of an Acoustic Source

    Full text link
    This article describes a neural network model capable of generating a spatial representation of the pitch of an acoustic source. Pitch is one of several auditory percepts used by humans to separate multiple sound sources in the environment from each other. The model provides a neural instantiation of a type of "harmonic sieve". It is capable of quantitatively simulating a large body of psychoacoustical data, including new data on octave shift perception.
    Air Force Office of Scientific Research (90-0128, 90-0175); Defense Advanced Research Projects Agency (90-0083); National Science Foundation (IRI 90-24877); American Society for Engineering Education