
    Communications Biophysics

    Contains research objectives and reports on eight research projects split into three sections.
    National Institutes of Health (Grant 2 PO1 NS13126)
    National Institutes of Health (Grant 5 RO1 NS18682)
    National Institutes of Health (Grant 5 RO1 NS20322)
    National Institutes of Health (Grant 1 RO1 NS 20269)
    National Institutes of Health (Grant 5 T32 NS 07047)
    Symbion, Inc.
    National Institutes of Health (Grant 5 R01 NS10916)
    National Institutes of Health (Grant 1 RO NS 16917)
    National Science Foundation (Grant BNS83-19874)
    National Science Foundation (Grant BNS83-19887)
    National Institutes of Health (Grant 5 RO1 NS12846)
    National Institutes of Health (Grant 1 RO1 NS21322-01)
    National Institutes of Health (Grant 5 T32-NS07099-07)
    National Institutes of Health (Grant 1 RO1 NS14092-06)
    National Science Foundation (Grant BNS77-21751)
    National Institutes of Health (Grant 5 RO1 NS11080)

    The cortical processing of speech sounds in the temporal lobe


    Representation of speech in the primary auditory cortex and its implications for robust speech processing

    Speech has evolved as a primary form of communication between humans. This most-used means of communication has been the subject of intense study for years, but there is still a great deal we do not know about it. It is an oft-repeated fact that even the performance of the best speech processing algorithms still lags far behind that of the average human. It seems inescapable that unless we know more about the way the brain performs this task, our machines cannot go much further. This thesis focuses on the question of speech representation in the brain, from both a physiological and a technological perspective. We explore the representation of speech through the encoding of its smallest elements, phonemic features, in the primary auditory cortex. We report on how populations of neurons with diverse tuning properties respond discriminately to phonemes, resulting in explicit encoding of their parameters. Next, we show that this sparse encoding of phonemic features is a simple consequence of the linear spectro-temporal properties of auditory cortical neurons, and that a spectro-temporal receptive field model can predict similar patterns of activation. This is an important step toward the realization of systems that operate on the same principles as the cortex. Using an inverse method of reconstruction, we also explore the extent to which phonemic features are preserved in the cortical representation of noisy speech. The results suggest that the cortical responses are more robust to noise and that the important features of phonemes are preserved in the cortical representation even in noise. Finally, we explain how a model of this cortical representation can be used in speech processing and enhancement applications to improve their robustness and performance.
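The linear spectro-temporal receptive field (STRF) model mentioned above predicts a neuron's firing rate by convolving its receptive field with the stimulus spectrogram. A minimal sketch of that prediction step follows; the function name, toy dimensions, and random data are illustrative assumptions, not details from the thesis.

```python
import numpy as np

def predict_response(spectrogram, strf):
    """Predict a neuron's firing rate under a linear STRF model:
    r(t) = sum over f, tau of strf[f, tau] * S[f, t - tau].

    spectrogram: (n_freq, n_time) array of spectral energy
    strf:        (n_freq, n_lags) array of spectro-temporal weights
    """
    n_freq, n_time = spectrogram.shape
    _, n_lags = strf.shape
    rate = np.zeros(n_time)
    for tau in range(n_lags):
        # shift the spectrogram back by tau samples, weight by the
        # STRF column at that lag, and sum over frequency channels
        shifted = np.zeros((n_freq, n_time))
        shifted[:, tau:] = spectrogram[:, :n_time - tau]
        rate += np.sum(strf[:, tau:tau + 1] * shifted, axis=0)
    return rate

# toy example: white-noise "spectrogram" and a random STRF
rng = np.random.default_rng(0)
S = rng.standard_normal((32, 100))   # 32 frequency channels, 100 frames
w = rng.standard_normal((32, 10))    # 10 time lags
rate = predict_response(S, w)
print(rate.shape)  # (100,)
```

Because the model is linear, a bank of such STRFs with diverse tuning yields the kind of distributed, feature-selective population response the abstract describes.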

    Computational Models of Representation and Plasticity in the Central Auditory System

    The performance of automated speech processing tasks such as speech recognition and speech activity detection degrades rapidly in challenging acoustic conditions. It is therefore necessary to engineer systems that extract meaningful information from sound while exhibiting invariance to background noise, different speakers, and other disruptive channel conditions. In this thesis, we take a biomimetic approach to these problems, and explore computational strategies used by the central auditory system that underlie neural information extraction from sound. In the first part of this thesis, we explore coding strategies employed by the central auditory system that yield neural responses with desirable noise robustness. We specifically demonstrate that a coding strategy based on sustained neural firings yields richly structured spectro-temporal receptive fields (STRFs) that reflect the structure and diversity of natural sounds. The emergent receptive fields are comparable to known physiological neuronal properties and can be employed as a signal processing strategy to improve noise invariance in a speech recognition task. Next, we extend the model of sound encoding based on spectro-temporal receptive fields to incorporate the cognitive effects of selective attention. We propose a framework for modeling attention-driven plasticity that induces changes to receptive fields driven by task demands. We define a discriminative cost function whose optimization and solution reflect a biologically plausible strategy for STRF adaptation that helps listeners better attend to target sounds. Importantly, the adaptation patterns predicted by the framework have a close correspondence with known neurophysiological data. We next generalize the framework to act on the spectro-temporal dynamics of task-relevant stimuli, and make predictions for tasks that have yet to be experimentally measured. We argue that our generalization represents a form of object-based attention, which helps shed light on the current debate about auditory attentional mechanisms. Finally, we show how attention-modulated STRFs form a high-fidelity representation of the attended target, and we apply our results to obtain improvements in a speech activity detection task. Overall, the results of this thesis improve our general understanding of central auditory processing, and our computational frameworks can be used to guide further studies in animal models. Furthermore, our models inspire signal processing strategies that are useful for automated speech and sound processing tasks.
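The attention-driven plasticity idea above, adapting an STRF by optimizing a discriminative cost under task demands, can be illustrated with a simple gradient-descent sketch. The cost function below is a hypothetical stand-in (a target-vs-distractor margin with a regularizer pulling the STRF toward its baseline shape), not the thesis's actual formulation; all names and values are assumptions.

```python
import numpy as np

def adapt_strf(strf, target, distractor, lr=0.05, reg=0.1, steps=100):
    """Adapt a (flattened) STRF to enhance the response to a target
    stimulus relative to a distractor, while a quadratic regularizer
    keeps the adapted STRF near its baseline shape w0.

    Hypothetical discriminative cost:
        J(w) = -(w . target - w . distractor) + reg * ||w - w0||^2
    Each gradient step therefore widens the target/distractor margin.
    """
    w0 = strf.copy()
    w = strf.copy()
    for _ in range(steps):
        grad = -(target - distractor) + 2.0 * reg * (w - w0)
        w -= lr * grad
    return w

rng = np.random.default_rng(1)
t = rng.standard_normal(64)        # target stimulus feature vector
d = rng.standard_normal(64)        # distractor feature vector
w_base = rng.standard_normal(64)   # baseline STRF (flattened)
w_adapted = adapt_strf(w_base, t, d)

# adaptation widens the margin between target and distractor responses
print(w_adapted @ (t - d) - w_base @ (t - d))
```

The regularization term plays the role of the biological constraint that receptive fields deform around, rather than abandon, their pre-attentive tuning, mirroring the task-driven STRF changes the framework predicts.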

    Determination and evaluation of clinically efficient stopping criteria for the multiple auditory steady-state response technique

    Background: Although the auditory steady-state response (ASSR) technique uses objective statistical detection algorithms to estimate behavioural hearing thresholds, the audiologist still has to decide when to terminate ASSR recordings, which reintroduces a certain degree of subjectivity. Aims: The present study aimed at establishing clinically efficient stopping criteria for a multiple 80-Hz ASSR system. Methods: In Experiment 1, data from 31 normal-hearing subjects were analyzed off-line to propose stopping rules. Accordingly, ASSR recordings were stopped when (1) all 8 responses reached significance and significance was maintained for 8 consecutive sweeps; (2) the mean noise levels were ≤ 4 nV (if, at this “≤ 4-nV” criterion, p-values were between 0.05 and 0.1, measurements were extended only once by 8 sweeps); or (3) a maximum of 48 sweeps was attained. In Experiment 2, these stopping criteria were applied to 10 normal-hearing and 10 hearing-impaired adults to assess their efficiency. Results: Applying these stopping rules yielded ASSR thresholds comparable to those reported in other multiple-ASSR research with normal-hearing and hearing-impaired adults. Furthermore, in 80% of the cases, ASSR thresholds could be obtained within a time frame of 1 hour. Investigating the significant response amplitudes of the hearing-impaired adults through cumulative curves indicated that a noise-stop criterion higher than “≤ 4 nV” can probably be used. Conclusions: The proposed stopping rules can be used in adults to determine accurate ASSR thresholds within an acceptable time frame of about 1 hour. However, additional research with infants and with adults having varying degrees and configurations of hearing loss is needed to optimize these criteria.
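The three stopping rules in the abstract are explicit enough to express as a decision procedure. The sketch below encodes them directly; the thresholds (p < 0.05, 4 nV, 8 sweeps, 48 sweeps) come from the abstract, while the function shape and parameter names are illustrative assumptions.

```python
def should_stop(sweep_count, p_values, consecutive_sig_sweeps,
                mean_noise_nv, extended=False,
                noise_criterion=4.0, max_sweeps=48):
    """Decide whether to terminate a multiple-ASSR recording.

    Returns (stop, extend): stop is True when the recording should end;
    extend is True when the "<= 4 nV" rule grants the single permitted
    8-sweep extension for borderline p-values.
    """
    # Rule 3: hard ceiling of 48 sweeps
    if sweep_count >= max_sweeps:
        return True, False
    # Rule 1: all 8 responses significant, maintained for 8 sweeps
    if all(p < 0.05 for p in p_values) and consecutive_sig_sweeps >= 8:
        return True, False
    # Rule 2: mean noise at or below the 4-nV criterion
    if mean_noise_nv <= noise_criterion:
        # borderline p-values (0.05..0.1) earn one 8-sweep extension
        if not extended and any(0.05 <= p <= 0.1 for p in p_values):
            return False, True
        return True, False
    return False, False

# all 8 responses significant and held for 8 sweeps: stop
print(should_stop(24, [0.01] * 8, 8, 10.0))   # (True, False)
# quiet recording but borderline p-values: extend once by 8 sweeps
print(should_stop(24, [0.07] * 8, 0, 3.0))    # (False, True)
```

Encapsulating the rules this way is precisely what removes the residual subjectivity the abstract describes: the decision to terminate no longer depends on the audiologist's judgment during the session.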