2,726 research outputs found

    Spike-Timing-Based Computation in Sound Localization

    Get PDF
    Spike timing is precise in the auditory system and it has been argued that it conveys information about auditory stimuli, in particular about the location of a sound source. However, beyond simple time differences, the way in which neurons might extract this information is unclear and the potential computational advantages are unknown. The computational difficulty of this task for an animal is to locate the source of an unexpected sound from two monaural signals that are highly dependent on the unknown source signal. In neuron models consisting of spectro-temporal filtering and spiking nonlinearity, we found that the binaural structure induced by spatialized sounds is mapped to synchrony patterns that depend on source location rather than on source signal. Location-specific synchrony patterns would then result in the activation of location-specific assemblies of postsynaptic neurons. We designed a spiking neuron model which exploited this principle to locate a variety of sound sources in a virtual acoustic environment using measured human head-related transfer functions. The model was able to accurately estimate the location of previously unknown sounds in both azimuth and elevation (including front/back discrimination) in a known acoustic environment. We found that multiple representations of different acoustic environments could coexist as sets of overlapping neural assemblies which could be associated with spatial locations by Hebbian learning. The model demonstrates the computational relevance of relative spike timing to extract spatial information about sources independently of the source signal

    An event-driven probabilistic model of sound source localization using cochlea spikes

    Full text link
    This work presents a probabilistic model that estimates the location of sound sources using the output spikes of a silicon cochlea such as the Dynamic Audio Sensor. Unlike previous work which estimated the source locations directly from the interaural time differences (ITDs) extracted from the timing of the cochlea spikes, the spikes are used instead to support a distribution model of the ITDs representing possible locations of sound sources. Results on noisy single speaker recordings show average accuracies of approximately 80% on detecting the correct source locations and an estimation lag of <;100ms

    A physiologically inspired model for solving the cocktail party problem.

    Get PDF
    At a cocktail party, we can broadly monitor the entire acoustic scene to detect important cues (e.g., our names being called, or the fire alarm going off), or selectively listen to a target sound source (e.g., a conversation partner). It has recently been observed that individual neurons in the avian field L (analog to the mammalian auditory cortex) can display broad spatial tuning to single targets and selective tuning to a target embedded in spatially distributed sound mixtures. Here, we describe a model inspired by these experimental observations and apply it to process mixtures of human speech sentences. This processing is realized in the neural spiking domain. It converts binaural acoustic inputs into cortical spike trains using a multi-stage model composed of a cochlear filter-bank, a midbrain spatial-localization network, and a cortical network. The output spike trains of the cortical network are then converted back into an acoustic waveform, using a stimulus reconstruction technique. The intelligibility of the reconstructed output is quantified using an objective measure of speech intelligibility. We apply the algorithm to single and multi-talker speech to demonstrate that the physiologically inspired algorithm is able to achieve intelligible reconstruction of an "attended" target sentence embedded in two other non-attended masker sentences. The algorithm is also robust to masker level and displays performance trends comparable to humans. The ideas from this work may help improve the performance of hearing assistive devices (e.g., hearing aids and cochlear implants), speech-recognition technology, and computational algorithms for processing natural scenes cluttered with spatially distributed acoustic objects.R01 DC000100 - NIDCD NIH HHSPublished versio

    Cross-Correlation in the Auditory Coincidence Detectors of Owls

    Get PDF
    Interaural time difference (ITD) plays a central role in many auditory functions, most importantly in sound localization. The classic model for how ITD is computed was put forth by Jeffress (1948). One of the predictions of the Jeffress model is that the neurons that compute ITD should behave as cross-correlators. Whereas cross-correlation-like properties of the ITD-computing neurons have been reported, attempts to show that the shape of the ITD response function is determined by the spectral tuning of the neuron, a core prediction of cross-correlation, have been unsuccessful. Using reverse correlation analysis, we demonstrate in the barn owl that the relationship between the spectral tuning and the ITD response of the ITD-computing neurons is that predicted by cross-correlation. Moreover, we show that a model of coincidence detector responses derived from responses to binaurally uncorrelated noise is consistent with binaural interaction based on cross-correlation. These results are thus consistent with one of the key tenets of the Jeffress model. Our work sets forth both the methodology to answer whether cross-correlation describes coincidence detector responses and a demonstration that in the barn owl, the result is that expected by theory

    Applying stochastic spike train theory for high-accuracy MEG/EEG

    Get PDF
    The accuracy of electroencephalography (EEG) and magnetoencephalography (MEG) is challenged by overlapping sources from within the brain. This lack of accuracy is a severe limitation to the possibilities and reliability of modern stimulation protocols in basic research and clinical diagnostics. As a solution, we here introduce a theory of stochastic neuronal spike timing probability densities for describing the large-scale spiking activity in neural networks, and a novel spike density component analysis (SCA) method for isolating specific neural sources. Three studies are conducted based on 564 cases of evoked responses to auditory stimuli from 94 human subjects each measured with 60 EEG electrodes and 306 MEG sensors. In the first study we show that the large-scale spike timing (but not non-encephalographic artifacts) in MEG/EEG waveforms can be modeled with Gaussian probability density functions with …Non peer reviewe

    Applying stochastic spike train theory for high-accuracy human MEG/EEG

    Get PDF
    Background The accuracy of electroencephalography (EEG) and magnetoencephalography (MEG) in measuring neural evoked responses (ERs) is challenged by overlapping neural sources. This lack of accuracy is a severe limitation to the application of ERs to clinical diagnostics. New method We here introduce a theory of stochastic neuronal spike timing probability densities for describing the large-scale spiking activity in neural assemblies, and a spike density component analysis (SCA) method for isolating specific neural sources. The method is tested in three empirical studies with 564 cases of ERs to auditory stimuli from 94 humans, each measured with 60 EEG electrodes and 306 MEG sensors, and a simulation study with 12,300 ERs. Results The first study showed that neural sources (but not non-encephalic artifacts) in individual averaged MEG/EEG waveforms are modelled accurately with temporal Gaussian probability density functions (median 99.7 %–99.9 % variance explained). The following studies confirmed that SCA can isolate an ER, namely the mismatch negativity (MMN), and that SCA reveals inter-individual variation in MMN amplitude. Finally, SCA reduced errors by suppressing interfering sources in simulated cases. Comparison with existing methods We found that gamma and sine functions fail to adequately describe individual MEG/EEG waveforms. Also, we observed that principal component analysis (PCA) and independent component analysis (ICA) does not consistently suppress interference from overlapping brain activity in neither empirical nor simulated cases. Conclusions These findings suggest that the overlapping neural sources in single-subject or patient data can be more accurately separated by applying SCA in comparison to PCA and ICA.Peer reviewe

    Design of a silicon cochlea system with biologically faithful response

    Get PDF
    This paper presents the design and simulation results of a silicon cochlea system that has closely similar behavior as the real cochlea. A cochlea filter-bank based on the improved three-stage filter cascade structure is used to model the frequency decomposition function of the basilar membrane; a filter tuning block is designed to model the adaptive response of the cochlea; besides, an asynchronous event-triggered spike codec is employed as the system interface with bank-end spiking neural networks. As shown in the simulation results, the system has biologically faithful frequency response, impulse response, and active adaptation behavior; also the system outputs multiple band-pass channels of spikes from which the original sound input can be recovered. The proposed silicon cochlea is feasible for analog VLSI implementation so that it not only emulates the way that sounds are preprocessed in human ears but also is able match the compact physical size of a real cochlea

    Multisensory Training Improves Auditory Spatial Processing following Bilateral Cochlear Implantation

    Get PDF
    Cochlear implants (CIs) partially restore hearing to the deaf by directly stimulating the inner ear. In individuals fitted with CIs, lack of auditory experience due to loss of hearing before language acquisition can adversely impact outcomes. For example, adults with early-onset hearing loss generally do not integrate inputs from both ears effectively when fitted with bilateral CIs (BiCIs). Here, we used an animal model to investigate the effects of long-term deafness on auditory localization with BiCIs and approaches for promoting the use of binaural spatial cues. Ferrets were deafened either at the age of hearing onset or as adults. All animals were implanted in adulthood, either unilaterally or bilaterally, and were subsequently assessed for their ability to localize sound in the horizontal plane. The unilaterally implanted animals were unable to perform this task, regardless of the duration of deafness. Among animals with BiCIs, early-onset hearing loss was associated with poor auditory localization performance, compared with late-onset hearing loss. However, performance in the early-deafened group with BiCIs improved significantly after multisensory training with interleaved auditory and visual stimuli. We demonstrate a possible neural substrate for this by showing a training-induced improvement in the responsiveness of auditory cortical neurons and in their sensitivity to interaural level differences, the principal localization cue available to BiCI users. Importantly, our behavioral and physiological evidence demonstrates a facilitative role for vision in restoring auditory spatial processing following potential cross-modal reorganization. These findings support investigation of a similar training paradigm in human CI users
    • …
    corecore