1,758 research outputs found

    A binaural grouping model for predicting speech intelligibility in multitalker environments

    Get PDF
    Spatially separating speech maskers from target speech often leads to a large intelligibility improvement. Modeling this phenomenon has long been of interest to binaural-hearing researchers for uncovering brain mechanisms and for improving signal-processing algorithms in hearing-assistive devices. Much of the previous binaural modeling work focused on the unmasking enabled by binaural cues at the periphery, and little quantitative modeling has been directed toward the grouping or source-separation benefits of binaural processing. In this article, we propose a binaural model that focuses on grouping, specifically on the selection of time-frequency units that are dominated by signals from the direction of the target. The proposed model uses Equalization-Cancellation (EC) processing with a binary decision rule to estimate a time-frequency binary mask. EC processing is carried out to cancel the target signal and the energy change between the EC input and output is used as a feature that reflects target dominance in each time-frequency unit. The processing in the proposed model requires little computational resources and is straightforward to implement. In combination with the Coherence-based Speech Intelligibility Index, the model is applied to predict the speech intelligibility data measured by Marrone et al. The predicted speech reception threshold matches the pattern of the measured data well, even though the predicted intelligibility improvements relative to the colocated condition are larger than some of the measured data, which may reflect the lack of internal noise in this initial version of the model.R01 DC000100 - NIDCD NIH HH

    Cortical transformation of spatial processing for solving the cocktail party problem: a computational model(1,2,3).

    Get PDF
    In multisource, "cocktail party" sound environments, human and animal auditory systems can use spatial cues to effectively separate and follow one source of sound over competing sources. While mechanisms to extract spatial cues such as interaural time differences (ITDs) are well understood in precortical areas, how such information is reused and transformed in higher cortical regions to represent segregated sound sources is not clear. We present a computational model describing a hypothesized neural network that spans spatial cue detection areas and the cortex. This network is based on recent physiological findings that cortical neurons selectively encode target stimuli in the presence of competing maskers based on source locations (Maddox et al., 2012). We demonstrate that key features of cortical responses can be generated by the model network, which exploits spatial interactions between inputs via lateral inhibition, enabling the spatial separation of target and interfering sources while allowing monitoring of a broader acoustic space when there is no competition. We present the model network along with testable experimental paradigms as a starting point for understanding the transformation and organization of spatial information from midbrain to cortex. This network is then extended to suggest engineering solutions that may be useful for hearing-assistive devices in solving the cocktail party problem.R01 DC000100 - NIDCD NIH HHSPublished versio

    A physiologically inspired model for solving the cocktail party problem.

    Get PDF
    At a cocktail party, we can broadly monitor the entire acoustic scene to detect important cues (e.g., our names being called, or the fire alarm going off), or selectively listen to a target sound source (e.g., a conversation partner). It has recently been observed that individual neurons in the avian field L (analog to the mammalian auditory cortex) can display broad spatial tuning to single targets and selective tuning to a target embedded in spatially distributed sound mixtures. Here, we describe a model inspired by these experimental observations and apply it to process mixtures of human speech sentences. This processing is realized in the neural spiking domain. It converts binaural acoustic inputs into cortical spike trains using a multi-stage model composed of a cochlear filter-bank, a midbrain spatial-localization network, and a cortical network. The output spike trains of the cortical network are then converted back into an acoustic waveform, using a stimulus reconstruction technique. The intelligibility of the reconstructed output is quantified using an objective measure of speech intelligibility. We apply the algorithm to single and multi-talker speech to demonstrate that the physiologically inspired algorithm is able to achieve intelligible reconstruction of an "attended" target sentence embedded in two other non-attended masker sentences. The algorithm is also robust to masker level and displays performance trends comparable to humans. The ideas from this work may help improve the performance of hearing assistive devices (e.g., hearing aids and cochlear implants), speech-recognition technology, and computational algorithms for processing natural scenes cluttered with spatially distributed acoustic objects.R01 DC000100 - NIDCD NIH HHSPublished versio

    Incorporating Information Literacy and Site Specific Dance

    Get PDF
    Selene and Paul explained their collaboration by first giving the history of the Dance major here at UVM and then discussing how they incorporated information literacy skills into their class Site Dance & Performance. They discussed the final projects\u27 locations, parameters, what actually transpired, and finally, the students\u27 thoughts on the whole experience

    A Modeling Study of the Responses of the Lateral Superior Olive to Ipsilateral Sinusoidally Amplitude-Modulated Tones

    Get PDF
    The lateral superior olive (LSO) is a brainstem nucleus that is classically understood to encode binaural information in high-frequency sounds. Previous studies have shown that LSO cells are sensitive to envelope interaural time difference in sinusoidally amplitude-modulated (SAM) tones (Joris and Yin, J Neurophysiol 73:1043–1062, 1995; Joris, J Neurophysiol 76:2137–2156, 1996) and that a subpopulation of LSO neurons exhibit low-threshold potassium currents mediated by Kv1 channels (Barnes-Davies et al., Eur J Neurosci 19:325–333, 2004). It has also been shown that in many LSO cells the average response rate to ipsilateral SAM tones decreases with modulation frequency above a few hundred Hertz (Joris and Yin, J Neurophysiol 79:253–269, 1998). This low-pass feature is not directly inherited from the inputs to the LSO since the response rate of these input neurons changes little with increasing modulation frequency. In the current study, an LSO cell model is developed to investigate mechanisms consistent with the responses described above, notably the emergent rate decrease with increasing frequency. The mechanisms explored included the effects of after-hyperpolarization (AHP) channels, the dynamics of low-threshold potassium channels (KLT), and the effects of background inhibition. In the model, AHP channels alone were not sufficient to induce the observed rate decrease at high modulation frequencies. The model also suggests that the background inhibition alone, possibly from the medial nucleus of the trapezoid body, can account for the small rate decrease seen in some LSO neurons, but could not explain the large rate decrease seen in other LSO neurons at high modulation frequencies. In contrast, both the small and large rate decreases were replicated when KLT channels were included in the LSO neuron model. These results support the conclusion that KLT channels may play a major role in the large rate decreases seen in some units and that background inhibition may be a contributing factor, a factor that could be adequate for small decreases

    Comparison of the mean photospheric magnetic field and the interplanetary magnetic field

    Get PDF
    Polarity comparison of solar magnetic field and interplanetary magnetic fiel

    Communications Biophysics

    Get PDF
    Contains reports on five research projects.National Institutes of Health (Grant 1 P01 GM-14940-01)Joint Services Electronics Program under Contract DA 28-043-AMC-02536(E

    Effects of Sire EPD, Dam Traits and Calf Traits on Calving Difficulty and Subsequent Reproduction of Two-Year-Old Heifers

    Get PDF
    A three-year study evaluated effects of sire birth weight EPD, heifer and calf traits on valuing difficulty and subsequent rebreeding of two-year-old cows. MARC II yearling heifers (n=550) were assigned for breeding to one of four angus sires with birth weight EPD of -2.1, -1.8, +6.3 and +5.9 lb. Of all heifer weights, only dam birth weight affected calving difficulty score. Heifers requiring caesareans had smallest pelvic areas. Calving difficulty increased as calf birth weight and external measurements increased. Low EPD sires produced calves with smaller head and foot circumferences and less dystocia. Degree of calving difficulty did not affect subsequent pregnancy rates, but did delay rebreeding conception date
    • …
    corecore