428 research outputs found

    Can training improve speech intelligibility and voice recognition?

    We found that people learn voices very rapidly. We are able to recognize a new voice (and distinguish it from other voices) accurately after as little as 10 minutes of training. While recognition of a voice seems to plateau quite quickly (our recognition doesn't improve with more training), intelligibility does keep improving as training continues up to one hour. We think the benefits of voice familiarity (such as improved intelligibility in everyday settings, helping people with hearing loss or people who work in noisy environments) can be achieved through deliberate training.

    Speech-evoked brain activity is more robust to competing speech when it is spoken by someone familiar: Speech representations for familiar voices

    When speech is masked by competing sound, people are better at understanding what is said if the talker is familiar compared to unfamiliar. The benefit is robust, but how does processing of familiar voices facilitate intelligibility? We combined high-resolution fMRI with representational similarity analysis to quantify the difference in distributed activity between clear and masked speech. We demonstrate that brain representations of spoken sentences are less affected by a competing sentence when they are spoken by a friend or partner than by someone unfamiliar, effectively showing a cortical signal-to-noise ratio (SNR) enhancement for familiar voices. This effect correlated with the familiar-voice intelligibility benefit. We functionally parcellated auditory cortex, and found that the most prominent familiar-voice advantage was manifest along the posterior superior and middle temporal gyri. Overall, our results demonstrate that experience-driven improvements in intelligibility are associated with enhanced multivariate pattern activity in posterior temporal cortex.

    Intelligibility benefit for familiar voices is not accompanied by better discrimination of fundamental frequency or vocal tract length

    Speech is more intelligible when it is spoken by familiar than by unfamiliar people. If this benefit arises because key voice characteristics like perceptual correlates of fundamental frequency or vocal tract length (VTL) are more accurately represented for familiar voices, listeners may be able to discriminate smaller manipulations to such characteristics for familiar than for unfamiliar voices. We measured participants’ (N = 17) thresholds for discriminating pitch (correlate of fundamental frequency, or glottal pulse rate) and formant spacing (correlate of VTL; ‘VTL-timbre’) for voices that were familiar (participants’ friends) and unfamiliar (other participants’ friends). As expected, familiar voices were more intelligible. However, discrimination thresholds were no smaller for the same familiar voices. The size of the intelligibility benefit for a familiar over an unfamiliar voice did not relate to the difference in discrimination thresholds for the same voices. Also, the familiar-voice intelligibility benefit was just as large following perceptible manipulations to pitch and VTL-timbre. These results are more consistent with cognitive accounts of speech perception than with traditional accounts that predict better discrimination.

    Intelligibility benefit for familiar voices does not depend on better discrimination of fundamental frequency or vocal tract length

    Speech is more intelligible when it is spoken by familiar than by unfamiliar people. Two cues to voice identity are glottal pulse rate (GPR) and vocal tract length (VTL): perhaps these features are more accurately represented for familiar voices in a listener’s brain. If so, listeners should be able to discriminate smaller manipulations to perceptual correlates of these vocal parameters for familiar than for unfamiliar voices. We recruited pairs of friends who had known each other for 0.5–22.5 years. We measured thresholds for discriminating pitch (correlate of GPR) and formant spacing (correlate of VTL; ‘VTL-timbre’) for voices that were familiar (friends) and unfamiliar (friends of other participants). When a competing talker was present, speech was substantially more intelligible when it was spoken in a familiar voice. Discrimination thresholds were not systematically smaller for familiar compared to unfamiliar talkers. However, participants detected smaller deviations to VTL-timbre than to pitch uniquely for familiar talkers, suggesting that a different balance of characteristics contributes to discrimination of familiar and unfamiliar voices. Across participants, we found no relationship between the size of the intelligibility benefit for a familiar over an unfamiliar voice and the difference in discrimination thresholds for the same voices. Also, the intelligibility benefit was not affected by the acoustic manipulations we imposed on voices to assess discrimination thresholds. Overall, these results provide no evidence that two important cues to voice identity (pitch and VTL-timbre) are more accurately represented when voices are familiar, or are necessarily responsible for the large intelligibility benefit derived from familiar voices.

    Speech-evoked brain activity is more robust to competing speech when it is spoken by someone familiar

    The representation of spoken-sentence information in specific regions of the brain is more resistant to interference by competing speech if the target talker is familiar. The posterior temporal cortex represents information about target speech more robustly in the presence of competing speech when the target talker is a friend or partner. We have also shown that the relative robustness of the representations for a familiar, compared to an unfamiliar, voice aligns with the intelligibility benefit that the listener gains from that familiar voice.

    Speech-evoked brain activity is more robust to competing speech when it is spoken by someone familiar

    When speech is masked by competing sound, people are better at understanding what is said if the talker is familiar compared to unfamiliar. The benefit is robust, but how does processing of familiar voices facilitate intelligibility? We combined high-resolution fMRI with representational similarity analysis to quantify the difference in distributed activity between clear and masked speech. We demonstrate that brain representations of spoken sentences are less affected by a competing sentence when they are spoken by a friend or partner than by someone unfamiliar, effectively showing a cortical signal-to-noise ratio (SNR) enhancement for familiar voices. This effect correlated with the familiar-voice intelligibility benefit. We functionally parcellated auditory cortex, and found that the most prominent familiar-voice advantage was manifest along the posterior superior and middle temporal gyri. Overall, our results demonstrate that experience-driven improvements in intelligibility are associated with enhanced multivariate pattern activity in posterior temporal cortex.

    Effects of a consistent target or masker voice on target speech intelligibility in two- and three-talker mixtures.

    When the spatial location or identity of a sound is held constant, it is not masked as effectively by competing sounds. This suggests that experience with a particular voice over time might facilitate perceptual organization in multitalker environments. The current study examines whether listeners benefit from experience with a voice only when it is the target, or also when it is a masker, using diotic presentation and a closed-set task (coordinate response measure). A reliable interaction was observed such that, in two-talker mixtures, consistency of masker or target voice over 3 to 7 trials significantly benefited target recognition performance, whereas in three-talker mixtures, target, but not masker, consistency was beneficial. Overall, this work suggests that voice consistency improves intelligibility, although somewhat differently when two talkers, compared to three talkers, are present, suggesting that consistent-voice information facilitates intelligibility in at least two different ways. Listeners can use a template-matching strategy to extract a known voice from a mixture when it is the target. However, consistent-voice information facilitates segregation only when two, but not three, talkers are present.