
    Transfer Effect of Speech-sound Learning on Auditory-motor Processing of Perceived Vocal Pitch Errors

    Speech perception and production are intimately linked. There is evidence that speech motor learning results in changes to auditory processing of speech. Whether speech motor control benefits from perceptual learning in speech, however, remains unclear. This event-related potential study investigated whether speech-sound learning can modulate the processing of feedback errors during vocal pitch regulation. Mandarin speakers were trained to perceive five Thai lexical tones while learning to associate pictures with spoken words over 5 days. Before and after training, participants produced sustained vowel sounds while they heard their vocal pitch feedback unexpectedly perturbed. As compared to the pre-training session, the magnitude of vocal compensation significantly decreased for the control group, but remained consistent for the trained group at the post-training session. However, the trained group had smaller and faster N1 responses to pitch perturbations and exhibited enhanced P2 responses that correlated significantly with their learning performance. These findings indicate that the cortical processing of vocal pitch regulation can be shaped by learning new speech-sound associations, suggesting that perceptual learning in speech can produce transfer effects that facilitate the neural mechanisms underlying the online monitoring of auditory feedback regarding vocal production.

    Effect of tones on voice onset time (VOT) in Cantonese aspirated stops

    "A dissertation submitted in partial fulfillment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, June 30, 2010."Includes bibliographical references (p. 22-24).Thesis (B.Sc)--University of Hong Kong, 2010.The study investigated the possible interaction between VOT values associated with aspirated stops produced at six different lexical tones (high falling, high rising, mid level, mid-low falling, mid-low rising and mid-low level) in Cantonese. A total of 27 male Cantonese speakers were recruited and they were instructed to read phrases containing targeted CV syllables formed by the aspirated Cantonese stops (/ph/, /th/, and /kh/) and the vowel /a/ at the six tones. VOT analysis revealed that, across aspirated stops, tones in the upper tone register produced shorter VOT while those in the lower tone register had longer VOT values. In particular, mid-low rising tone showed the longest VOT than all other tones. This finding indicated an interaction between VOT and tone during Cantonese stop production is confirmed.published_or_final_versionSpeech and Hearing SciencesBachelorBachelor of Science in Speech and Hearing Science

    Mandarin speech perception in combined electric and acoustic stimulation.

    For deaf individuals with residual low-frequency acoustic hearing, combined use of a cochlear implant (CI) and hearing aid (HA) typically provides better speech understanding than with either device alone. Because of coarse spectral resolution, CIs do not provide fundamental frequency (F0) information that contributes to understanding of tonal languages such as Mandarin Chinese. The HA can provide good representation of F0 and, depending on the range of aided acoustic hearing, first and second formant (F1 and F2) information. In this study, Mandarin tone, vowel, and consonant recognition in quiet and noise was measured in 12 adult Mandarin-speaking bimodal listeners with the CI-only and with the CI+HA. Tone recognition was significantly better with the CI+HA in noise, but not in quiet. Vowel recognition was significantly better with the CI+HA in quiet, but not in noise. There was no significant difference in consonant recognition between the CI-only and the CI+HA in quiet or in noise. There was a wide range in bimodal benefit, with improvements often greater than 20 percentage points in some tests and conditions. The bimodal benefit was compared to CI subjects' HA-aided pure-tone average (PTA) thresholds between 250 and 2000 Hz; subjects were divided into two groups: "better" PTA (<50 dB HL) or "poorer" PTA (>50 dB HL). The bimodal benefit differed significantly between groups only for consonant recognition. The bimodal benefit for tone recognition in quiet was significantly correlated with CI experience, suggesting that bimodal CI users learn to better combine low-frequency spectro-temporal information from acoustic hearing with temporal envelope information from electric hearing. Given the small number of subjects in this study (n = 12), further research with Chinese bimodal listeners may provide more information regarding the contribution of acoustic and electric hearing to tonal language perception

    Context effects on second-language learning of tonal contrasts.

    Studies of lexical tone learning generally focus on monosyllabic contexts, while reports of phonetic learning benefits associated with input variability are based largely on experienced learners. This study trained inexperienced learners on Mandarin tonal contrasts to test two hypotheses regarding the influence of context and variability on tone learning. The first hypothesis was that increased phonetic variability of tones in disyllabic contexts makes initial tone learning more challenging in disyllabic than monosyllabic words. The second hypothesis was that the learnability of a given tone varies across contexts due to differences in tonal variability. Results of a word learning experiment supported both hypotheses: tones were acquired less successfully in disyllables than in monosyllables, and the relative difficulty of disyllables was closely related to contextual tonal variability. These results indicate limited relevance of monosyllable-based data on Mandarin learning for the disyllabic majority of the Mandarin lexicon. Furthermore, in the short term, variability can diminish learning; its effects are not necessarily beneficial but dependent on acquisition stage and other learner characteristics. These findings thus highlight the importance of considering contextual variability and the interaction between variability and type of learner in the design, interpretation, and application of research on phonetic learning.

    Lexical Tone vs. F0 Effects on VOT in Cantonese

    This study investigates the effects of lexical tone on the Voice Onset Time (VOT) of prevocalic stops in Cantonese. It builds on literature showing how various factors affect VOT without a resultant loss in phonological contrast and also ties in a body of research on tone and consonant interaction. The specific research questions addressed are (1) Does tone have an effect on VOT in Cantonese?; (2) If so, what kind of an effect does it have?; and (3) Is this effect purely an automatic articulatory consequence of F0 modulation or is this effect also mediated by lexical tone and hence a secondary cue that contributes towards maintaining phonological contrasts between different tonal categories? To address these questions, the speech of 6 native speakers (5 male and 1 female) of Hong Kong Cantonese was examined. A total of 80 tokens of words contrasting in tone and aspiration were analyzed for each speaker for a grand total of 480 tokens. Results from an ANOVA test showed that there is a statistically significant effect (

    The listening talker: A review of human and algorithmic context-induced modifications of speech

    Speech output technology is finding widespread application, including in scenarios where intelligibility might be compromised - at least for some listeners - by adverse conditions. Unlike most current algorithms, talkers continually adapt their speech patterns as a response to the immediate context of spoken communication, where the type of interlocutor and the environment are the dominant situational factors influencing speech production. Observations of talker behaviour can motivate the design of more robust speech output algorithms. Starting with a listener-oriented categorisation of possible goals for speech modification, this review article summarises the extensive set of behavioural findings related to human speech modification, identifies which factors appear to be beneficial, and goes on to examine previous computational attempts to improve intelligibility in noise. The review concludes by tabulating 46 speech modifications, many of which have yet to be perceptually or algorithmically evaluated. Consequently, the review provides a roadmap for future work in improving the robustness of speech output.

    The Phonetics of VOT and Tone Interaction in Cantonese

    This study investigates the possible effects of lexical tone on Voice Onset Time (VOT) in Cantonese, a tonal language with a two-way contrast between short-lag (voiceless unaspirated) and long-lag (voiceless aspirated) stops. VOT was measured as the time interval between the stop burst and the onset of voicing for the following vowel. The recorded speech of 6 native speakers each producing 10 repetitions of 20 different words contrasting in aspiration and tone was analyzed. Tokens from each individual subject were divided into two sets for the purpose of comparison. The first set involved a comparison between the effects of a high-level 55 tone and a mid-level 33 tone. Results showed no significant VOT differences unless aspirated and unaspirated stops were examined separately. In this case, only the aspirated stops showed a significant difference with the 33 tone associated with higher VOT. The second set of stimuli compared the effects of 4 different phonemic tone categories (55, 25, 33, and 21) on VOT. Results show that words beginning with a lower tonal onset (and thus the 25 and 21 tones) correlated with higher VOT than words beginning with a higher tonal onset (the 55 and 33 tones)

    On the mechanism of response latencies in auditory nerve fibers

    Despite the structural differences of the middle and inner ears, the latency pattern in auditory nerve fibers to an identical sound has been found to be similar across numerous species. Studies have shown this similarity even in species with distinct cochleae or without a basilar membrane. This stimulus-, neuron-, and species-independent similarity of latency cannot be simply explained by the concept of cochlear traveling waves, which is generally accepted as the main cause of the neural latency pattern. An original concept of the Fourier pattern is defined, intended to characterize a feature of temporal processing (specifically, phase encoding) that is not readily apparent in more conventional analyses. The pattern is created by marking the first amplitude maximum for each sinusoid component of the stimulus, to encode phase information. The hypothesis is that the hearing organ serves as a running analyzer whose output reflects synchronization of auditory neural activity consistent with the Fourier pattern. A combination of experimental, correlational, and meta-analytic approaches is used to test the hypothesis. Manipulations included phase encoding and stimuli to test their effects on the predicted latency pattern. Animal studies in the literature using the same stimulus were then compared to determine the degree of relationship. The results show that each marking accounts for a large percentage of a corresponding peak latency in the peristimulus-time histogram. For each of the stimuli considered, the latency predicted by the Fourier pattern is highly correlated with the observed latency in the auditory nerve fiber of representative species. The results suggest that the hearing organ analyzes not only the amplitude spectrum but also phase information in Fourier analysis, to distribute the specific spikes among auditory nerve fibers and within a single unit.
    This phase-encoding mechanism in Fourier analysis is proposed to be the common mechanism that, in the face of species differences in peripheral auditory hardware, accounts for the considerable similarities across species in their latency-by-frequency functions, in turn assuring optimal phase encoding across species. The mechanism also has the potential to improve phase encoding of cochlear implants.

    Tone production using inspiratory phonation by Cantonese speakers

    Also available in print. Thesis (B.Sc)--University of Hong Kong, 2008. A dissertation submitted in partial fulfilment of the requirements for the Bachelor of Science (Speech and Hearing Sciences), The University of Hong Kong, June 30, 2008. Includes bibliographical references (p. 27-29).