710 research outputs found

    The relation between acoustic and articulatory variation in vowels : data from American and Australian English

    Get PDF
    In studies of dialect variation, the articulatory nature of vowels is sometimes inferred from formant values using the following heuristic: F1 is inversely correlated with tongue height and F2 is inversely correlated with tongue backness. This study compared vowel formants and corresponding lingual articulation in two dialects of English, standard North American English and Australian English. Five speakers of North American English and four speakers of Australian English were recorded producing multiple repetitions of ten monophthongs embedded in the /sVd/ context. Simultaneous articulatory data were collected using electromagnetic articulography. Results show that there are significant correlations between tongue position and formants in the direction predicted by the heuristic but also that the relations implied by the heuristic break down under specific conditions. Articulatory vowel spaces, based on tongue dorsum (TD) position, and acoustic vowel spaces, based on formants, show systematic misalignment due in part to the influence of other articulatory factors, including lip rounding and tongue curvature on formant values. Incorporating these dimensions into our dialect comparison yields a richer description and a more robust understanding of how vowel formant patterns are reproduced within and across dialects

    Vowel reduction and loss: challenges and perspectives

    Get PDF
    This introduction gives an overview of a workshop on vowel reduction and loss held at SLE 2017 and the resulting papers collected here. It also discusses the present state of research on vowel reduction and loss in a number of perspectives and outlines the main themes dealt with throughout the course of this special issue

    Dynamic Formant Trajectories in German Read Speech : Impact of Predictability and Prominence

    Get PDF
    Phonetic structures expand temporally and spectrally when they are difficult to predict from their context. To some extent, effects of predictability are modulated by prosodic structure. So far, studies on the impact of contextual predictability and prosody on phonetic structures have neglected the dynamic nature of the speech signal. This study investigates the impact of predictability and prominence on the dynamic structure of the first and second formants of German vowels. We expect to find differences in the formant movements between vowels standing in different predictability contexts and a modulation of this effect by prominence. First and second formant values are extracted from a large German corpus. Formant trajectories of peripheral vowels are modeled using generalized additive mixed models, which estimate nonlinear regressions between a dependent variable and predictors. Contextual predictability is measured as biphone and triphone surprisal based on a statistical German language model. We test for the effects of the information-theoretic measures surprisal and word frequency, as well as prominence, on formant movement, while controlling for vowel phonemes and duration. Primary lexical stress and vowel phonemes are significant predictors of first and second formant trajectory shape. We replicate previous findings that vowels are more dispersed in stressed syllables than in unstressed syllables. The interaction of stress and surprisal explains formant movement: unstressed vowels show more variability in their formant trajectory shape at different surprisal levels than stressed vowels. This work shows that effects of contextual predictability on fine phonetic detail can be observed not only in pointwise measures but also in dynamic features of phonetic segments

    The effect of coarticulatory resistance and aerodynamic requirements of consonants on syllable organization in Polish

    Get PDF

    Modelling the effects of speech rate variation for automatic speech recognition

    Get PDF
    Wrede B. Modelling the effects of speech rate variation for automatic speech recognition. Bielefeld (Germany): Bielefeld University; 2002.In automatic speech recognition it is a widely observed phenomenon that variations in speech rate cause severe degradations of the speech recognition performance. This is due to the fact that standard stochastic based speech recognition systems specialise on average speech rate. Although many approaches to modelling speech rate variation have been made, an integrated approach in a substantial system still has be to developed. General approaches to rate modelling are based on rate dependent models which are trained with rate specific subsets of the training data. During decoding a signal based rate estimation is performed according to which the set of rate dependent models is selected. While such approaches are able to reduce the word error rate significantly, they suffer from shortcomings such as the reduction of training data and the expensive training and decoding procedure. However, phonetic investigations show that there is a systematic relationship between speech rate and the acoustic characteristics of speech. In fast speech a tendency of reduction can be observed which can be described in more detail as a centralisation effect and an increase in coarticulation. Centralisation means that the formant frequencies of vowels tend to shift towards the vowel space center while increased coarticulation denotes the tendency of the spectral features of a vowel to shift towards those of its phonemic neighbour. The goal of this work is to investigate the possibility to incorporate the knowledge of the systematic nature of the influence of speech rate variation on the acoustic features in speech rate modelling. In an acoustic-phonetic analysis of a large corpus of spontaneous speech it was shown that an increased degree of the two effects of centralisation and coarticulation can be found in fast speech. Several measures for these effects were developed and used in speech recognition experiments with rate dependent models. A thorough investigation of rate dependent models showed that with duration and coarticulation based measures significant increases of the performance could be achieved. It was shown that by the use of different measures the models were adapted either to centralisation or coarticulation. Further experiments showed that by a more detailed modelling with more rate classes a further improvement can be achieved. It was also observed that a general basis for the models is needed before rate adaptation can be performed. In a comparison to other sources of acoustic variation it was shown that the effects of speech rate are as severe as those of speaker variation and environmental noise. All these results show that for a more substantial system that models rate variations accurately it is necessary to focus on both, durational and spectral effects. The systematic nature of the effects indicates that a continuous modelling is possible

    Dutch and German 3-year-olds’ representations of voicing alternations

    Get PDF
    The voicing contrast is neutralised syllable and word finally in Dutch and German, leading to alternations within the morphological paradigm (e.g. Dutch ‘bed(s)’, be[t] be[d]en, German ‘dog(s)’, Hun[t]-Hun[d]e). Despite structural similarity, language-specific morphological, phonological and lexical properties impact on the distribution of this alternation in the two languages. Previous acquisition research has focused on one language only, predominantly focusing on children’s production accuracy, concluding that alternations are not acquired until late in the acquisition process in either language. This paper adapts a perceptual method to investigate how voicing alternations are represented in the mental lexicon of Dutch and German 3-year-olds. Sensitivity to mispronunciations of voicing word-medially in plural forms was measured using a visual fixation procedure. Dutch children exhibited evidence of overgeneralising the voicing alternation, whereas German children consistently preferred the correct pronunciation to mispronunciations. Results indicate that the acquisition of voicing alternations is influenced by language-specific factors beyond the alternation itself

    Individual Differences in Speech Production and Perception

    Get PDF
    Inter-individual variation in speech is a topic of increasing interest both in human sciences and speech technology. It can yield important insights into biological, cognitive, communicative, and social aspects of language. Written by specialists in psycholinguistics, phonetics, speech development, speech perception and speech technology, this volume presents experimental and modeling studies that provide the reader with a deep understanding of interspeaker variability and its role in speech processing, speech development, and interspeaker interactions. It discusses how theoretical models take into account individual behavior, explains why interspeaker variability enriches speech communication, and summarizes the limitations of the use of speaker information in forensics

    Back from the future:Nonlinear anticipation in adults and children's speech

    Get PDF
    Purpose: This study examines the temporal organization of vocalic anticipation in German children from 3 to 7 years of age and adults. The main objective was to test for non-linearprocesses in vocalic anticipation, which may result from the interaction between lingualgestural goalsfor individual vowels, and those for their neighbors over time. Method: The technique of ultrasound imaging was employed to record tongue movement at fivetimepoints throughout short utterances of the form V1#CV2. Vocalic anticipation was examined with Generalized Additive Modeling, an analytical approach allowing forthe estimation of both linear and non-linearinfluences on anticipatoryprocesses. Results: both adults and children exhibit non-linear patterns of vocalic anticipation over time with the degree and extent of vocalic anticipation varying as a function of the individual consonants and vowels assembled. However, noticeable developmental discrepancieswere found with vocalic anticipation being present earlier in children ́sutterances at 3-4-5 years of agein comparison to adults and to some extent 7-year-old children.Conclusions: Anarrowing of speech production organization from large chunks in kindergarten to more contextually-specified organizationsseems to occur fromkindergarten toprimary school toadulthood, although variation in the temporal overlap of lingual gestures for consecutive segments is already present in the youngestcohorts. In adults, non-linear anticipatory patterns over time suggest a strong differentiation between the gestural goals for consecutive segments. In children, this differentiation is not yet mature: vowelsshow greater prominence over time and seem activated more in-phase with those of previous segments relative to adults
    corecore