Learning [Voice]
The [voice] distinction between homorganic stops and fricatives is made by a number of acoustic correlates including voicing, segment duration, and preceding vowel duration. The present work looks at [voice] from a number of multidimensional perspectives.
This dissertation's focus is a corpus study of the phonetic realization of [voice] in two English-learning infants aged 1;1--3;5. While preceding vowel duration has been studied before in infants, the other correlates of post-vocalic voicing investigated here (preceding F1, consonant duration, and closure voicing intensity) had not been measured before in infant speech. The study makes empirical contributions regarding the development of the production of [voice] in infants, not just from a surface-level perspective but also with implications for the phonetics-phonology interface in the adult and developing linguistic systems. Additionally, several methodological contributions are made in the use of large corpora and data modeling techniques.
The study revealed that even in infants, F1 at the midpoint of a vowel preceding a voiced consonant was lower by roughly 50 Hz compared to a vowel before a voiceless consonant, which is in line with the effect found in adults. But while the effect has been considered most likely to be a physiological and nonlinguistic phenomenon in adults, it actually appeared to be correlated in the wrong direction with other aspects of [voice] here, casting doubt on a physiological explanation. Some of the consonant pairs had statistically significant differences in duration and closure voicing. Additionally, a preceding vowel duration difference was found, along with a preliminary indication of a developmental trend suggesting that the preceding vowel duration difference is being learned.
The phonetics of adult speech is also considered. Results are presented from a dialectal corpus study of North American English and from a lab speech experiment that clarifies the relationship between preceding vowel duration and flapping, and the relationship between [voice] and F1 in preceding vowels. Fluent adult speech is also described, and machine learning algorithms are applied to learning the [voice] distinction using multidimensional acoustic input plus some lexical knowledge.
Speaking Rate Effects on Locus Equation Slope
A locus equation describes a first-order regression fit to a scatter of vowel steady-state frequency values predicting vowel onset frequency values. Locus equation coefficients are often interpreted as indices of coarticulation. Speaking rate variations with a constant consonant-vowel form are thought to induce changes in the degree of coarticulation. In the current work, the hypothesis that locus slope is a transparent index of coarticulation is examined through the analysis of acoustic samples of large-scale, nearly continuous variations in speaking rate. Following the methodological conventions for locus equation derivation, data pooled across ten vowels yield locus equation slopes that are mostly consistent with the hypothesis that locus equations vary systematically with coarticulation. Comparable analyses between different four-vowel pools reveal variations in the locus slope range and changes in locus slope sensitivity to rate change. Analyses across rate but within vowels are substantially less consistent with the locus hypothesis. Taken together, these findings suggest that the practice of vowel pooling exerts a non-negligible influence on locus outcomes. Results are discussed within the context of articulatory accounts of locus equations and the effects of speaking rate change.
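As a rough illustration of the regression described in this abstract, the sketch below fits a straight line that predicts vowel onset frequency from vowel steady-state frequency by ordinary least squares. The function name `fit_locus_equation` and the sample values are hypothetical, and the assumption that the frequencies are F2 measurements in Hz is ours; the abstract does not specify the formant or the fitting procedure actually used.

```python
def fit_locus_equation(steady_state_hz, onset_hz):
    """Least-squares fit of: onset = slope * steady_state + intercept.

    Both arguments are equal-length sequences of frequencies in Hz
    (assumed here to be F2 values; that is an assumption, not something
    stated in the abstract). Returns (slope, intercept).
    """
    n = len(steady_state_hz)
    if n != len(onset_hz) or n < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")

    mean_x = sum(steady_state_hz) / n
    mean_y = sum(onset_hz) / n

    # Sum of squares of x and cross-products of x and y about the means.
    sxx = sum((x - mean_x) ** 2 for x in steady_state_hz)
    if sxx == 0:
        raise ValueError("steady-state values must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y)
              for x, y in zip(steady_state_hz, onset_hz))

    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up illustrative values, not data from the study.
steady_state = [1000.0, 1500.0, 2000.0, 2500.0]
onset = [1300.0, 1600.0, 1900.0, 2200.0]
slope, intercept = fit_locus_equation(steady_state, onset)
print(f"locus slope = {slope:.2f}, intercept = {intercept:.0f} Hz")
```

Under the interpretation the abstract examines, a slope nearer 1 would be read as more coarticulation (onset tracks the vowel) and a slope nearer 0 as less. Fitting the same function to different vowel pools is one way to see how pooling can shift the slope.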
Jaw and Order
It is well-accepted that the jaw plays an active role in influencing vowel height. The general aim of the current study is to further investigate the extent to which the jaw is active in producing consonantal distinctions, with specific focus on coronal consonants. Therefore, tongue tip and jaw positions are compared for the German coronal consonants /s, ʃ, t, d, n, l/, that is, consonants having the same active articulators (apical/laminal) but differing in manner of articulation. In order to test the stability of articulatory positions for each of these coronal consonants, a natural perturbation paradigm was introduced by recording two levels of vocal effort: comfortable, and loud without shouting. Tongue and jaw movements of five speakers of German were recorded by means of EMMA during /aCa/ sequences. By analyzing the tongue tip and jaw positions and their spatial variability we found that (1) the jaw's contribution to these consonants varies with manner of articulation, and (2) for all coronal consonants the positions are stable across loudness conditions except for those of the nasal. Results are discussed with respect to the tasks of the jaw, and the possible articulatory adjustments that may accompany louder speech.
Neural Modeling and Imaging of the Cortical Interactions Underlying Syllable Production
This paper describes a neural model of speech acquisition and production that accounts for a wide range of acoustic, kinematic, and neuroimaging data concerning the control of speech movements. The model is a neural network whose components correspond to regions of the cerebral cortex and cerebellum, including premotor, motor, auditory, and somatosensory cortical areas. Computer simulations of the model verify its ability to account for compensation to lip and jaw perturbations during speech. Specific anatomical locations of the model's components are estimated, and these estimates are used to simulate fMRI experiments of simple syllable production with and without jaw perturbations. National Institute on Deafness and Other Communication Disorders (R01 DC02852, R01 DC01925)
Experimental phonetic study of the timing of voicing in English obstruents
The treatment given to the timing of voicing in three areas of phonetic
research -- phonetic taxonomy, speech production modelling, and speech
synthesis -- is considered in the light of an acoustic study of the timing of
voicing in British English obstruents. In each case, it is found to be deficient.
The underlying cause is the difficulty in applying a rigid segmental approach to
an aspect of speech production characterised by important inter-articulator
asynchronies, coupled to the limited quantitative data available concerning the
systematic properties of the timing of voicing in languages.
It is argued that the categories and labels used to describe the timing of
voicing in obstruents are inadequate for fulfilling the descriptive goals of
phonetic theory. One possible alternative descriptive strategy is proposed,
based on incorporating aspects of the parametric organisation of speech into
the descriptive framework. Within the domain of speech production modelling,
no satisfactory account has been given of fine-grained variability of the timing
of voicing not capable of explanation in terms of general properties of motor
programming and utterance execution. The experimental results support claims
in the literature that the phonetic control of an utterance may be somewhat
less abstract than has been suggested in some previous reports. A schematic
outline is given of one way in which the timing of voicing could be controlled
in speech production. The success of a speech synthesis-by-rule system
depends to a great extent on a comprehensive encoding of the systematic
phonetic characteristics of the target language. Only limited success has been
achieved in the past thirty years. A set of rules is proposed for generating
more naturalistic patterns of voicing in obstruents, reflecting those observed in
the experimental component of this study. Consideration is given to strategies
for evaluating the effect of fine-grained phonetic rules in speech synthesis.
Structure in mind, structure in vocal tract
We update our understanding of the view that grammar regulates inter-segmental temporal coordination and present an extension of that view to a new domain: we argue that inter-segmental coordination is basic to prosody. It is the glue joining segments together differently in different languages (here, illustrated with examples from Arabic and Spanish) and orchestrates their unfolding in ways corresponding to constructs posited in theoretical analysis. The correspondence is one between organization in mind-brain and organization in vocal tract. Moreover, for both mind-brain and vocal tract, the organization is phonological and abstract. It is so because it holds over segments of various identities: in Arabic, the first segment in /bka/ is not prosodified as part of the same unit as /ka/ and this holds true also for /blat/, /klat/ and so on, regardless of sonority. In contrast, in English or Spanish, a different organization holds. Crucially, uniformity in organization (same organization presiding over sequences with varying segmental makeup) does not imply uniqueness of phonetic exponents: prosodic organization is pleiotropic, simultaneously expressed by more than one phonetic exponent. Finally, two properties of coordination relations are underscored: lawful flexibility and abstractness. The first is revealed in the degrees of freedom with which movements corresponding to any given effector begin; the second in invariances of task-relevant kinematic signatures regardless of the effectors implicated in any given segmental sequence. Once again, abstract phonological structure is mirrored in vocal tracts via coordination relations holding across physiology and the particular modes of its operation.