5,853 research outputs found
The listening talker: A review of human and algorithmic context-induced modifications of speech
International audienceSpeech output technology is finding widespread application, including in scenarios where intelligibility might be compromised - at least for some listeners - by adverse conditions. Unlike most current algorithms, talkers continually adapt their speech patterns as a response to the immediate context of spoken communication, where the type of interlocutor and the environment are the dominant situational factors influencing speech production. Observations of talker behaviour can motivate the design of more robust speech output algorithms. Starting with a listener-oriented categorisation of possible goals for speech modification, this review article summarises the extensive set of behavioural findings related to human speech modification, identifies which factors appear to be beneficial, and goes on to examine previous computational attempts to improve intelligibility in noise. The review concludes by tabulating 46 speech modifications, many of which have yet to be perceptually or algorithmically evaluated. Consequently, the review provides a roadmap for future work in improving the robustness of speech output
Motor Performance in Children with Childhood Apraxia of Speech and Speech Sound Disorders
Purpose: This study sought to determine if (a) children with childhood apraxia of speech (CAS), other speech sound disorders (SSDs), and typical development (TD) would perform differently on a standardized motor assessment and (b) whether comorbid language impairment would impact group differences.
Method: Speech, language, and motor abilities were assessed in children with CAS (n = 10), SSD (n = 16), and TD (n = 14) between the ages of 43 and 105 months. Motor skills were evaluated using the Movement Assessment Battery for Children-Second Edition (Henderson, Sugden, & Barnett, 2007), a behavioral assessment that is sensitive in identifying fine/gross motor impairments in children with a range of motor and learning abilities. Data were reanalyzed after reclassifying children by language ability.
Results: The CAS group performed below the normal limit on all components of the motor assessment and more poorly than the TD and SSD groups on Aiming and Catching and Balance. When children were reclassified by language ability, the comorbid CAS + language impairment group performed worse than the SSD-only and TD groups on Manual Dexterity and Balance and worse than the TD group on Aiming and Catching; all 7 children with CAS + language impairment evidenced performance in the disordered range compared to 1 of 3 children in the CAS-only group and 2 of 6 children in the SSD + language impairment group.
Conclusions: Children with CAS + language impairment appear to be at an increased risk for motor impairments, which may negatively impact social, academic, and vocational outcomes; referrals for motor screenings/assessments should be considered. Findings may suggest a higher order deficit that mediates cognitive-linguistic and motor impairments in this population
Accessibility of referent information influences sentence planning : An eye-tracking study
Acknowledgments We thank Phoebe Ye and Gouming Martens for help with data collection for Experiment 1 and 2, respectively. This research was supported by the European Research Council for the ERC Starting Grant (206198) to YC.Peer reviewedPublisher PD
The impact of animacy and speech rhythm on the word order of conjuncts in German preschoolers and adults
In this study, we investigated the impact of two constraints on the linear order of constituents in German preschool children’s and adults’ speech production: a rhythmic (*LAPSE, militating against sequences of unstressed syllables) and a semantic one (ANIM, requiring animate referents to be named before inanimate ones). Participants were asked to produce coordinated bare noun phrases in response to picture stimuli (e.g., Delfin und Planet, ‘dolphin and planet’) without any predefined word order. Overall, children and adults preferably produced animate items before inanimate ones, confirming findings of Prat-Sala, Shillcock, and Sorace (2000). In the group of preschoolers, the strength of the animacy effect correlated positively with age. Furthermore, the order of the conjuncts was affected by the rhythmic constraint, such that disrhythmic sequences, i.e., stress lapses, were avoided. In both groups, the latter result was significant when the two stimulus pictures did not vary with respect to animacy. In sum, our findings suggest a stronger influence of animacy compared to rhythmic well-formedness on conjunct ordering for German speaking children and adults, in line with findings by McDonald, Bock, and Kelly (1993) who investigated English speaking adults
Speech rhythm: a metaphor?
Is speech rhythmic? In the absence of evidence for a traditional view that languages strive to coordinate either syllables or stress-feet with regular time intervals, we consider the alternative that languages exhibit contrastive rhythm subsisting merely in the alternation of stronger and weaker elements. This is initially plausible, particularly for languages with a steep ‘prominence gradient’, i.e. a large disparity between stronger and weaker elements; but we point out that alternation is poorly achieved even by a ‘stress-timed’ language such as English, and, historically, languages have conspicuously failed to adopt simple phonological remedies that would ensure alternation. Languages seem more concerned to allow ‘syntagmatic contrast’ between successive units and to use durational effects to support linguistic functions than to facilitate rhythm. Furthermore, some languages (e.g. Tamil, Korean) lack the lexical prominence which would most straightforwardly underpin prominence alternation. We conclude that speech is not incontestibly rhythmic, and may even be antirhythmic. However, its linguistic structure and patterning allow the metaphorical extension of rhythm in varying degrees and in different ways depending on the language, and that it is this analogical process which allows speech to be matched to external rhythms
Phonological motivation for the acquisition of onomatopoeia: An analysis of early words
Onomatopoeia are disproportionately high in number in infants’ early words compared to adult language. Studies of infant language perception have proposed an iconic advantage for onomatopoeia, which may make them easier for infants to learn. This study analyses infants’ early word production to show a phonological motivation for onomatopoeia in early acquisition. Cross-linguistic evidence from 16 infants demonstrates how these forms fit within a phonologically-systematic developing lexicon. We observe a predominance of consonant harmony and open cv syllables in infants’ early words—structures that are typical of onomatopoeia across languages. Infants’ acquisition of onomatopoeia is shown to be driven by a preference for structures that are easy to plan and produce. These data present an original perspective on onomatopoeia in early development, highlighting the role of production in language acquisition in general, and onomatopoeic words in particular
Introduction: Multimodal interaction
That human social interaction involves the intertwined cooperation of different modalities is uncontroversial. Researchers in several allied fields have, however, only recently begun to document the precise ways in which talk, gesture, gaze, and aspects of the material surround are brought together to form coherent courses of action. The papers in this volume are attempts to develop this line of inquiry. Although the authors draw on a range of analytic, theoretical, and methodological traditions (conversation analysis, ethnography, distributed cognition, and workplace studies), all are concerned to explore and illuminate the inherently multimodal character of social interaction. Recent studies, including those collected in this volume, suggest that different modalities work together not only to elaborate the semantic content of talk but also to constitute coherent courses of action. In this introduction we present evidence for this position. We begin by reviewing some select literature focusing primarily on communicative functions and interactive organizations of specific modalities before turning to consider the integration of distinct modalities in interaction
Recommended from our members
Expansion of prosodic abilities at the transition from babble to words: a comparison between children with cochlear implants and normally hearing children
Objectives: This longitudinal study examined the impact of emerging vocabulary production on the ability to produce the phonetic cues to prosodic prominence in babbled and lexical disyllables of infants with Cochlear Implants (CI) and normally hearing infants (NH). Current research on typical language acquisition emphasizes the importance of vocabulary development for phonological and phonetic acquisition. Children with cochlear implants (CI) experience significant difficulties with the perception and production of prosody, and the role of possible top-down effects is therefore particularly relevant for this population.
Design: Isolated disyllabic babble and first words were identified and segmented in longitudinal audio-video recordings and transcriptions for 9 NH infants and 9 infants with CI interacting with their parents. Monthly recordings were included from the onset of babbling until children had reached a cumulative vocabulary of 200 words. Three cues to prosodic prominence, F0, intensity and duration, were measured in the vocalic portions of stand-alone disyllables. In order to represent the degree of prosodic differentiation between two syllables in an utterance, the raw values for intensity and duration were transformed to ratios, and for f0 a measure of the perceptual distance in semitones was derived. The degree of prosodic differentiation for disyllabic babble and words for each cue was compared between groups. In addition, group and individual tendencies on the types of stress patterns for babble and words were also examined.
Results: The CI group had overall smaller pitch and intensity distances than the NH group. For the NH group, words had greater pitch and intensity distances than babbled disyllables. Especially for pitch distance, this was accompanied by a shift towards a more clearly expressed stress pattern that reflected the influence of the ambient language. For the CI group, the same expansion in words did not take place for pitch. For intensity, the CI group gave evidence of some increase of prosodic differentiation. The results for the duration measure showed evidence of utterance-final lengthening in both groups. In words, the CI group significantly reduced durational differences between syllables so that a more even-timed, less differentiated pattern emerged.
Conclusions: The onset of vocabulary production did not have the same facilitatory effect for the CI infants on the production of phonetic cues for prosody, especially for pitch. It was argued that the results for duration may reflect greater articulatory difficulties in words for the CI group than the NH group. It was suggested that the lack of clear top-down effects of the vocabulary in the CI group may be due to a lag in development caused by an initial lack of auditory stimulation, possibly compounded by the absence of auditory feedback during the babble phase
- …