4,397 research outputs found

    Metrical structure and the perception of time-compressed speech

    Get PDF
    In the absence of explicitly marked cues to word boundaries, listeners tend to segment spoken English at the onset of strong syllables. This may suggest that under difficult listening conditions, speech should be easier to recognize where strong syllables are word-initial. We report two experiments in which listeners were presented with sentences which had been time-compressed to make listening difficult. The first study contrasted sentences in which all content words began with strong syllables with sentences in which all content words began with weak syllables. The intelligibility of the two groups of sentences did not differ significantly. Apparent rhythmic effects in the results prompted a second experiment; however, no significant effects of systematic rhythmic manipulation were observed. In both experiments, the strongest predictor of intelligibility was the rated plausibility of the sentences. We conclude that listeners' recognition responses to time-compressed speech may be strongly subject to experiential bias; effects of rhythmic structure are most likely to show up also as bias effects

    Continuous-speech segmentation at the beginning of language acquisition: Electrophysiological evidence

    Get PDF
    Word segmentation, or detecting word boundaries in continuous speech, is not an easy task. Spoken language does not contain silences to indicate word boundaries and words partly overlap due to coarticalution. Still, adults listening to their native language perceive speech as individual words. They are able to combine different distributional cues in the language, such as the statistical distribution of sounds and metrical cues, with lexical information, to efficiently detect word boundaries. Infants in the first year of life do not command these cues. However, already between seven and ten months of age, before they know word meaning, infants learn to segment words from speech. This important step in language acquisition is the topic of this dissertation. In chapter 2, the first Event Related Brain Potential (ERP) study on word segmentation in Dutch ten-month-olds is discussed. The results show that ten-month-olds can already segment words with a strong-weak stress pattern from speech and they need roughly the first half of a word to do so. Chapter 3 deals with segmentation of words beginning with a weak syllable, as a considerable number of words in Dutch do not follow the predominant strong-weak stress pattern. The results show that ten-month-olds still largely rely on the strong syllable in the language, and do not show an ERP response to the initial weak syllable. In chapter 4, seven-month-old infants' segmentation of strong-weak words was studied. An ERP response was found to strong-weak words presented in sentences. However, a behavioral response was not found in an additional Headturn Preference Procedure study. There results suggest that the ERP response is a precursor to the behavioral response that infants show at a later age

    The Effect of IP Constituent Position and Foot Complexity on Timing in Polish Learner's English Pronunciation

    Get PDF
    A comparison of native and Polish learners' performance shows similar durations of stressed and pitch accented syllables. The unstressed syllables and syllable clusters, on the other hand, are significantly longer in non-native speech, and the discrepancies increase at lower phrasal prominence levels, especially in the preheads. Similar results for both groups have been obtained with respect to the number of consecutive unstressed syllables (foot complexity). The same test repeated after seven months of pronunciation training reveals a considerable tendency towards native speech timing, although the differences concerning low prominence levels remain significant

    Sensitivity to speech rhythm explains individual differences in reading ability independently of phonological awareness

    Get PDF
    This study considered whether sensitivity to speech rhythm can predict concurrent variance in reading attainment after individual differences in age, vocabulary and phonological awareness have been controlled. Five to six-year-old English-speaking children completed a battery of phonological processing assessments and reading assessments, along with a simple word stress manipulation task. The results showed that performance on the stress manipulation measure predicted a significant amount of variance in reading attainment after age, vocabulary, and phonological processing had been taken into account. These results suggest that stress sensitivity is an important, yet neglected aspect of English-speaking children?s phonological representations, which needs to be incorporated into theoretical accounts of reading development

    An acoustic investigation of the developmental trajectory of lexical stress contrastivity in Italian

    Get PDF
    We examined whether typically developing Italian children exhibit adult-like stress contrastivity for word productions elicited via a picture naming task (n=25 children aged 3\u20135 years and 27 adults). Stimuli were 10 trisyllabic Italian words; half began with a weak\u2013strong (WS) pattern of lexical stress across the initial 2 syllables, as in patata, while the other half began with a strong\u2013weak (SW) pattern, as in gomito. Word productions that were identified as correct via perceptual judgement were analysed acoustically. The initial 2 syllables of each correct word production were analysed in terms of the duration, peak intensity, and peak fundamental frequency of the vowels using a relative measure of contrast\u2014the normalised pairwise variability index (PVI). Results across the majority of measures showed that children\u2019s stress contrastivity was adult-like. However, the data revealed that children\u2019s contrastivity for trisyllabic words beginning with a WS pattern was not adult-like regarding the PVI for vowel duration: children showed less contrastivity than adults. This effect appeared to be driven by differences in word-medial gemination between children and adults. Results are compared with data from a recent acoustic study of stress contrastivity in English speaking children and adults and discussed in relation to language-specific and physiological motor-speech constraints on production

    EEG Correlates of Song Prosody: A New Look at the Relationship between Linguistic and Musical Rhythm

    Get PDF
    Song composers incorporate linguistic prosody into their music when setting words to melody, a process called “textsetting.” Composers tend to align the expected stress of the lyrics with strong metrical positions in the music. The present study was designed to explore the idea that temporal alignment helps listeners to better understand song lyrics by directing listeners’ attention to instances where strong syllables occur on strong beats. Three types of textsettings were created by aligning metronome clicks with all, some or none of the strong syllables in sung sentences. Electroencephalographic recordings were taken while participants listened to the sung sentences (primes) and performed a lexical decision task on subsequent words and pseudowords (targets, presented visually). Comparison of misaligned and well-aligned sentences showed that temporal alignment between strong/weak syllables and strong/weak musical beats were associated with modulations of induced beta and evoked gamma power, which have been shown to fluctuate with rhythmic expectancies. Furthermore, targets that followed well-aligned primes elicited greater induced alpha and beta activity, and better lexical decision task performance, compared with targets that followed misaligned and varied sentences. Overall, these findings suggest that alignment of linguistic stress and musical meter in song enhances musical beat tracking and comprehension of lyrics by synchronizing neural activity with strong syllables. This approach may begin to explain the mechanisms underlying the relationship between linguistic and musical rhythm in songs, and how rhythmic attending facilitates learning and recall of song lyrics. Moreover, the observations reported here coincide with a growing number of studies reporting interactions between the linguistic and musical dimensions of song, which likely stem from shared neural resources for processing music and speech

    Logopenic and nonfluent variants of primary progressive aphasia are differentiated by acoustic measures of speech production

    Get PDF
    Differentiation of logopenic (lvPPA) and nonfluent/agrammatic (nfvPPA) variants of Primary Progressive Aphasia is important yet remains challenging since it hinges on expert based evaluation of speech and language production. In this study acoustic measures of speech in conjunction with voxel-based morphometry were used to determine the success of the measures as an adjunct to diagnosis and to explore the neural basis of apraxia of speech in nfvPPA. Forty-one patients (21 lvPPA, 20 nfvPPA) were recruited from a consecutive sample with suspected frontotemporal dementia. Patients were diagnosed using the current gold-standard of expert perceptual judgment, based on presence/absence of particular speech features during speaking tasks. Seventeen healthy age-matched adults served as controls. MRI scans were available for 11 control and 37 PPA cases; 23 of the PPA cases underwent amyloid ligand PET imaging. Measures, corresponding to perceptual features of apraxia of speech, were periods of silence during reading and relative vowel duration and intensity in polysyllable word repetition. Discriminant function analyses revealed that a measure of relative vowel duration differentiated nfvPPA cases from both control and lvPPA cases (r2 = 0.47) with 88% agreement with expert judgment of presence of apraxia of speech in nfvPPA cases. VBM analysis showed that relative vowel duration covaried with grey matter intensity in areas critical for speech motor planning and programming: precentral gyrus, supplementary motor area and inferior frontal gyrus bilaterally, only affected in the nfvPPA group. This bilateral involvement of frontal speech networks in nfvPPA potentially affects access to compensatory mechanisms involving right hemisphere homologues. Measures of silences during reading also discriminated the PPA and control groups, but did not increase predictive accuracy. Findings suggest that a measure of relative vowel duration from of a polysyllable word repetition task may be sufficient for detecting most cases of apraxia of speech and distinguishing between nfvPPA and lvPPA

    Rate Effects on German Unstressed Syllables

    Get PDF
    German is characterized by the rhythmic alternation of strong and weak syllables. Weak syllables contain short or reduced vowels like schwa. In some instances, the unstressed weak syllable nucleus can be the only difference between words that underlyingly contain a consonant cluster. Examples in German are Kannen 'cans, pitchers' contrasting with kann 'can (V)' or beraten 'to advise' contrasting with braten 'to fry'. In some instances, in a faster rate of speech for example, weakening of the unstressed syllable nucleus is observed which can eventually result in the neutralization between such pairs of words. In slower speech, one might find an opposite effect, that is the appearance of vocalic traces between the members of an underlying consonant cluster. This transition vowel can perceptually cause a confusion in these "minimal pairs". Based on acoustic measurements, I will argue that gestural reorganization can best account for both of these rate effects found in German

    Pauses and the temporal structure of speech

    Get PDF
    Natural-sounding speech synthesis requires close control over the temporal structure of the speech flow. This includes a full predictive scheme for the durational structure and in particuliar the prolongation of final syllables of lexemes as well as for the pausal structure in the utterance. In this chapter, a description of the temporal structure and the summary of the numerous factors that modify it are presented. In the second part, predictive schemes for the temporal structure of speech ("performance structures") are introduced, and their potential for characterising the overall prosodic structure of speech is demonstrated
    • 

    corecore