49 research outputs found
Rhythm Class Perception by Expert Phoneticians
This paper contributes to the recent debate in linguistic-phonetic rhythm research dominated by the idea of a perceptual dichotomy involving âsyllable-timedâ and âstress-timedâ rhythm classes. Some previous studies have shown that it is difficult both to find reliable acoustic correlates of these classes and also to obtain reliable perceptual data for their support.
In an experiment, we asked 12 British English phoneticians to classify the rhythm class of 36 samples spoken by 24 talkers in six dialects of British English. Expert listenersâ perception was shown to be guided by two factors: (1) the assumed rhythm class affiliation of a particular dialect and (2) one acoustic cue related to the prosodic hierarchy, namely the degree of accentual lengthening.
We argue that the rhythm class hypothesis has reached its limits in informing empirical enquiry into linguistic rhythm, and new research avenues are needed to understand this multi-layered phenomenon
Dialectal phonology constrains the phonetics of prominence
Accentual prominence has well-documented effects on various phonetic properties, including timing, vowel quality, amplitude, and pitch. These cues can exist in trading relationships and can differ in magnitude in different languages. Less is understood about how phonetic cues to accentuation surface under different phonological constraints, such as those posed by segmental phonology, aspects of the prosodic hierarchy, and intonational phonology. Dialectal comparisons offer a valuable window on these issues, because dialects of a language share basic aspects of structure and function, but can differ in key segmental and suprasegmental constraints which may affect the cues that realise accentual prominence. We compared the realisation of trochaic words (e.g. cheesy, picky) in accented/unaccented and phrase-final/non-final positions in two dialects of British English, Standard Southern British English, and Standard Scottish English as spoken in Glasgow. We found generally shallower prominence gradients for Glasgow than SSBE with respect to intensity and duration, and very little evidence of accentual lengthening of vowels in Glasgow, compared to robust effects in SSBE. In contrast, phrase-finality had similar effects across the two dialects. The differences observed illustrate how the expression of accentual prominence reflects and reveals the different segmental and intonational systems that operate within dialects of the same language
Then, What is Charisma? The Role of Audio-visual Prosody in L1 and L2 Political Speeches
Charisma plays a significant role in political speeches, and determines the ability of a politician to carry an audience. While acoustic features of charisma have received some empirical attention, the contribution of visual prosody has been mostly neglected in studies focusing on features of a charismatic appearance. Unknown are also the audio-visual cues to charisma in non-native speakers. This small-scale study investigated speeches delivered by Donald Trump (L1 American English) and Arnold Schwarzenegger (L1 Austrian German, L2 American English). Video and audio recordings of their political speeches (around 25 min per speaker) and the transcripts were used. The use of pitch range, speech rate, emphatic stress and hand gestures was analysed. In order to establish the core means of the speakersâ persuasive influence on their audiences, within-speaker comparisons were conducted for phrases with and without cheering from the audiences. The results showed some differences in the use of the audio-visual prosodic features between the L1 and L2 speaker as well as some similarities, and suggest that charisma is not easily attributable to a fixed set of prosodic means but may be best understood as a skillful modulation of audio-visual prosody in social interaction
How Truncating Are âTruncating Languages'? Evidence from Russian and German
Russian and German have been previously been described as âtruncatingâ, or cutting off target frequencies of the phrase-final pitch trajectories when the time available for voicing is compromised. However, supporting evidence is rare and limited to only a few pitch categories. This paper reports a production study conducted to document pitch adjustments to linguistic materials, in which the amount of voicing available for the realization of a pitch pattern varies from relatively long to extremely short. Productions of nuclear H+L*, H* and L*+H pitch accents followed by a low boundary tone were investigated in the two languages. The results of the study show that speakers of both âtruncating languagesââ do not exclusively utilize truncation exclusively when accommodating to different segmental environments. On the contrary, they employ several strategies â among them is truncation but also compression and temporal re-alignment âto produce the target pitch categories under increasing time pressure. Given that speakers can systematically apply all three adjustment strategies to produce some pitch patterns (H* L% in German and Russian) while not using truncation in others (H+L* L% particularly in Russian), we question the effectiveness of the typological classification of these two languages as âtruncatingâ. Moreover, the phonetic detail of truncation varies considerably, both across and within the two languages, indicating that truncation cannot be easily be modeled as a unified phenomenon. The results further suggest that the phrase-final pitch adjustments are crucially sensitive to the phonological composition of the tonal string and the status of a particular tonal event (associated vs. boundary tone), and do not apply to falling vs. rising pitch contours across the board, as previously put forward for German. Implications for the intonational phonology and prosodic typology are addressed in the discussion
What is the fate of Scottish Vowel Length Rule in Glasgow?
This paper studies the longitudinal development of a vowel timing alternation known as the âScottish Vowel Length Ruleâ in a distinctive variety of Scottish English spoken in Glasgow by working-class men and women. Combining apparent-time and real-time evidence, we show that the implementation of the Rule has changed over time, though unlike in many other varieties of Scottish English, the factors shaping its fate seem to be internal rather than external. Overall, Glaswegian English behaves like a quantity language and controls for prosodic timing effects while preserving the phonological timing alternation; and this is despite a marginal, quasi-phonemic status of the Rule
Glasgow Gloom or Leeds Glue? Dialect-Specific Vowel Duration Constrains Lexical Segmentation and Access
Timing cues are important in many aspects of speech processing, fromidentifying segments to locating word and phrase boundaries. They vary across accents, yet representation and processing of this variation are poorly understood. We investigated whether an accent difference in vowel duration affects lexical segmentation and access. In Glasgow English (GE), /i u e o/ are shorter than in Leeds English (LE), especially for /i u/ before voiced stops and nasals. In a word-spotting experiment, GE and LE participants heard nonsense sequences (e.g. pobegloomezh) containing embedded words (gloom, glue), with segmental qualities intermediate between GE and LE. Critical vowel durations were manipulated according to accent (GE-appropriate vowels shorter than LE-appropriate ones) and phonological context (vowels shortest before voiceless stops < voiced stops/nasals < voiced fricatives). GE participants generally spotted words like gloom more accurately with GE-appropriate than LE-appropriate vowels. LE participants were less accurate than GE participants to spot words like gloom with GE-appropriate vowels, but more likely to spot embeddings like glue. These results were broadly as predicted based on the accent differences, but depended less than expected on the accent-specific phonological constraints. We discuss theoretical implications regarding the representation of duration and the time course of lexical access
Structured heterogeneity in Scottish stops over the 20th Century
How and why speakers differ in the phonetic implementation of phonological contrasts, and the relationship of this âstructured heterogeneityâ to language change, has been a key focus over fifty years of variationist sociolinguistics. In phonetics, interest has recently grown in uncovering âstructured variabilityââhow speakers can differ greatly in phonetic realization in nonrandom waysâas part of the long-standing goal of understanding variability in speech. The English stop voicing contrast, which combines extensive phonetic variability with phonological stability, provides an ideal setting for an approach to understanding structured variation in the sounds of a communityâs language that illuminates both synchrony and diachrony. This article examines the voicing contrast in a vernacular dialect (Glasgow Scots) in spontaneous speech, focusing on individual speaker variability within and across cues, including over time. Speakers differ greatly in the use of each of three phonetic cues to the contrast, while reliably using each one to differentiate voiced and voiceless stops. Interspeaker variability is highly structured: speakers lie along a continuum of use of each cue, as well as correlated use of two cuesâvoice onset time and closure voicingâalong a single axis. Diachronic change occurs along this axis, toward a more aspiration-based and less voicing-based phonetic realization of the contrast, suggesting an important connection between synchronic and diachronic speaker variation
Testing an acoustic model of the P-center in English and Japanese
The notion of the âperceptual centerâ or the âP-centerâ has been put forward to account for the repeated finding that acoustic and perceived syllable onsets do not necessarily coincide, at least in the perception of simple monosyllables or disyllables. The magnitude of the discrepancy between acoustics and perceptionâthe location of the P-center in the speech signalâ has proven difficult to estimate, though acoustic models of the effect do exist. The present study asks if the P-center effect can be documented in natural connected speech of English and Japanese and examines if an acoustic model that defines the P-center as the moment of the fastest energy change in a syllabic amplitude envelope adequately reflects the P-center in the two languages. A sensorimotor synchronization paradigm was deployed to address the research questions. The results provide evidence for the existence of the P-center effect in speech of both languages while the acoustic P-center model is found to be less applicable to Japanese. Sensorimotor synchronization patterns further suggest that the P-center may reflect perceptual anticipation of a vowel onset.</p
Changing sounds in a changing city: an acoustic phonetic investigation of real-time change over a century of Glaswegian.
This paper contributes some new findings towards answering these general theoretical questions about real-time sound change and place. Our study exploits the possibilities offered for a longer-term perspective on real-time change by combining archive recordings from the First World War with those from a real- and apparent-time corpus from the 1970s. We consider three aspects of urban Scots, vowel quality and duration, and the realization of word-initial /l/, using acoustic phonetic measures. The real-time comparisons reveal change in progress in all three features. The direction of the changes is intriguing, since despite the substantial geographical and social changes which have taken place across the UK during especially the second half of the 20th century, and the impact of these in terms of contact-induced changes on urban British accents (e.g. Foulkes and Docherty 1999), it appears that linguistic and social factors to do with the dialect and its location have played a stronger role
Structured heterogeneity in Scottish stops over the twentieth 20th century
How and why speakers differ in the phonetic implementation of phonological contrasts, and the relationship of this âstructured heterogeneityâ to language change, has been a key focus over 50 years of variationist sociolinguistics. In phonetics, interest has recently grown in uncovering âstructured variabilityââhow speakers can differ greatly in phonetic realization in non-random waysâas part of the longstanding goal of understanding variability in speech. The English stop voicing contrast, which combines extensive phonetic variability with phonological stability, provides an ideal setting for an approach to understanding structured variation in the sounds of a communityâs language which illuminates both synchrony and diachrony. This paper examines the voicing contrast in a vernacular dialect (Glasgow Scots) in spontaneous speech, focusing on individual speaker variability within and across cues, including over time. Speakers differ greatly in the use of each of three phonetic cues to the contrast, while reliably using each one to differentiate voiced and voiceless stops. Interspeaker variability is highly structured: speakers lie along a continuum of use of each cue, as well as correlated use of two cuesâVOT and closure voicingâalong a single axis. Diachronic change occurs along this axis, towards a more aspiration-based and less voicing-based phonetic realization of the contrast, suggesting an important connection between synchronic and diachronic speaker variation