21 research outputs found
Variation in polar interrogative contours within and between Arabic dialects
Quantitative analysis of fundamental frequency (F0) contours in yes/no-questions and coordinated questions, are compared across eight Arabic dialects, based on scripted role play data from the Intonational Variation in Arabic corpus [1]. Visualisation of the F0 contour of all tokens is used to evaluate how consistently speakers produce a typical contour in each dialect, for each question type. A series of simple Generalised Additive Models (GAM) is used to identify dialects which stand out from others in the realization of one or both question types, as well as groups of dialects which might be further differentiated by more fine-grained analysis
Pitch Accent Trajectories across Different Conditions of Visibility and Information Structure - Evidence from Spontaneous Dyadic Interaction
Wagner P, Bryhadyr N, Schröer M. Pitch Accent Trajectories across Different Conditions of Visibility and Information Structure - Evidence from Spontaneous Dyadic Interaction. In: Proceedings of Interspeech. 2019.Previous research identified a differential contribution of information
structure and the visibility of facial and contextual information
to the acoustic-prosodic expression of pitch accents.
However, it is unclear whether pitch accent shapes are affected
by these conditions as well. To investigate whether varying
context cues have a differentiated impact on pitch accent trajectories
produced in conversational interaction, we modified
the visibility conditions in a spontaneous dyadic interaction
task, i.e. a verbalized version of TicTacToe. Besides varying
visibility, the game task allows for measuring the impact of
information-structure on pitch accent trajectories, differentiating
important and unpredictable game moves. Using GAMMs
on four speaker groups (identified by a cluster analysis), we
could isolate varying strategies of prosodic adaptation to contextual
change. While few speaker groups showed a reaction to
the availability of visible context cues (facial prosody or executed
game moves), all groups differentiated the verbalization
of unpredictable and predictable game moves with a groupspecific
trajectory adaptation. The importance of game moves
resulted in differentiated adaptations in two out of four speaker
groups. The detected strategic trajectory adaptations were characterized
by different characteristics of boundary tones, adaptations
of the global f0-level, or the shape of the corresponding
pitch accent
Speech data acquisition: the underestimated challenge
(This version makes 1 correction to the references: BARBOSA 2012 was cited in the text but missing from the list of references.)International audienceThe second half of the 20th century was the dawn of information technology; and we now live in the digital age. Experimental studies of prosody develop at a fast pace, in the context of an "explosion of evidence" (Janet Pierrehumbert, Speech Prosody 2010, Chicago). The ease with which anyone can now do recordings should not veil the complexity of the data collection process, however. This article aims at sensitizing students and scientists from the various fields of speech and language research to the fact that speech-data acquisition is an underestimated challenge. Eliciting data that reflect the communicative processes at play in language requires special precautions in devising experimental procedures and a fundamental understanding of both ends of the elicitation process: speaker and recording facilities. The article compiles basic information on each of these requirements and recapitulates some pieces of practical advice, drawing many examples from prosody studies, a field where the thoughtful conception of experimental protocols is especially crucial
Recommended from our members
Second language acquisition of intonation: Peak alignment in American English
The objective of the present study was to investigate (1) whether, and to what degree, late bilinguals of different L1 backgrounds are comparable to native speakers in the phonetic implementation of tonal targets in their L2, (2) whether they exhibit general patterns of acquisition irrespective of the typological closeness of their L1 to their L2, and (3) whether learners’ choice of accent contours and the alignment of the high tone (H∗) proceeds in parallel with proficiency in the L2. More specifically, we examined the acquisition of the nuclear contour composition and the H∗ alignment of the American English (L)H∗L- (i.e. pitch accent and boundary tone combination) in initial-stressed and final-stressed words by Japanese and Spanish late bilingual speakers at varying proficiency levels in American English. Our results show that the L1 Spanish speakers were more comparable than the L1 Japanese speakers to the native English speakers in the phonological aspect of intonation (choice of pitch accent contour). In terms of peak alignment, we found that the late bilinguals generally tended to realise significantly later alignment than the native speakers, although the precise manifestation of this varied according to the L1 background of speakers and the stress pattern of words.This research was supported by a PhD scholarship awarded to the first author from the Cambridge Commonwealth Trust and St John's College, University of Cambridge, and by a PhD research support grant from the Japan Foundation. These sources of support are gratefully acknowledged
From communicative functions to prosodic forms
This is a proposal in favour of proceeding from communicative function to linguistic form, rather than the reverse, for an insightful account of how humans communicate by speech in languages. A functional framework is developed that encompasses argumentation structures, declarative and interrogative functions, and expressive intensification. Such a function orientation can become a powerful tool in comparative prosodic research across the world's languages. The potential of this approach is shown by comparing the prosodic form of Mandarin Chinese data collected in functionally contextualized scenarios with corresponding data from English and German
Prosodic detail in Neapolitan Italian
Recent findings on phonetic detail have been taken as supporting exemplar-based approaches to prosody. Through four experiments on both production and perception of both melodic and temporal detail in Neapolitan Italian, we show that prosodic detail is not incompatible with abstractionist approaches either. Specifically, we suggest that the exploration of prosodic detail leads to a refined understanding of the relationships between the richly specified and continuous varying phonetic information on one side, and coarse phonologically structured contrasts on the other, thus offering insights on how pragmatic information is conveyed by prosody
Prosodic detail in Neapolitan Italian
Recent findings on phonetic detail have been taken as supporting exemplar-based approaches to prosody. Through four experiments on both production and perception of both melodic and temporal detail in Neapolitan Italian, we show that prosodic detail is not incompatible with abstractionist approaches either. Specifically, we suggest that the exploration of prosodic detail leads to a refined understanding of the relationships between the richly specified and continuous varying phonetic information on one side, and coarse phonologically structured contrasts on the other, thus offering insights on how pragmatic information is conveyed by prosody
Prosodic detail in Neapolitan Italian
Recent findings on phonetic detail have been taken as supporting exemplar-based approaches to prosody. Through four experiments on both production and perception of both melodic and temporal detail in Neapolitan Italian, we show that prosodic detail is not incompatible with abstractionist approaches either. Specifically, we suggest that the exploration of prosodic detail leads to a refined understanding of the relationships between the richly specified and continuous varying phonetic information on one side, and coarse phonologically structured contrasts on the other, thus offering insights on how pragmatic information is conveyed by prosody
Production, perception and online processing of prominence in the post-focal domain
This dissertation presents a fundamentally new and in-depth investigation of the distribution of prominence in different focal structures in two varieties of Italian (the one spoken in Udine and the one spoken in Bari), by means of the implementation of a categorical analysis with the continuous prosodic parameters related to F0 and periodic energy. Results provide evidence of the fact that prominence in these varieties of Italian is conveyed by both a categorical three-way distinction and a gradual modulation: absence or presence of pitch movement in the distinction between background (post-focal position) and the focal conditions, and a gradual modification of energy and duration. The degree of prominence of words occurring in different focal structures was also investigated in perception. The reportedly different distribution of prominence found in questions for the variety of Italian spoken in Bari is shown to have an influence in the degree of perceived prominence. This influence is found in the comparison between prominence’s ratings of Bari and Udine native speakers, as well as of Bari native speakers and German native speakers, with Italian as L2. Furthermore, the present dissertation tests the real-time processing of the pitch excursion registered in the post-focal region of questions in the Bari variety. Findings confirmed that the fine-grained changes in prominence are processed in real time. Moreover, results indicate that top-down expectations play a crucial role in modulating general cognitive processes. Overall, this thesis supports the view of prosodic prominence as characterised by a bundle of cues, probabilistically distributed in the listener’s perceptual space, which form top-down expectations that play a role both in offline perception and in online processing. Signal-based factors also play a role in perception and online processing, but can however be overridden by expectations
Individual Differences in Speech Production and Perception
Inter-individual variation in speech is a topic of increasing interest both in human sciences and speech technology. It can yield important insights into biological, cognitive, communicative, and social aspects of language. Written by specialists in psycholinguistics, phonetics, speech development, speech perception and speech technology, this volume presents experimental and modeling studies that provide the reader with a deep understanding of interspeaker variability and its role in speech processing, speech development, and interspeaker interactions. It discusses how theoretical models take into account individual behavior, explains why interspeaker variability enriches speech communication, and summarizes the limitations of the use of speaker information in forensics