159 research outputs found

    Analyzing Prosody with Legendre Polynomial Coefficients

    Full text link
    This investigation demonstrates the effectiveness of Legendre polynomial coefficients representing prosodic contours within the context of two different tasks: nativeness classification and sarcasm detection. By making use of accurate representations of prosodic contours to answer fundamental linguistic questions, we contribute significantly to the body of research focused on analyzing prosody in linguistics as well as modeling prosody for machine learning tasks. Using Legendre polynomial coefficient representations of prosodic contours, we answer prosodic questions about differences in prosody between native English speakers and non-native English speakers whose first language is Mandarin. We also learn more about prosodic qualities of sarcastic speech. We additionally perform machine learning classification for both tasks, (achieving an accuracy of 72.3% for nativeness classification, and achieving 81.57% for sarcasm detection). We recommend that linguists looking to analyze prosodic contours make use of Legendre polynomial coefficients modeling; the accuracy and quality of the resulting prosodic contour representations makes them highly interpretable for linguistic analysis

    Tonal placement in Tashlhiyt: How an intonation system accommodates to adverse phonological environments

    Get PDF
    In most languages, words contain vowels, elements of high intensity with rich harmonic structure, enabling the  perceptual retrieval of pitch. By contrast, in Tashlhiyt, a Berber language, words can be composed entirely of voiceless segments. When an utterance consists of such words, the phonetic opportunity for the execution of intonational pitch movements is exceptionally limited. This book explores in a series of production and perception experiments how these typologically rare phonotactic patterns interact with intonational aspects of linguistic structure. It turns out that Tashlhiyt allows for a tremendously flexible placement of tonal events. Observed intonational structures can be conceived of as different solutions to a functional dilemma: The requirement to realise meaningful pitch movements in certain positions and the extent to which segments lend themselves to a clear manifestation of these pitch movements

    Tonal placement in Tashlhiyt: How an intonation system accommodates to adverse phonological environments

    Get PDF
    In most languages, words contain vowels, elements of high intensity with rich harmonic structure, enabling the  perceptual retrieval of pitch. By contrast, in Tashlhiyt, a Berber language, words can be composed entirely of voiceless segments. When an utterance consists of such words, the phonetic opportunity for the execution of intonational pitch movements is exceptionally limited. This book explores in a series of production and perception experiments how these typologically rare phonotactic patterns interact with intonational aspects of linguistic structure. It turns out that Tashlhiyt allows for a tremendously flexible placement of tonal events. Observed intonational structures can be conceived of as different solutions to a functional dilemma: The requirement to realise meaningful pitch movements in certain positions and the extent to which segments lend themselves to a clear manifestation of these pitch movements

    Tonal placement in Tashlhiyt: How an intonation system accommodates to adverse phonological environments

    Get PDF
    In most languages, words contain vowels, elements of high intensity with rich harmonic structure, enabling the  perceptual retrieval of pitch. By contrast, in Tashlhiyt, a Berber language, words can be composed entirely of voiceless segments. When an utterance consists of such words, the phonetic opportunity for the execution of intonational pitch movements is exceptionally limited. This book explores in a series of production and perception experiments how these typologically rare phonotactic patterns interact with intonational aspects of linguistic structure. It turns out that Tashlhiyt allows for a tremendously flexible placement of tonal events. Observed intonational structures can be conceived of as different solutions to a functional dilemma: The requirement to realise meaningful pitch movements in certain positions and the extent to which segments lend themselves to a clear manifestation of these pitch movements

    Are tones aligned with articulatory events? Evidence from Italian and French

    No full text
    International audienceTonal alignment work has suggested that the temporal location of tonal targets relative to segmental "anchors" might be governed by principles of synchrony and stability (Arvaniti et al 1998, Ladd et al. 1999, inter alia). However, a number of discrepancies have emerged in the cross-linguistic study of alignment. For instance, despite some regularities in the alignment of L targets (Caspers and van Heuven 1993; Prieto et al. 1995), the alignment of H targets appears to be quite controversial. In fact, it is sometimes difficult to find definite segmental landmarks to which such targets might be aligned. Also, most of the alignment proposals so far inherently assume that if some anchors for tonal alignment do exist they must be acoustic in nature. A plausible alternative would be to assume that such anchors are primarily articulatory, which would explain why in some cases the underlying regularities would be masked. Hence, we adopt a new experimental paradigm for alignment research in which articulatory measures are performed simultaneously with acoustic measures. In order to test the constant alignment hypothesis, a preliminary study (D'Imperio et al. 2003) was conducted in which various latency measures, both acoustically and articulatorily based, were analyzed. Specifically, the kinematics of OPTOTRAK markers attached to the speaker's upper and lower lip was tracked over time during the production of the corpus sentences. The melodic target considered is the H tone of LH nuclear rises in Neapolitan Italian. In this variety, yes/no question LH rises are systematically later than (narrow focus) statement LH rises (D'Imperio 2000, 2001, 2002; D'Imperio and House 1997). In order to test the hypothesis of constant anchoring of H targets, the materials were produced with two different rates of speech, i.e. normal and fast. Summarizing the results, H targets of nuclear rises in Neapolitan statements and questions appear to be more closely phased with the articulatory dimension of between-lip distance than with two of the most commonly employed acoustic segmental landmarks for tonal alignment (i.e., onset and offset of stressed vowel). Statement H tones are phased with maximum between-lip distance within the stressed syllable. Note that this location does not correspond to any identifiable segmental boundary, acoustic event or phonological unit, and does not overlap with RMS peak amplitude. In fact, RMS peaks were generally much earlier than articulatory peaks, hence further away from H peaks. This calls for the collection and analysis of more articulatory data (especially jaw and tongue movements) to shed light on tonal alignment issues.In a second study, a French corpus was collected on the basis of the alignment contrast found by Welby (2003, in press). Welby's results show that listeners use the alignment of the initial rise (LHi) in French Accentual Phrases as a cue to speech segmentation. Specifically, listeners exploit the presence of an early rise to demarcate the beginning of a content word. In the present study, a corpus was built with a set of utterances displaying this specific alignment contrast. The kinematics of 10 pellets (8 on the face and tongue, 2 references) was tracked over time using an electromagnetometer (EMA, Carstens). The phasing of several articulatory events relative to the L and H part of the early rise were examined. The preliminary results seem to point to some kind of fine alignment specification for the L and H target. Specifically, we hypothesize that tonal target commands of Neapolitan as well as French rises are phased with commands of the supralaryngeal articulator involved to produce the segments to which the tone is associated. Regarding the word segmentation issue for French, it is important to study alignment in a diachronic perspective since we know of case of speech segmentation errors that can lead to lexical reinterpretation and change (l'abondance "abundance" > la bondance, from Welby 2003). We also take these results to suggest that not all rises align in the same way with the associated syllable. Though the role of articulatory constraints is important, the exact phasing properties of prosodic events are language-specific. Since prosody has recently become the realm of investigation of the Task Dynamics program (Byrd and Saltzman 2003), our alignment work will be cast under such a perspective

    Jeddah Arabic intonation : an autosegmental-metrical approach

    Get PDF
    IPhD ThesisThis thesis is a theoretical and instrumental investigation of intonation in Jeddah Arabic, an urban Arabic variety spoken in west Saudi Arabia. The study is carried out in an attempt to establish the dialect’s prosodic properties and to widen the scope and volume of the literature on Arabic prosody that would in turn aid in the cross-dialectal comparison of prosodic and intonational patterns. The investigation is carried out in light of the Auto-Segmental Metrical theory of intonation- a theory that has been reported to account for the intonational patterns of many languages. In AM theory, intonation is manifested via prominent F0 behaviour in interaction with phonological structure, hence maintains a close relationship between accent distribution and phonological/metrical structure. This F0 behaviour is examined acoustically through pitch level, range and excursion size, in the form of increased peak height and excursion, pitch compression or absence thereof to mark intonational structure. In addition to pitch, other acoustic correlates such as duration and amplitude are examined as well. The thesis includes the examination of the different tunes, postlexical phrasing, and accent categories (contour shapes) that occur in the dialect. Moreover, and as an integral part of AM analysis, the thesis closely examines both theoretically and acoustically the concepts of tonal alignment and accentuation and information structure in this Arabic dialect. Data for the study were collected from 20 native male and female speakers of Jeddah Arabic. Data were then semiautomatically segmented and manually transcribed using a modified TOBI system for Arabic. It is found that JA speakers rely on both qualitative and quantitative detail to enhance intonationally important material that is conveyed prosodically. The results also point to that JA is a stress-accent language that is although similar to other languages in this group, contributes differently to the general cross-language prosodic variation. The dialect demonstrates prominent pitch accents that faithfully associate and align with stressed syllables and are distributed in two intonational levels above the prosodic word: the intermediate phrase and the intonational phrase. Those two intonational levels are found to be marked by both tonal and non-tonal correlates. Experimental evidence shows that contrary to the typical reported correlates of those prosodic constituents, in JA intermediate phrases boundaries demonstrate longer pre-boundary units than intonational phrases. This non-tonal pattern in intermediate phrase boundaries correlates with later alignment of the tone with respect to the onset of the stressed syllable

    Tonal placement in Tashlhiyt

    Get PDF
    In most languages, words contain vowels, elements of high intensity with rich harmonic structure, enabling the perceptual retrieval of pitch. By contrast, in Tashlhiyt, a Berber language, words can be composed entirely of voiceless segments. When an utterance consists of such words, the phonetic opportunity for the execution of intonational pitch movements is exceptionally limited. This book explores in a series of production and perception experiments how these typologically rare phonotactic patterns interact with intonational aspects of linguistic structure. It turns out that Tashlhiyt allows for a tremendously flexible placement of tonal events. Observed intonational structures can be conceived of as different solutions to a functional dilemma: The requirement to realise meaningful pitch movements in certain positions and the extent to which segments lend themselves to a clear manifestation of these pitch movements

    Effects of Prosody while Disambiguating Ambiguous Japanese Sentences in the Brain of Native Speakers and Learners of Japanese: A Proposition for Pronunciation and Prosody Training

    Get PDF
    Recently, the significant role that pronunciation and prosody plays in processing spoken language has been widely recognized and a variety of teaching methodologies of pronunciation/prosody has been implemented in teaching foreign languages. Thus, an analysis of how similarly or differently native and L2 learners of a language use pronunciation/prosodic cues needs to be further investigated, and the learnability of pronunciation/prosodic features should be explored. In this study, the role of prosody in Japanese sentence processing will be specifically explored among native speakers and L2 learners of Japanese whose native language is English. In Experiment 1 and 2, the effect of prosody during Japanese sentence processing was explored among native speakers using a psycholinguistic measurement. In Experiment 3, we compared L2 learners processing and judgment of Japanese sentences utilizing the brain-imaging technique Electroencephalography (EEG) together with a psycholinguistic measurement. In Experiment 1 and 2, native speakers of Japanese listened to globally ambiguous sentences that can be interpreted in two ways, and temporarily ambiguous sentences that have two different syntactic structures. They either rated how acceptable each sentence is or answered a comprehension question on each sentence. As for the globally ambiguous sentences, the results revealed that overall one type (`embedded-clause') of interpretation is preferred over the other type (`main-clause') of interpretation at the judgment given time pressure. Prosody guided their interpretations to a certain degree; however, it did not have a deterministic effect, especially for arriving at a `main-clause' interpretation. As for the temporarily ambiguous sentences, significant effects of prosody in parsing temporarily ambiguous sentences were found, with the results suggesting that while parsing affects processing, its role is not deterministic. In Experiment 3, native speakers and intermediate- to advanced-level L2 learners of Japanese listened to two types of temporarily ambiguous sentences read with two types of prosody and rated how acceptable each sentence was. Simultaneously, their brain activity was continuously recorded using EEG. The results revealed important similarities and differences among the native speakers and L2 learners' processing of these sentences. Both groups yielded a brain response that indicates the detection of prosodic break, and prosody was utilized at least to some extent. However, the patterns were different among the two groups, and the precise nature of the effects for the learners suggests that they have difficulties with processing non-default-type of structure (`main-clause' structure), and the congruent prosody for that structure (`main-clause prosody'). These results indicate that L2 learners have access to prosodic cues in sentence comprehension. On the other hand, the measurements of processing presented here suggest that these learners are not yet utilizing prosody in a native-like way, suggesting the utility of creating new ways to introduce prosody and its relation with the structure and meaning of Japanese sentences. It is suggested that teaching how to use prosodic cues in comprehending complex sentences with various types of sentence structures may develop L2 learners' ability to develop their oral communication skills

    Prosody and Intonation in Formosan Languages

    Full text link
    The Formosan languages are the languages of the Aboriginal peoples of Taiwan. These languages are part of the Austronesian language family, and represent all but one primary branch of this family of 1,200+ languages. The Formosan languages are endangered, some critically so. While these languages have seen attention in the literature for their syntactic and phonological systems, little work has been done on their prosodic structure or intonation. This dissertation analyzes the prosodic structure and intonational phonology of Mantauran Rukai, Budai Rukai, Tsou, Kanakanavu, Hla’alua, Sandimen Paiwan, Piuma Paiwan, Kavalan, Amis, Bunun, Tgdaya Seediq, Truku Seediq, and Pazeh, based on original fieldwork. In addition, archival materials are incorporated into analyses of Tsou, Truku Seediq, Tgdaya Seediq, and Puyuma. This study finds that the Formosan languages show rich tonal phonologies in their intonational systems, and have complex interactions between stress assignment and morphology. Some examples include the following: Mantauran Rukai, previously described as an initial-stress language, actually has a complex stress assignment system with an alternation between first- and third-syllable stress, which as a system is unique in descriptions of stress assignment in the world’s languages. Hla’alua (Saaroa), previously described as having free variation between antepenultimate and penultimate stress, actually has an accent system in which some lexical items are consistently produced without an accented syllable, while others are. Hla’alua also has a rich tonal phonology assigned at two higher levels of the prosodic hierarchy. Kavalan has a unique rule that causes spreading tones to shift to the opposite domain edge when a certain number of tonal elements are aligned to the same boundary. Elements of the intonational phonology in Amis and Kavalan include glottal stops in addition to tonal elements. Bunun has distinct pitch accent melodies for words vs. clitics. In addition to the unique features found in individual Formosan languages, this dissertation’s comparative study finds at least two geographic areas within Taiwan in which features of prosody and intonation cluster. One is southwestern Taiwan, including Tsou, Kanakanavu, Hla’alua, and Rukai, which share features including a lack of glide-vowel contrasts and variability of initial H vs. L elements in certain prosodic domains. The other is eastern Taiwan, including Amis, Kavalan, and Puyuma, which share features including suppression of non-IP-final pitch accents, alternations between ultimate and pre-ultimate F0 peaks across intonational contours, and interactions between glottal stop epenthesis and intonational phonology

    Tonal placement in Tashlhiyt: How an intonation system accommodates to adverse phonological environments

    Get PDF
    In most languages, words contain vowels, elements of high intensity with rich harmonic structure, enabling the  perceptual retrieval of pitch. By contrast, in Tashlhiyt, a Berber language, words can be composed entirely of voiceless segments. When an utterance consists of such words, the phonetic opportunity for the execution of intonational pitch movements is exceptionally limited. This book explores in a series of production and perception experiments how these typologically rare phonotactic patterns interact with intonational aspects of linguistic structure. It turns out that Tashlhiyt allows for a tremendously flexible placement of tonal events. Observed intonational structures can be conceived of as different solutions to a functional dilemma: The requirement to realise meaningful pitch movements in certain positions and the extent to which segments lend themselves to a clear manifestation of these pitch movements
    • …
    corecore