Search CORE

63 research outputs found

Phonetics of segmental FO and machine recognition of Korean speech

Author: Jang Tae-Yeoub
Publication venue: The University of Edinburgh
Publication date: 01/01/2000
Field of study

Crosslinguistic trends in tone change A review of tone change studies in East and Southeast Asia

Author: Xu Y
Yang C
Publication venue: 'John Benjamins Publishing Company'
Publication date: 01/09/2019
Field of study

Ground-breaking studies on how Bangkok Thai tones have changed over the past 100 years (Pittayaporn 2007, 2018; Zhu et al. 2015) reveal a pattern that Zhu et al. (2015) term the “clockwise tone shift cycle:” low > falling > high level or rising-falling > rising > falling-rising or low. The present study addresses three follow-up questions: (1) Are tone changes like those seen in Bangkok Thai also attested in other languages? (2) What other tone changes are repeated across multiple languages? (3) What phonetic biases are most likely to be the origins of the reported changes? A typological review of 52 tone change studies across 45 Sinitic, Tai-Kadai, Hmong-Mien, and Tibeto-Burman languages reveals that clockwise changes are by far the most common. The paper concludes by exploring how tonal truncation (Xu 2017) generates synchronic variation that matches the diachronic patterns; this suggests that truncation is a key mechanism in tone change

UCL Discovery

Multiple acoustic cues for Korean stops and automatic speech recognition

Author: Yun Weonhee.
Publication venue: The University of Edinburgh
Publication date: 01/01/2003
Field of study

The purpose of this thesis is to analyse acoustic characteristics of Korean stops by way of multivariate statistical tests, and to apply the results of the analysis in Automatic Speech Recognition (ASR) of Korean. Three acoustic cues that differentiate three types of Ko¬ rean oral stops are closure duration, Voice Onset Time (VOT) and fundamental frequency (FO) of a vowel after a stop. We review the characteristics of these parameters previously reported in various phonetic studies and test their usefulness for differentiating the three types of stops on two databases, one with controlled contexts, as in other phonetic stud¬ ies, and the other a continuous speech database designed for ASR. Statistical tests on both databases confirm that the three types of stops can be differentiated by the three acoustic parameters. In order to exploit these parameters for ASR, a context dependent Hidden Markov Model (HMM) based baseline system with a short pause model is built, which results in great improvement of performance compared to other systems. For mod¬ elling of the three acoustic parameters, an automatic segmentation technique for closure and VOT is developed. Samples of each acoustic parameter are modelled with univariate and multivariate probability distribution functions. Stop probability from these models is integrated by a post-processing technique. Our results show that integration of stop prob¬ ability does not make much improvement over the results of a baseline system. However, the results suggest that stop probabilities will be useful in determining the correct hy¬ pothesis with a larger lexicon containing more minimal pairs of words that differ by the identity of just one stop

Edinburgh Research Archive

Introducing prosodic phonetics

Author: Heuven V.J.J.P. van
Publication venue: RU Leiden
Publication date: 01/01/1994
Field of study

Wetensch. publicati

Leiden University Scholary Publications

An acoustic-phonetic descriptive analysis of Kagoshima Japanese tonal phenomena

Author: Ishihara Shunichi
Publication venue
Publication date
Field of study

This thesis presents a linguistic-phonetic description of the acoustic properties of the contrastive accentual patterns in Kagoshima Japanese. Kagoshima Japanese is one of the traditionally so-called two pattern varieties of Japanese. Different phonological analyses of the contrast are described in detail in order to situate the phonetic description. Data from four native speakers—two males and two females—are used, appropriately log z-score normalised to yield mean normalised FO curves necessary for linguistic-phonetic description. Normalisation procedures are discussed, justified and explained. The acoustic realisation of the Kagoshima Japanese contrast is specified on words from one to seven syllables in length, and a surface representation proposed within Autosegmental-Metrical theory. Some microprosodic aspects are also examined, especially in terms of the effect of the syllable-rhyme structure (i.e. vowel length, final nasal) on FO. In addition is examined what happens when accentual units are juxtaposed in three types of syntagms (noun phrase e.g. nagaka tamago; possessive phrase e.g. mago no kimono; simple sentence, e.g. sakana ga nigeru). Where appropriate, comparisons are drawn with Standard Japanese to highlight similarities and differences in tonality. It is shown that, like Standard Japanese, the Kagoshima Japanese accentual contrast is realised as ± falling pitch/FO, and also like SJ, the contrast is manifested globally, throughout the word. Microprosodically, it is shown that syllable-final nasals are associated with higher FO, not necessarily on the nasal itself, and that heavy syllables also evince a higher FO. Finally, two extrinsic allotones—level and falling—are demonstrated for one of the accentual types, depending on its rhyme constituents. As far as the juxtaposed data are concerned, it is shown that, unlike Standard Japanese, no deaccentuation is involved. Moreover, a differential effect is demonstrated, in terms of FO downstep, with respect to syntactic type: noun phrases behave differently from possessive phrases and simple sentences in showing no downstep. It is suggested that the magnitude of downstep in Kagoshima Japanese may be smaller than in Standard Japanese. It is proposed that the linguistic-phonetic representations derived in this thesis can be used not only to investigate within-language linguistic features (e.g. the Kagoshima Japanese accentual contrast), but also to compare Kagoshima with the corresponding linguistic-phonetic representations of other Japanese varieties

The Australian National University

Prosodic analysis and Asian linguistics : to Honour R.K. Sprigg

Author: Bradley David
Henderson E. J. A
Mazaudon Martine
Sprigg Richard Keith
Publication venue: Dept. of Linguistics, Research School of Pacific Studies, The Australian National University
Publication date: 01/01/1989
Field of study

The Australian National University

A study on form and function of prosody based on acoustics, interpretation, and modelling - with evidence from the analysis by synthesis of Mandarin speech prosody

Author: Zhi Na
Publication venue: 'Scuola Normale Superiore - Edizioni della Normale'
Publication date: 01/11/2012
Field of study

An analysis-by-synthesis study on Mandarin speech prosody is conducted in the present dissertation. The features of Mandarin speech prosody are discussed by focusing on two salient aspects: the function of prosody and the form of prosody. The study attempts to find a plausible way in which the two aspects can be mapped onto each other through the functional analysis of prosody and the multi-level formal representation. The form of Mandarin speech prosody is a complex F0 picture due to the simultaneous uses of pitch contours by both lexical tones and sentential intonation. The phenomenon of tone sandhi in speech context triggers more puzzling issues when researchers are confronted with the acoustic form of Mandarin prosody. The functional use of prosody in Mandarin speech concerns: at the lexical level for word identity (Tone1, Tone2, Tone3, Tone4, and Tone0); at the sentential level for prominence marking (sentence accents) and the indication of prosodic boundaries (intonation boundary tones). In the present study, the analysis of prosodic function at the two levels provides a basic framework in coding the surface melodic form of Mandarin prosody, which consists of pitch contours in tonal units and boundary tones at the beginning and end of intonation unit. For the formal representation of Mandarin speech prosody, the surface F0 contour of each utterance is coded into a sequence of INTSINT symbols, and subject to the Prozed tool for speech synthesis. It is shown that the synthesized stimuli derived from the symbolic coding can closely follow the melodic features and correctly express the prosodic function of the original Mandarin utterances. The present study employs acoustic data, symbolic coding, and speech synthesis for the derivative mapping between prosodic function and form, which aims to interpret the complex prosodic phenomenon, and provide an insight for the annotation and analysis of Mandarin speech prosody

Archivio istituzionale della Ricerca - Scuola Normale Superiore

Rozdíly realizace tónů hanojského a saigonského dialektu vietnamštiny mezi čteným a polospontánním mluveným projevem

Author: Slówik Ondřej
Publication venue: Univerzita Karlova, Filozofická fakulta
Publication date: 01/01/2018
Field of study

Hlavním cílem této disertace je popis rozdílů realizací tónů mezi hanojským s saigonským dialektem se zaměřením na situaci ve čteném a polospontánním mluveném projevu. Výzkum se zabývá zejména produkcí, avšak je doplněn i oddílem, který řeší problematiku percepce tónů s využitím percepčního testu. Oddíl 2.1. popisuje tonalitu a tónové jazyky v obecném smyslu. V oddíle 2.2. se popisuje vietnamský jazyk s důrazem na tónové inventáře obou zkoumaných dialektů. Zmiňuje se též vznik a vývoj tónů obecně i v rámci vietnamštiny. Třetí kapitola představuje metodu výzkumu, zejména výběr mluvčích, přípravu materiálu a nahrávek, extrakce dat a přípravu analýz a percepčního testu. Čtvrtá kapitola je rozdělena do tří oddílů. Oddíl 4.1. hovoří o realizacích tónů v izolaci a ve speciálně zvoleném kontextu. Jeho účelem je popsat chování tónů co nejméně ovlivněných okolními proměnnými. Výsledky této analýzy by měly být srovnatelné s již publikovanými studiemi na stejné téma. V oddíle 4.2. je na data nahlédnuto z kvantitativní perspektivy a výsledky této analýzy by měly věrněji reflektovat jazykovou realitu. Oddíl 4.3. představuje percepční test, jehož účelem je zmapovat schopnost rozlišování tónů mimo přirozený kontext. Výsledky analýz indikují zásadní rozdíly mezi konturami tónů hanojského a saigonského dialektu....The chief objective of this dissertation is the description of tone realization differences in Hanoian and Saigonese dialects based on a representative sample of recorded material, with special focus on read monologue and semi-spontaneous conversational speech. The research discusses mainly issues of tone production but it is complemented by a section on tone perception in form of a perception test. The theoretical background in Section 2.1. describes the topic of tonality and tonal languages in general. Section 2.2. is devoted to the description of the Vietnamese language and attention is specifically paid to tonal inventories of both researched dialects. Tonogenesis is mentioned on a general level as well as in the Vietnamese language in particular. Chapter 3 introduces the research methodology, namely the speaker selection, speech material preparation and recording, data extraction and preparation for the analyses and the perception test. Chapter 4 is divided into three sections. Section 4.1. speaks about tone realizations in isolation and carefully preselected context. Its goal is to investigate the behaviour of tonal contours influenced by as few variables as possible. The results should be comparable to the findings of previously conducted studies. Section 4.2. strives to assess data from a...Institute of PhoneticsFonetický ústavFilozofická fakultaFaculty of Art

CU Digital Repository

Recommended from our members

Neutral Tone in Mandarin: Representation and Interaction with Utterance-level Prosody

Author: Zhang Yixin
Publication venue: University of Cambridge
Publication date: 27/12/2021
Field of study

In Standard Mandarin, there are syllables that do not carry any of the four citation tones (T1: High-level tone, T2: Mid-rising tone, T3: Low-convex tone and T4: High-falling tone), and they are said to have a neutral tone (NT). These syllables are usually shorter, lighter, prosodically grouped with the preceding CT-bearing syllables. These characteristics of NT have led to a prevailing view that it has no underlying phonological specification. However, research has focused more on how the surface pitch variations of NT are realized rather than the underlying representation of NT. In contrast, morphological, sociolinguistic and diachronic work on NT has suggested that NT may not be a homogeneous entity. In this thesis, I provide acoustic and psycholinguistic evidence that there are two types of NT, Intrinsic NT and Derived NT. Intrinsic NT refers to morphemes that were lexicalized as tone-deleted, unstressed syllables even before the formation of the four CTs of modern Mandarin. Derived NT refers to morphemes derived from the CTs via stress-related tone-deletion. In Part A, the phonological representation of Intrinsic and Derived NT is explored through two production and two processing experiments. The results show that Intrinsic NT is likely to have an underspecified tonal target while Derived NTs are underlyingly CTs. In addition, both subtypes of NT are metrically light, unlike heavy CTs. Part B explores the interaction between NTs and utterance-level prosody in production and perception experiments. NT-bearing syllables have lengthening patterns under focus similar to CT-bearing syllables, in contrast to the realization of unstressed syllables in English. In perception, the identification of intonation (Statement vs. Question) on Intrinsic NT was similar to Derived NT. When compared to CTs, the NTs elicit less bias towards question than T4, and higher accuracy than T2, which may result from their simpler surface representations.CHINA Scholarship COUNCIL (CSC) and Cambridge Trus

Apollo (Cambridge)