Search CORE

37 research outputs found

Production and perception of lexical tone variation in Mandarin dialects

Author: Zhao Liang
Publication venue
Publication date: 01/10/2023
Field of study

White Rose E-theses Online

AUTOMATIC EXTRACTION OF ARABIC SUBWORD UNITS FOR CONTINUOUS SPEECH RECOGNITION

Author
Publication venue
Publication date
Field of study

KFUPM ePrints

Recommended from our members

Automatic Dialect and Accent Recognition and its Application to Speech Recognition

Author: Biadsy Fadi
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/2011
Field of study

A fundamental challenge for current research on speech science and technology is understanding and modeling individual variation in spoken language. Individuals have their own speaking styles, depending on many factors, such as their dialect and accent as well as their socioeconomic background. These individual differences typically introduce modeling difficulties for large-scale speaker-independent systems designed to process input from any variant of a given language. This dissertation focuses on automatically identifying the dialect or accent of a speaker given a sample of their speech, and demonstrates how such a technology can be employed to improve Automatic Speech Recognition (ASR). In this thesis, we describe a variety of approaches that make use of multiple streams of information in the acoustic signal to build a system that recognizes the regional dialect and accent of a speaker. In particular, we examine frame-based acoustic, phonetic, and phonotactic features, as well as high-level prosodic features, comparing generative and discriminative modeling techniques. We first analyze the effectiveness of approaches to language identification that have been successfully employed by that community, applying them here to dialect identification. We next show how we can improve upon these techniques. Finally, we introduce several novel modeling approaches -- Discriminative Phonotactics and kernel-based methods. We test our best performing approach on four broad Arabic dialects, ten Arabic sub-dialects, American English vs. Indian English accents, American English Southern vs. Non-Southern, American dialects at the state level plus Canada, and three Portuguese dialects. Our experiments demonstrate that our novel approach, which relies on the hypothesis that certain phones are realized differently across dialects, achieves new state-of-the-art performance on most dialect recognition tasks. This approach achieves an Equal Error Rate (EER) of 4% for four broad Arabic dialects, an EER of 6.3% for American vs. Indian English accents, 14.6% for American English Southern vs. Non-Southern dialects, and 7.9% for three Portuguese dialects. Our framework can also be used to automatically extract linguistic knowledge, specifically the context-dependent phonetic cues that may distinguish one dialect form another. We illustrate the efficacy of our approach by demonstrating the correlation of our results with geographical proximity of the various dialects. As a final measure of the utility of our studies, we also show that, it is possible to improve ASR. Employing our dialect identification system prior to ASR to identify the Levantine Arabic dialect in mixed speech of a variety of dialects allows us to optimize the engine's language model and use Levantine-specific acoustic models where appropriate. This procedure improves the Word Error Rate (WER) for Levantine by 4.6% absolute; 9.3% relative. In addition, we demonstrate in this thesis that, using a linguistically-motivated pronunciation modeling approach, we can improve the WER of a state-of-the art ASR system by 2.2% absolute and 11.5% relative WER on Modern Standard Arabic

Columbia University Academic Commons

AUTOMATIC EXTRACTION OF ARABIC SUBWORD UNITS FOR CONTINUOUS SPEECH RECOGNITION

Author
Publication venue
Publication date
Field of study

A Corpus-based Study of the English Translation of Chinese Empty Words

Author: Chen Yi-Chiao
Publication venue: Humanities, Imperial College London
Publication date: 01/04/2013
Field of study

This study aims to investigate the translation procedures adopted for the English translation of Chinese empty words. To begin with, the methods which translators adopt to tackle empty words are identified by examining examples in a specially-constructed parallel corpus, which includes Chinese literary texts and their English translations. Eventually, eight translation procedures (1) Match; 2) Paraphrase; 3) Shared Match; 4) Implicitation; 5) Amplification; 6) Grammatical Conveyance; 7) Borrowing; 8) Omission) and one non-procedure (Mismatch) are identified. It is noted that Grammatical Conveyance is a procedure which could be deemed as a newly-identified method. As a further step, the proportion of these procedures/non-procedure is investigated to identify the most-/least-adopted ones (Match and Amplification respectively) and to discuss category-specific ones (Shared Match and Borrowing). In addition to identifying the procedures for translating Chinese empty words into English, this research also makes contributions in the following two aspects. Firstly, this study, to my knowledge, is the first research which examines all Chinese empty words at a time to identify the ways translators tackle them. Secondly, it is known from the results that difficult-to-tackle empty words are found in the categories of Adverb and Particle. In other words, not all empty words are difficult to translate as former scholars have described

Spiral - Imperial College Digital Repository

Austronesian and other languages of the Pacific and South-east Asia : an annotated catalogue of theses and dissertations

Author: Coppell W. G.
Publication venue: Dept. of Linguistics, Research School of Pacific Studies, The Australian National University
Publication date: 01/01/1981
Field of study

Ezid

The Australian National University

MPG.PuRe

Studies in the linguistic sciences. 17-18 (1987-1988)

Author
Publication venue: Urbana, Ill. : Dept. of Linguistics, University of Illinois,
Publication date
Field of study

Illinois Digital Environment for Access to Learning and Scholarship Repository

Linguistics in East Asia and South East Asia

Author: Noss Richard B.
Yamagiwa Joseph. K.
Yuen Ren Chao
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 27/06/2019
Field of study

Directory of Open Access Books (DOAB)

Language maintenance through primary school education: the case of Daighi

Author: Yang Annie
Yang Chia-Ying
Publication venue: The University of Edinburgh
Publication date: 30/07/2020
Field of study

Ongoing language shift to Taiwanese Mandarin is a pressing concern in Taiwan. With the concerns of losing the rich linguistic and cultural assets of Taiwan’s multilingual society, this study sets out to explore the language maintenance endeavours in primary schools, focusing on Daighi. Exploration of language attitudes is the angle this study adopts to approach language shift, looking specifically at whether language attitudes are promoted through the mandatory local languages class at primary school level. However, a large piece of the picture would be missed without the evaluation of the context, which is crucial to understand Daighi’s position. Sociocultural theory is then adopted as an analytical lens to view teachers’ practices as mediated actions, and to make visible the impact of context in Daighi maintenance. Interviews are used to explore the insights of the frontline Daighi teachers, and Daighi classes of these teachers are observed to investigate their practices, and to match these with their perceptions. In spite of the good teaching practices found at schools and attitudes to support language maintenance, there is still a gap in terms of actual language maintenance, which is defined as developing students to become functional bilinguals (Li Wei, 2006). It is possible that language maintenance is not best achieved by focusing on classroom practice alone. The Discussion Chapter then presents the mediators from global level, national level to classroom, students and teacher agency. Language policy, educational system, and perceived language attitudes of the government, local authority, school, colleagues, family and students emerge as influential mediators that contribute to the ongoing language shift to Taiwanese Mandarin. This study provides an analytical insight into Taiwanese local language education and language attitudes. Through engaging with the teachers, it also inspired critical reflections of their own practices. The findings of this study demonstrate an in-depth understanding of Daighi maintenance and shift, and provide a starting point for further research in Daighi, and in the area of language maintenance in multilingual settings

Edinburgh Research Archive