42 research outputs found
The production and perception of peripheral geminate/singleton coronal stop contrasts in Arabic
Gemination is typologically common word-medially but is rare at the periphery of the word (word-initially and -finally). In line with this observation, prior research on production and perception of gemination has focused primarily on medial gemination. Much less is known about the production and perception of peripheral gemination. This PhD thesis reports on comprehensive articulatory, acoustic and perceptual investigations of geminate-singleton contrasts according to the position of the contrast in the word and in the utterance. The production component of the project investigated the articulatory and acoustic features of medial and peripheral gemination of voiced and voiceless coronal stops in Modern standard Arabic and regional Arabic vernacular dialects, as produced by speakers from two disparate and geographically distant countries, Morocco and Lebanon. The perceptual experiment investigated how standard and dialectal Arabic gemination contrasts in each word position were categorised and discriminated by three groups of non-native listeners, each differing in their native language experience with gemination at different word positions. The first experiment used ultrasound and acoustic recordings to address the extent to which word-initial gemination in Moroccan and Lebanese dialectal Arabic is maintained, as well as the articulatory and acoustic variability of the contrast according to the position of the gemination contrast in the utterance (initial vs. medial) and between the two dialects. The second experiment compared the production of word-medial and -final gemination in Modern Standard Arabic as produced by Moroccan and Lebanese speakers. The aim of the perceptual experiment was to disentangle the contribution of phonological and phonetic effects of the listeners’ native languages on the categorisation and discrimination of non-lexical Moroccan gemination by three groups of non-native listeners varying in their phonological (native Lebanese group and heritage Lebanese group, for whom Moroccan is unintelligible, i.e., non-native language) and phonetic-only (native English group) experience with gemination across the three word positions. The findings in this thesis constitute important contributions about positional and dialectal effects on the production and perception of gemination contrasts, going beyond medial gemination (which was mainly included as control) and illuminating in particular the typologically rare peripheral gemination
North East Indian linguistics 6
The papers for this volume were initially presented at the sixth and seventh meetings of the North East Indian Linguistics Society, held in Guwahati, India, in 2011 and 2012. As with previous conferences, these meetings were held at the Don Bosco Institute in Guwahati, Assam, and hosted in collaboration with Gauhati University. The present collection of papers are
testament to the ongoing interest in North East India and continued success and growth in the community of North East Indian linguists. As in previous volumes, all the papers here were reviewed by leading international specialists in the relevant subfields. This volume, in particular,
highlights the recent research of many scholars from the region. Out of eleven contributions, eight are from North East Indian scholars themselves. This book therefore brightly shines light on the work being done by North East Indian linguists on the languages of their own region. The
remaining contributions are authored by international scholars from Australia, Singapore, Germany/USA, and Nepal
An exploration of minimal and maximal metrical feet
This thesis presents a principled theory of bounded recursive footing. Building on previous research on metrical stress, and couched within the framework of Prosodic Hierarchy Theory, I argue that the rehabilitation of recursive feet in phonological representations leads to an improvement of our theory of prosody.
I investigate the major driving forces that may cause recursion at the foot level and demonstrate that reference to recursive and non-recursive feet in various related and unrelated languages (e.g. Wargamay, Yidiɲ, Chugach, English, Dutch, German, Gilbertese, Seneca, Ryukyuan, Tripura Bangla, Cayuvava) allows us to provide a unified account of a wide range of prosodically-conditioned phenomena which would otherwise remain unexplained. In particular, I demonstrate that the assignment of binary and ternary stress, certain tonal distributions, some puzzling cases of vowel lengthening, consonant fortition, vowel reduction and consonant weakening all clearly benefit from recursion-based analyses.
In arguing for the need for recursive feet in phonological representations, I identify new strength relations in prosodic systems. Besides the well-established strength dichotomy between the head of a foot (i.e. the strong branch of a foot) and the dependent of a foot (i.e. its weak branch), I show that languages may distinguish between further metrical prominence positions. These additional required positions do not need to be stipulated as they come for free in a framework that allows recursion at the level of the foot
Recommended from our members
Text-to-Speech Synthesis Using Found Data for Low-Resource Languages
Text-to-speech synthesis is a key component of interactive, speech-based systems. Typically, building a high-quality voice requires collecting dozens of hours of speech from a single professional speaker in an anechoic chamber with a high-quality microphone. There are about 7,000 languages spoken in the world, and most do not enjoy the speech research attention historically paid to such languages as English, Spanish, Mandarin, and Japanese. Speakers of these so-called "low-resource languages" therefore do not equally benefit from these technological advances. While it takes a great deal of time and resources to collect a traditional text-to-speech corpus for a given language, we may instead be able to make use of various sources of "found'' data which may be available. In particular, sources such as radio broadcast news and ASR corpora are available for many languages. While this kind of data does not exactly match what one would collect for a more standard TTS corpus, it may nevertheless contain parts which are usable for producing natural and intelligible parametric TTS voices.
In the first part of this thesis, we examine various types of found speech data in comparison with data collected for TTS, in terms of a variety of acoustic and prosodic features. We find that radio broadcast news in particular is a good match. Audiobooks may also be a good match despite their largely more expressive style, and certain speakers in conversational and read ASR corpora also resemble TTS speakers in their manner of speaking and thus their data may be usable for training TTS voices.
In the rest of the thesis, we conduct a variety of experiments in training voices on non-traditional sources of data, such as ASR data, radio broadcast news, and audiobooks. We aim to discover which methods produce the most intelligible and natural-sounding voices, focusing on three main approaches:
1) Training data subset selection. In noisy, heterogeneous data sources, we may wish to locate subsets of the data that are well-suited for building voices, based on acoustic and prosodic features that are known to correspond with TTS-style speech, while excluding utterances that introduce noise or other artifacts. We find that choosing subsets of speakers for training data can result in voices that are more intelligible.
2) Augmenting the frontend feature set with new features. In cleaner sources of found data, we may wish to train voices on all of the data, but we may get improvements in naturalness by including acoustic and prosodic features at the frontend and synthesizing in a manner that better matches the TTS style. We find that this approach is promising for creating more natural-sounding voices, regardless of the underlying acoustic model.
3) Adaptation. Another way to make use of high-quality data while also including informative acoustic and prosodic features is to adapt to subsets, rather than to select and train only on subsets. We also experiment with training on mixed high- and low-quality data, and adapting towards the high-quality set, which produces more intelligible voices than training on either type of data by itself.
We hope that our findings may serve as guidelines for anyone wishing to build their own TTS voice using non-traditional sources of found data
Windesi Wamesa Morphophonology
Wamesa [wam] is an endangered Austronesian language spoken in the south-eastern Bird’s Head of New Guinea, in the Indonesian province of West Papua. is dissertation provides a description and formal analysis of the phonology and morphology of the Windesi dialect based on the author’s fieldwork with speakers of the language. Chapter 1 provides an introduction to the language, its speakers, and the cultural, geographic, and linguistic context in which Wamesa is spoken. It also provides background on the fieldwork which forms the basis of this dissertation and the resulting corpus. Chapter 2 describes the phonology of Wamesa, including its phoneme inventory, phonotactics, and productive phonological processes, with phonetic detail. e second half of the chapter gives an account of the phonological adaptation of loan words into Wamesa. Chapter 3 gives a formal analysis of stress assignment in the language based in Optimality eory. Chapter 4 describes the Wamesa clitics and affixes, and Chapter 5 gives an account of the three major word classes, nouns, verbs, and adjectives, as well as modes of spatial expression and a selection of other minor word classes. Chapter 6 gives a formal synchronic analysis of the infixation of verbal subject agreement affixes in Wamesa, followed by a diachronic account of how the paern might have arisen from incremental improvements in speech production and perception
Syllable structure and related processes in optimality theory :an examination of Najdi Arabic
PhD ThesisThis study is an investigation of syllable structure and related processes in one variety of Saudi Arabic. This is the variety spoken by inhabitants of Riyadh and villages near this city in Najd province, henceforth referred to as Najdi Arabic (NA). Although this dialect has been analysed by scholars including Johnstone (1963, 1967), Lehn (1967), Ingham (1971, 1982, 1994), Abboud (1979), Al-Sweel (1987, 1990), Prochazka (1988), Kurpershoek (1999), Alezets (2007), Alessa (2008), and Alghmaiz (2013), syllable structure and related processes in this dialect have not been accounted for within Optimality Theory (OT). Therefore, the main goal of this thesis is to show how OT, as an analytical framework, is utilized to produce a better understanding syllable structure and related processes such as CV metathesis, epenthesis, vowel shortening, and syncope in NA. Accordingly, the fundamental aims of this thesis are to examine phonological processes that have an impact on the syllable structure in this dialect and to show the insights about NA syllable structures and related processes that can be gained through OT analyses.
The research draws on previous work on NA as well as other Arabic varieties more generally. Thus, the theoretical literature on syllables, syllable structures and syllable typologies are taken into consideration in the analysis of NA data. The data for this study are drawn from articles, essays, theses, and journals. These sets of data underwent my own judgment as an NA native speaker. In addition, 15 native speakers of NA were interviewed and consulted on the NA set of data in this thesis.
There are four findings in this study. The first deals with the comprehensive analysis of syllable structure in NA, focusing on the types of onsets and codas as well as the weight of syllables in this dialect. The second extends to the comprehensive analysis that deals with the main phonological processes in NA, focusing on CV-metathesis, epenthesis, vowel shortening, and syncope. The third sheds light on the unified set of OT constraints that has been established to explain NA syllable structure and related processes within OT. Finally, the capability of OT to account for cross-linguistic variation is demonstrated by showing how language-specific constraint rankings based on one set of constraints accounts for CV metathesis, vowel epenthesis, and syncope in Najdi and Urban Hijazi Arabic (UHA)
Dutch A-Scrambling Is Not Movement: Evidence from Antecedent Priming
The present study focuses on A-scrambling in Dutch, a local word-order alternation that typically signals the discourse-anaphoric status of the scrambled constituent. We use cross-modal priming to investigate whether an A-scrambled direct object gives rise to antecedent reactivation effects in the position where a movement theory would postulate a trace. Our results indicate that this is not the case, suggesting that A-scrambling in Dutch results from variation in base-generated order
On looking into words (and beyond): Structures, Relations, Analyses
On Looking into Words is a wide-ranging volume spanning current research into word structure and morphology, with a focus on historical linguistics and linguistic theory. The papers are offered as a tribute to Stephen R. Anderson, the Dorothy R. Diebold Professor of Linguistics at Yale, who is retiring at the end of the 2016-2017 academic year. The contributors are friends, colleagues, and former students of Professor Anderson, all important contributors to linguistics in their own right. As is typical for such volumes, the contributions span a variety of topics relating to the interests of the honorand. In this case, the central contributions that Anderson has made to so many areas of linguistics and cognitive science, drawing on synchronic and diachronic phenomena in diverse linguistic systems, are represented through the papers in the volume.
The 26 papers that constitute this volume are unified by their discussion of the interplay between synchrony and diachrony, theory and empirical results, and the role of diachronic evidence in understanding the nature of language. Central concerns of the volume include morphological gaps, learnability, increases and declines in productivity, and the interaction of different components of the grammar. The papers deal with a range of linked synchronic and diachronic topics in phonology, morphology, and syntax (in particular, cliticization), and their implications for linguistic theory
On looking into words (and beyond): Structures, Relations, Analyses
On Looking into Words is a wide-ranging volume spanning current research into word structure and morphology, with a focus on historical linguistics and linguistic theory. The papers are offered as a tribute to Stephen R. Anderson, the Dorothy R. Diebold Professor of Linguistics at Yale, who is retiring at the end of the 2016-2017 academic year. The contributors are friends, colleagues, and former students of Professor Anderson, all important contributors to linguistics in their own right. As is typical for such volumes, the contributions span a variety of topics relating to the interests of the honorand. In this case, the central contributions that Anderson has made to so many areas of linguistics and cognitive science, drawing on synchronic and diachronic phenomena in diverse linguistic systems, are represented through the papers in the volume.
The 26 papers that constitute this volume are unified by their discussion of the interplay between synchrony and diachrony, theory and empirical results, and the role of diachronic evidence in understanding the nature of language. Central concerns of the volume include morphological gaps, learnability, increases and declines in productivity, and the interaction of different components of the grammar. The papers deal with a range of linked synchronic and diachronic topics in phonology, morphology, and syntax (in particular, cliticization), and their implications for linguistic theory