65 research outputs found

    Rhotics. New Data and Perspectives

    This book provides an insight into the patterns of variation and change of rhotics in different languages and from a variety of perspectives. It sheds light on the phonetics, the phonology, the sociolinguistics and the acquisition of /r/-sounds in languages as diverse as Dutch, English, French, German, Greek, Hebrew, Italian, Kuikuro, Malayalam, Romanian, Slovak, Tyrolean and Washili Shingazidja, thus contributing to the discussion on the unity and uniqueness of this group of sounds.

    Phonetics of segmental F0 and machine recognition of Korean speech


    Building a Phonological Inventory

    The Feature Co-occurrence Constraint theory proposed in this dissertation provides a means to capture the development of the language-learning child's segment inventory. It does this by combining a growing set of features with constraints that are automatically activated as soon as these features are acquired. Representation and derivation go hand in hand, and develop together during acquisition. The Feature Co-occurrence Constraint theory builds on a minimal view of phonology, where the inventory is seen as epiphenomenal rather than a mentally ‘real’ object, features are few and monovalent, and the constraint set is limited to no more than two types. The theoretical consequences of the proposal for both feature theory and constraint theory are worked out in detail, and a thorough discussion of phonological acquisition is provided, making this book of interest to both theoretical phonologists and acquisitionists.
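    As a rough illustration of the mechanism described above (not the dissertation's own formalism), the sketch below treats monovalent features as being acquired one at a time, activates a co-occurrence constraint as soon as all the features it mentions have been acquired, and derives the segment inventory as whatever feature bundles the active constraints still permit. The feature names and constraints are invented placeholders.

# Minimal sketch of feature co-occurrence constraints over an incrementally
# acquired, monovalent feature set. The inventory is not stored anywhere:
# it is recomputed (epiphenomenally) from the features and active constraints.
from itertools import combinations

constraints = [                       # hypothetical co-occurrence bans
    {"labial", "dorsal"},             # *[labial, dorsal]
    {"nasal", "spread_glottis"},      # *[nasal, spread glottis]
]

def active_constraints(features):
    """A constraint is active only once all features it refers to are acquired."""
    return [c for c in constraints if c <= set(features)]

def inventory(features):
    """Emergent inventory: all feature bundles not ruled out by an active constraint."""
    bans = active_constraints(features)
    segments = []
    for n in range(1, len(features) + 1):
        for bundle in combinations(features, n):
            if not any(ban <= set(bundle) for ban in bans):
                segments.append(frozenset(bundle))
    return segments

# Acquiring features one at a time grows the licensed inventory step by step.
acquired = []
for feature in ["labial", "dorsal", "nasal", "spread_glottis"]:
    acquired.append(feature)
    print(feature, "->", len(inventory(acquired)), "licensed feature bundles")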

    Loan Phonology

    For many different reasons, speakers borrow words from other languages to fill gaps in their own lexical inventory. The past ten years have been characterized by great interest among phonologists in the issue of how the nativization of loanwords occurs. The general feeling is that loanword nativization provides a direct window for observing how acoustic cues are categorized in terms of the distinctive features relevant to the L1 phonological system, as well as for studying L1 phonological processes in action, and thus into the true synchronic phonology of the L1. The collection of essays presented in this volume provides an overview of the complex issues phonologists face when investigating this phenomenon and, more generally, the ways in which unfamiliar sounds and sound sequences are adapted to converge with the native language’s sound pattern. This book is of interest to theoretical phonologists as well as to linguists interested in language contact phenomena.

    Measuring phonological distance between languages

    Three independent approaches to measuring cross-language phonological distance are pursued in this thesis: exploiting phonological typological parameters; measuring the cross-entropy of phonologically transcribed texts; and measuring the phonetic similarity of non-word nativisations by speakers from different language backgrounds. Firstly, a set of freely accessible online tools is presented to aid in establishing parametric values for syllable structure and phoneme inventory in different languages. The tools allow researchers to make differing analytical and observational choices and compare the results. These tools are applied to 16 languages, and correspondence between the resulting parameter values is used as a measure of phonological distance. Secondly, the computational technique of cross-entropy measurement is applied to texts from seven languages, transcribed in four different ways: a phonemic IPA transcription; with Elements; and with two sets of binary distinctive features in the SPE tradition. This technique yields consistently replicable rankings of phonological similarity for each transcription system. It is sensitive to differences between transcription systems and can be used to probe the consequences for information transfer of the choices made in devising a representational system. Thirdly, participants from different language backgrounds are presented with non-words covering the vowel space and asked to nativise them. The accent distance metric ACCDIST is applied to the resulting words: a profile of how each speaker’s productions cluster in the vowel space is produced, and ACCDIST measures the similarity of these profiles. Averaging across speakers with a shared native language produces a measure of similarity between language profiles. Each of these three approaches delivers a quantitative measure of phonological similarity between individual languages. They are each sensitive to different analytical choices and require different types and quantities of input data, and so can complement each other. This thesis provides a proof of concept for methods which are both internally consistent and falsifiable.
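    The cross-entropy approach lends itself to a simple illustration. The sketch below (a generic character bigram model with add-one smoothing, not the exact procedure used in the thesis) trains on one phonemically transcribed text and reports bits per symbol on another; lower values would indicate greater similarity under the chosen transcription. The transcribed strings are invented placeholders.

# Rough sketch of cross-entropy between two phonemically transcribed texts:
# train a character bigram model with add-one smoothing on language A and
# measure bits per symbol on language B.
import math
from collections import Counter

def bigram_model(text):
    pairs = Counter(zip(text, text[1:]))   # bigram counts
    unigrams = Counter(text)               # context counts
    return pairs, unigrams, set(text)

def cross_entropy(model, text):
    pairs, unigrams, vocab = model
    v = len(vocab) + 1                     # +1 slot for unseen symbols
    bits = 0.0
    for prev, cur in zip(text, text[1:]):
        p = (pairs[(prev, cur)] + 1) / (unigrams[prev] + v)   # add-one smoothing
        bits -= math.log2(p)
    return bits / (len(text) - 1)          # average bits per symbol

# Hypothetical IPA-transcribed samples; real input would be whole transcribed texts.
text_a = "ðəkætsætɒnðəmæt"
text_b = "diːkatsɪtstaʊfdeːmatə"
print(round(cross_entropy(bigram_model(text_a), text_b), 3))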

    Experimental phonetic study of the timing of voicing in English obstruents

    The treatment given to the timing of voicing in three areas of phonetic research -- phonetic taxonomy, speech production modelling, and speech synthesis -- is considered in the light of an acoustic study of the timing of voicing in British English obstruents. In each case, it is found to be deficient. The underlying cause is the difficulty of applying a rigid segmental approach to an aspect of speech production characterised by important inter-articulator asynchronies, coupled with the limited quantitative data available concerning the systematic properties of the timing of voicing in languages. It is argued that the categories and labels used to describe the timing of voicing in obstruents are inadequate for fulfilling the descriptive goals of phonetic theory. One possible alternative descriptive strategy is proposed, based on incorporating aspects of the parametric organisation of speech into the descriptive framework. Within the domain of speech production modelling, no satisfactory account has been given of fine-grained variability in the timing of voicing that cannot be explained in terms of general properties of motor programming and utterance execution. The experimental results support claims in the literature that the phonetic control of an utterance may be somewhat less abstract than has been suggested in some previous reports. A schematic outline is given of one way in which the timing of voicing could be controlled in speech production. The success of a speech synthesis-by-rule system depends to a great extent on a comprehensive encoding of the systematic phonetic characteristics of the target language. Only limited success has been achieved in the past thirty years. A set of rules is proposed for generating more naturalistic patterns of voicing in obstruents, reflecting those observed in the experimental component of this study. Consideration is given to strategies for evaluating the effect of fine-grained phonetic rules in speech synthesis.
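    To make the idea of synthesis-by-rule control of voicing timing concrete, the sketch below shows the general shape such rules might take: timing parameters chosen as a function of obstruent voicing, position and stress. The millisecond values are illustrative placeholders, not the figures measured in this study.

# Illustrative sketch of a rule-based scheme for voicing timing in plosives.
# Only the structure (category + context -> timing parameters) is meant to
# reflect the synthesis-by-rule idea; the numbers are invented placeholders.
def voicing_timing(voiced, position, stressed):
    """Return (voice onset time, closure voicing duration) in milliseconds."""
    if voiced:
        vot = 15 if position == "initial" else 10
        closure_voicing = 60 if position == "medial" else 30
    else:
        vot = 70 if stressed else 45       # longer lag for stressed voiceless plosives
        closure_voicing = 0
    if position == "final":
        vot = 0                            # final plosives often unreleased
    return vot, closure_voicing

print(voicing_timing(voiced=False, position="initial", stressed=True))
print(voicing_timing(voiced=True, position="medial", stressed=False))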

    Chamic and beyond: studies in mainland Austronesian languages


    Investigating the Signs and Sounds of Cypro-Minoan

    The aim of this dissertation is to advance our knowledge of Cypro-Minoan, a group of undeciphered syllabic inscriptions dated roughly to the 16th or 15th through the 11th centuries BCE and found mostly on Cyprus, with small numbers of documents from coastal Syria and Tiryns (Peloponnese). Two recently published collections of inscriptions (Olivier 2007; Ferrara 2012 and 2013) have greatly facilitated the investigation of Cypro-Minoan, but the field is still missing a comprehensive paleographical study of the script and a definitive sign-list on which most scholars can agree. Although it is now the main reference, Olivier’s sign repertory of 96 syllabograms is structured upon É. Masson’s (1974) division of Cypro-Minoan into three “subscripts” (CM 1, 2 and 3), supposedly created and used for different languages, a scheme which has come under criticism. It remains uncertain whether the Cypro-Minoan documents contain one or multiple writing systems. Together with the size of the corpus (almost 4,000 signs on fewer than 250 inscriptions), these lacunae greatly reduce the chances of decipherment. The present work intends to demonstrate that Cypro-Minoan in fact presents some advantages that open prospects for elucidating the script, and that a methodology well-adjusted to them may contribute to improving our understanding of the inscriptions. Therefore, the goal of this thesis is twofold: (1) to establish a signary that identifies individual Cypro-Minoan signs and defines their paleographical variation to a fine degree of accuracy; and (2) to investigate the possible sounds represented by these signs. The second objective is pursued by means of a three-step methodology. The first two steps comprise, on the one hand, cross-comparisons between the Cypro-Minoan signs (in terms of form and value) and signs attested in related scripts, namely Linear A and the Cypriot Greek syllabary; and, on the other hand and independently, internal analyses (positional distribution and frequency, alternations of related signs as a result of morphological activity, and scribal hesitations). The sound values proposed through these two methods are then tested by a third, which consists of provisionally transliterating a limited set of Cypro-Minoan inscriptions to ascertain whether they yield readings corresponding to linguistic data known from external sources, thereby validating the hypothetical sign values and possibly even suggesting new ones. While a cogent decipherment is not within the scope of this project, two main objectives are achieved. The first is to offer a revised list of Cypro-Minoan signs, not framed within the traditional division but based on selected homogeneous subcorpora of inscriptions, with no preconceived bias as to the number of script varieties represented. It is argued that Cypro-Minoan contains between 57 and 70 different syllabograms, depending on the validity of a number of proposed assimilations of signs that are possibly mere allographs. Secondly, phonetic values are proposed for 60 of these sign forms: nine are considered confirmed and the rest hypothetical. In the investigation of the phonetic values, interpretations are offered for RASH Atab 004 (= RS 20.25), a clay tablet from Ugarit (Syria) long thought to contain a nominal list, and for a limited set of sequences found on inscriptions from Cyprus. The suggested sound values and interpretations of sign sequences, many of which represent identifications of personal names recognizable from cuneiform sources, independently corroborate a significant number of proposals made by Nahm in the 1980s.
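    One of the internal analyses mentioned above, positional distribution and frequency, can be illustrated with a short sketch that tabulates how often each sign occurs initially, medially and finally across a set of inscriptions. The sign labels and sequences below are invented placeholders, not actual Cypro-Minoan readings.

# Sketch of a positional-distribution count for syllabic signs across inscriptions.
from collections import defaultdict

inscriptions = [                          # each inscription as a list of sign labels
    ["CM04", "CM12", "CM75"],
    ["CM12", "CM23", "CM12", "CM09"],
    ["CM75", "CM04"],
]

def positional_profile(texts):
    profile = defaultdict(lambda: {"initial": 0, "medial": 0, "final": 0, "total": 0})
    for signs in texts:
        for i, sign in enumerate(signs):
            if i == 0:
                slot = "initial"
            elif i == len(signs) - 1:
                slot = "final"
            else:
                slot = "medial"
            profile[sign][slot] += 1
            profile[sign]["total"] += 1
    return dict(profile)

for sign, counts in sorted(positional_profile(inscriptions).items()):
    print(sign, counts)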

    English in Kiribati: a historical, linguistic and sociophonetic report on a Micronesian variety

    The 33 islands of Kiribati are situated in Micronesia, in the middle of the Pacific. Contact between islanders and Europeans only began towards the end of the 18th century and has never been intense. No immediately discernible changes were introduced when the islands were eventually claimed by the British, and the English language was hardly ever heard. After the Second World War, decolonisation was worked towards and considerably more attention was paid to education, particularly English-language education, but progress was slow. Kiribati became independent in 1979, and English became an official language towards which most islanders have positive attitudes. Moreover, instrumental motivations are commonplace: many want to learn it in order to secure local employment, to participate in international study or labour mobility programs, or to safeguard against a future made uncertain by climate change, which is making life on Kiribati more and more difficult. This dissertation is the first sociolinguistic report of its kind on English in Kiribati. It consists of three main parts: firstly, a historical account of how English arrived and spread; secondly, a detailed description of features of phonetics and phonology, grammar and syntax, lexis and pragmatics, as well as of language use and linguistic attitudes; and thirdly, a sociophonetic analysis of alveolar plosives. These investigations reveal that issues in the educational system persist and English proficiency levels remain low, that there is a high degree of substrate influence and parallels to other learner varieties, and that affrication establishes a new contrast between the alveolar plosive phonemes.
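    One common way to quantify affrication of alveolar plosives in a sociophonetic analysis is the spectral centre of gravity of the release or frication portion, since heavily affricated releases tend to carry more high-frequency energy. The sketch below is a generic illustration of that measure, not necessarily the method used in this dissertation; the audio is synthetic noise standing in for a real token.

# Sketch: spectral centre of gravity (Hz) of a plosive release, a measure often
# used to separate affricated from plain alveolar plosive realisations.
import numpy as np

def centre_of_gravity(samples, sample_rate):
    """Amplitude-weighted mean frequency of a windowed stretch of the release phase."""
    windowed = samples * np.hanning(len(samples))
    spectrum = np.abs(np.fft.rfft(windowed))
    freqs = np.fft.rfftfreq(len(samples), d=1.0 / sample_rate)
    return float(np.sum(freqs * spectrum) / np.sum(spectrum))

# Synthetic example: white noise standing in for a ~30 ms /t/ release at 44.1 kHz.
rng = np.random.default_rng(0)
release = rng.standard_normal(int(0.030 * 44100))
print(round(centre_of_gravity(release, 44100), 1), "Hz")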

    A Silent-Speech Interface using Electro-Optical Stomatography

    Speech technology is a major and growing industry that enriches the lives of technologically minded people in a number of ways. Many potential users are, however, excluded: namely, all speakers who cannot easily, or at all, produce speech. Silent-Speech Interfaces offer a way to communicate with a machine through a convenient speech-driven interface without the need for acoustic speech. They can also potentially provide a full replacement voice by synthesizing the intended utterances that are only silently articulated by the user. To that end, the speech movements need to be captured and mapped to either text or acoustic speech. This dissertation proposes a new Silent-Speech Interface based on a newly developed measurement technology called Electro-Optical Stomatography and a novel parametric vocal tract model that facilitates real-time speech synthesis from the measured data. The hardware was used to conduct command word recognition studies that reached and surpassed state-of-the-art intra- and inter-individual performance. Furthermore, a study in which the hardware was used to control the vocal tract model in a direct articulation-to-speech synthesis loop was also completed. While the intelligibility of the synthesized vowels was rated very high, the intelligibility of consonants and connected speech was quite poor. Promising ways to improve the system are discussed in the outlook.
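    As a generic illustration of command word recognition from articulatory sensor data (not the recognizer actually used in this dissertation), the sketch below classifies an utterance, represented as a sequence of sensor frames, by dynamic time warping against one stored template per command word. The sensor channels, templates and command words are invented placeholders.

# Sketch: DTW-based nearest-template classification of command words from
# sequences of articulatory sensor frames.
import numpy as np

def dtw_distance(a, b):
    """Dynamic time warping distance between two frame sequences of shape [T, D]."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return cost[n, m]

def recognize(utterance, templates):
    """Return the label of the stored template closest to the utterance."""
    return min(templates, key=lambda label: dtw_distance(utterance, templates[label]))

# Hypothetical templates: one recording per command word, 20 frames of 8 sensor channels.
rng = np.random.default_rng(1)
templates = {word: rng.standard_normal((20, 8)) for word in ["start", "stop", "left"]}
test = templates["stop"] + 0.1 * rng.standard_normal((20, 8))
print(recognize(test, templates))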