102 research outputs found

    Automatic Transcription of Northern Prinmi Oral Art: Approaches and Challenges to Automatic Speech Recognition for Language Documentation

    Get PDF
    One significant issue facing language documentation efforts is the transcription bottleneck: each documented recording must be transcribed and annotated, and these tasks are extremely labor intensive (Ćavar et al., 2016). Researchers have sought to accelerate these tasks with partial automation via forced alignment, natural language processing, and automatic speech recognition (ASR) (Neubig et al., 2020). Neural network—especially transformer-based—approaches have enabled large advances in ASR over the last decade. Models like XLSR-53 promise improved performance on under-resourced languages by leveraging massive data sets from many different languages (Conneau et al., 2020). This project extends these efforts to a novel context, applying XLSR-53 to Northern Prinmi, a Tibeto-Burman Qiangic language spoken in Southwest China (Daudey & Pincuo, 2020). Specifically, this thesis aims to answer two questions. First, is the XLSR-53 ASR model useful for first-pass transcription of oral art recordings from Northern Prinmi, an under-resourced tonal language? Second, does preprocessing target transcripts to combine grapheme clusters—multi-character representations of lexical tones and characters with modifying diacritics—into more phonologically salient units improve the model\u27s predictions? Results indicate that—with substantial adaptations—XLSR-53 will be useful for this task, and that preprocessing to combine grapheme clusters does improve model performance

    The direction of technical change in AI and the trajectory effects of government funding

    Get PDF
    Government funding of innovation can have a significant impact not only on the rate of technical change, but also on its direction. In this paper, we examine the role that government grants and government departments played in the development of artificial intelligence (AI), an emergent general purpose technology with the potential to revolutionize many aspects of the economy and society. We analyze all AI patents filed at the US Patent and Trademark Office and develop network measures that capture each patent’s influence on all possible sequences of follow-on innovation. By identifying the effect of patents on technological trajectories, we are able to account for the long-term cumulative impact of new knowledge that is not captured by standard patent citation measures. We show that patents funded by government grants, but above all patents filed by federal agencies and state departments, profoundly influenced the development of AI. These long-term effects were especially significant in early phases, and weakened over time as private incentives took over. These results are robust to alternative specifications and controlling for endogeneity

    A Computational Approach to the Phonology of Connected Speech

    Get PDF
    This thesis attempts to answer the question "How do we store and retrieve linguistic information?", and to show how this is intimately related to the question of connected speech phonology. The main discussion begins in Chapter One with a non-linguistic introduction to the problem of looking things up, and considers in particular the hashtable and its properties. The theme is developed directly in the latter part of the chapter, and further in Chapter Two, where it is proposed not only that the hashtable is the mechanism actually used by the language faculty, but also that phonology is that mechanism. Chapter Two develops in detail a radically new theory of phonology based on this hypothesis, and examines at length its ramifications. As a foundation for understanding how the phonological and the conceptual-semantic forms of utterances are related, we undertake a detailed study of the relationship between "form" and "meaning" in Chapter Three. We propose a general algorithm, which we claim is a real mechanism driving the acquisition of morphological knowledge, that can abstract and generalise these sorts of morphological relationships. We examine its computational properties, which are surprisingly favourable, and provide a detailed quasi-experimental case-study. By Chapter Four, all the theoretical necessities for describing and ex- plaining what are traditionally believed to be phonological processes operating at the level of the sentence have been introduced. The chapter is used to show how the pieces of Chapters One, Two and Three fit together to tell this story. The chapter also offers some well-motivated speculation on new lines research suggested by some of the computational results obtained throughout this work, and provides a meta-level framework for the future development of a full-scale theory of syntactic function and its acquisition

    The syntax of question particles

    Get PDF
    PhD ThesisCross-linguistically, languages are largely head initial or head final. Most permit some disharmony, but Holmberg (2000) and Biberauer, Holmberg & Roberts (2012), among others, have argued that the structure shown in (1) is ruled out, where YP is X’s complement and ZP is Y’s complement: (1) *[XP [YP Y ZP] X] In structures such as (1), a head-final phrase immediately dominates a head-initial phrase, violating the so-called ‘Final-Over-Final Constraint’ (FOFC). Descriptively however, final question particles are readily found in languages with VO order, resulting in a structure that appears to violate FOFC. (2) illustrates this violation in Tetun (an Austronesian language of East/West Timor), and (3) shows the structure, with a final question particle ka immediately dominating a head-initial TP: (2) ó la bá sekola ká? 2S not go school or (Said to child playing:) ‘Didn't you go to school?’ (Van Klinken 1999: 212) (3) iii If ka constitutes the C head of CP, as is standardly assumed, the structure in (3) violates FOFC. I show, following Aldridge (2011), that these particles are best analysed as disjunctive elements, heading an elided clause: (4) [ConjP CP [Conj CP]] The particle is the head of the phrase, with the second CP as its complement and the first (pronounced) CP in Spec,ConjP. This solves the FOFC problem because the ‘particle’ is not final, and therefore the derivation does not include a head-final phrase dominating a head-initial phrase. Instead, the particle precedes its complement (which is not pronounced), and the clause that it follows (which is pronounced) is its specifier. I provide evidence for this position through typological investigation and theoretical analysis. In addition, the various proposals that have been put forward in the literature to avoid this FOFC-violation are considered, but are shown to be problematic in different respects. I discuss the idea that particles are not heads (Biberauer, Holmberg & Roberts 2012). However, they cannot be specifiers and an adjunction analysis fails to explain their properties, so it is unclear what they could be if not heads. Julien (2001), Lee (2005, 2008) and Simpson & Wu (2002) argue that final particles are derived by TP-movement to a Topic or Focus position. This is a promising explanation, but fails to derive the difference between final particles and other types. If the particle is syncategorematic (Biberauer, Holmberg & Roberts 2012), the fact that they appear in fixed positions is mysterious. Processing explanations of the data (Hawkins 2004, Philip 2012) go some way towards deriving the FOFC facts but do not, among other things, explain the high number of final particles in VO languages. The syntax of question particles is discussed in detail, and it is proposed that polar questions consist of two functional heads in combination: Force, giving a (main clause) question illocutionary force, and Polarity, giving a (neutral) iv question open polarity. A true polar question particle is therefore related to one or both of these heads: (5) With this background, the argument is defended in subsequent chapters that some particles cannot be true question elements in this sense and are instead instantiations of the disjunction. Cross-linguistic data demonstrate that final particles in VO languages differ from other types of question particle (initial particles, or final particles in OV languages) in very rarely marking embedded questions: they do so in only one language in the corpus. Homophony between the question particle and disjunction in many languages, combined with attested grammaticalisation paths, adds support to this claim. Furthermore, this analysis explains a number of properties of such particles in addition to their propensity to violate FOFC, including their frequent absence from negative questions, alternative questions and wh-questions. All of these are straightforward consequences of the particle being a disjunction. Finally, the analysis is applied to a particular language, Thai, as a case study, and it is compared with languages of the other types. It is shown that the disjunctive analysis is best able to explain the data and offer an elegant explanation of the FOFC facts.Arts and Humanities Research Council: Faculty of Humanities and Social Sciences, the School of English Literature,Language and Linguistics: The Centre for Research in Linguistics and Language Sciences

    The syntax of question particles

    Get PDF
    Cross-linguistically, languages are largely head initial or head final. Most permit some disharmony, but Holmberg (2000) and Biberauer, Holmberg & Roberts (2012), among others, have argued that the structure shown in (1) is ruled out, where YP is X’s complement and ZP is Y’s complement: (1) *[XP [YP Y ZP] X] In structures such as (1), a head-final phrase immediately dominates a head-initial phrase, violating the so-called ‘Final-Over-Final Constraint’ (FOFC). Descriptively however, final question particles are readily found in languages with VO order, resulting in a structure that appears to violate FOFC. (2) illustrates this violation in Tetun (an Austronesian language of East/West Timor), and (3) shows the structure, with a final question particle ka immediately dominating a head-initial TP: (2) ó la bá sekola ká? 2S not go school or (Said to child playing:) ‘Didn't you go to school?’ (Van Klinken 1999: 212) (3) iii If ka constitutes the C head of CP, as is standardly assumed, the structure in (3) violates FOFC. I show, following Aldridge (2011), that these particles are best analysed as disjunctive elements, heading an elided clause: (4) [ConjP CP [Conj CP]] The particle is the head of the phrase, with the second CP as its complement and the first (pronounced) CP in Spec,ConjP. This solves the FOFC problem because the ‘particle’ is not final, and therefore the derivation does not include a head-final phrase dominating a head-initial phrase. Instead, the particle precedes its complement (which is not pronounced), and the clause that it follows (which is pronounced) is its specifier. I provide evidence for this position through typological investigation and theoretical analysis. In addition, the various proposals that have been put forward in the literature to avoid this FOFC-violation are considered, but are shown to be problematic in different respects. I discuss the idea that particles are not heads (Biberauer, Holmberg & Roberts 2012). However, they cannot be specifiers and an adjunction analysis fails to explain their properties, so it is unclear what they could be if not heads. Julien (2001), Lee (2005, 2008) and Simpson & Wu (2002) argue that final particles are derived by TP-movement to a Topic or Focus position. This is a promising explanation, but fails to derive the difference between final particles and other types. If the particle is syncategorematic (Biberauer, Holmberg & Roberts 2012), the fact that they appear in fixed positions is mysterious. Processing explanations of the data (Hawkins 2004, Philip 2012) go some way towards deriving the FOFC facts but do not, among other things, explain the high number of final particles in VO languages. The syntax of question particles is discussed in detail, and it is proposed that polar questions consist of two functional heads in combination: Force, giving a (main clause) question illocutionary force, and Polarity, giving a (neutral) iv question open polarity. A true polar question particle is therefore related to one or both of these heads: (5) With this background, the argument is defended in subsequent chapters that some particles cannot be true question elements in this sense and are instead instantiations of the disjunction. Cross-linguistic data demonstrate that final particles in VO languages differ from other types of question particle (initial particles, or final particles in OV languages) in very rarely marking embedded questions: they do so in only one language in the corpus. Homophony between the question particle and disjunction in many languages, combined with attested grammaticalisation paths, adds support to this claim. Furthermore, this analysis explains a number of properties of such particles in addition to their propensity to violate FOFC, including their frequent absence from negative questions, alternative questions and wh-questions. All of these are straightforward consequences of the particle being a disjunction. Finally, the analysis is applied to a particular language, Thai, as a case study, and it is compared with languages of the other types. It is shown that the disjunctive analysis is best able to explain the data and offer an elegant explanation of the FOFC facts

    Semi-Supervised Named Entity Recognition:\ud Learning to Recognize 100 Entity Types with Little Supervision\ud

    Get PDF
    Named Entity Recognition (NER) aims to extract and to classify rigid designators in text such as proper names, biological species, and temporal expressions. There has been growing interest in this field of research since the early 1990s. In this thesis, we document a trend moving away from handcrafted rules, and towards machine learning approaches. Still, recent machine learning approaches have a problem with annotated data availability, which is a serious shortcoming in building and maintaining large-scale NER systems. \ud \ud In this thesis, we present an NER system built with very little supervision. Human supervision is indeed limited to listing a few examples of each named entity (NE) type. First, we introduce a proof-of-concept semi-supervised system that can recognize four NE types. Then, we expand its capacities by improving key technologies, and we apply the system to an entire hierarchy comprised of 100 NE types. \ud \ud Our work makes the following contributions: the creation of a proof-of-concept semi-supervised NER system; the demonstration of an innovative noise filtering technique for generating NE lists; the validation of a strategy for learning disambiguation rules using automatically identified, unambiguous NEs; and finally, the development of an acronym detection algorithm, thus solving a rare but very difficult problem in alias resolution. \ud \ud We believe semi-supervised learning techniques are about to break new ground in the machine learning community. In this thesis, we show that limited supervision can build complete NER systems. On standard evaluation corpora, we report performances that compare to baseline supervised systems in the task of annotating NEs in texts. \u
    • …
    corecore