Search CORE

210 research outputs found

A Deep Generative Model of Vowel Formant Typology

Author: Cotterell Ryan
Eisner Jason
Publication venue
Publication date: 01/01/2018
Field of study

What makes some types of languages more probable than others? For instance, we know that almost all spoken languages contain the vowel phoneme /i/; why should that be? The field of linguistic typology seeks to answer these questions and, thereby, divine the mechanisms that underlie human language. In our work, we tackle the problem of vowel system typology, i.e., we propose a generative probability model of which vowels a language contains. In contrast to previous work, we work directly with the acoustic information -- the first two formant values -- rather than modeling discrete sets of phonemic symbols (IPA). We develop a novel generative probability model and report results based on a corpus of 233 languages.Comment: NAACL 201

arXiv.org e-Print Archive

Crossref

One-Shot Neural Cross-Lingual Transfer for Paradigm Completion

Author: Cotterell Ryan
Kann Katharina
Schütze Hinrich
Publication venue
Publication date: 01/01/2017
Field of study

We present a novel cross-lingual transfer method for paradigm completion, the task of mapping a lemma to its inflected forms, using a neural encoder-decoder model, the state of the art for the monolingual task. We use labeled data from a high-resource language to increase performance on a low-resource language. In experiments on 21 language pairs from four different language families, we obtain up to 58% higher accuracy than without transfer and show that even zero-shot and one-shot learning are possible. We further find that the degree of language relatedness strongly influences the ability to transfer morphological knowledge.Comment: Accepted at ACL 201

arXiv.org e-Print Archive

Crossref

Context-Aware Prediction of Derivational Word-forms

Author: Baldwin Timothy
Cohn Trevor
Cotterell Ryan
Vylomova Ekaterina
Publication venue
Publication date: 01/01/2017
Field of study

Derivational morphology is a fundamental and complex characteristic of language. In this paper we propose the new task of predicting the derivational form of a given base-form lemma that is appropriate for a given context. We present an encoder--decoder style neural network to produce a derived form character-by-character, based on its corresponding character-level representation of the base form and the context. We demonstrate that our model is able to generate valid context-sensitive derivations from known base forms, but is less accurate under a lexicon agnostic setting

arXiv.org e-Print Archive

Crossref

A Fast Algorithm for Computing Prefix Probabilities

Author: Cotterell Ryan
Nowak Franz
Publication venue
Publication date: 04/06/2023
Field of study

Multiple algorithms are known for efficiently calculating the prefix probability of a string under a probabilistic context-free grammar (PCFG). Good algorithms for the problem have a runtime cubic in the length of the input string. However, some proposed algorithms are suboptimal with respect to the size of the grammar. This paper proposes a novel speed-up of Jelinek and Lafferty's (1991) algorithm, which runs in

\mathcal{O}({N^3 |\mathcal{N}|^3 + |\mathcal{N}|^4})

, where

N

is the input length and

|\mathcal{N}|

is the number of non-terminals in the grammar. In contrast, our speed-up runs in

\mathcal{O}({N^2 |\mathcal{N}|^3+N^3|\mathcal{N}|^2})

.Comment: To be published in the Proceedings of ACL 202

arXiv.org e-Print Archive

Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation

Author: Cotterell Ryan
Vargas Francisco
Publication venue
Publication date: 07/10/2020
Field of study

Bolukbasi et al. (2016) presents one of the first gender bias mitigation techniques for word embeddings. Their method takes pre-trained word embeddings as input and attempts to isolate a linear subspace that captures most of the gender bias in the embeddings. As judged by an analogical evaluation task, their method virtually eliminates gender bias in the embeddings. However, an implicit and untested assumption of their method is that the bias sub-space is actually linear. In this work, we generalize their method to a kernelized, non-linear version. We take inspiration from kernel principal component analysis and derive a non-linear bias isolation technique. We discuss and overcome some of the practical drawbacks of our method for non-linear gender bias mitigation in word embeddings and analyze empirically whether the bias subspace is actually linear. Our analysis shows that gender bias is in fact well captured by a linear subspace, justifying the assumption of Bolukbasi et al. (2016)

arXiv.org e-Print Archive

Repository for Publications and Research Data