    Geometry of language and linguistic circuitry

    We illustrate the potential for a geometry of language and linguistic circuitry by rendering the syntactic structures of Lambek categorial grammar as proof nets. This empirical application sees sentences as proof nets and words as partial proof nets, and well-formedness/meaningfulness as a global harmony of categorial syntactic connection. The global cohesion coincides with a dynamic connectivity reminiscent of circuits, but whereas circuits are just generalisations of formulas, our syntactic structures are much more sublime objects: proofs. Postprint (published version)

    Unsupervised syntactic chunking with acoustic cues: Computational models for prosodic bootstrapping

    Learning to group words into phrases without supervision is a hard task for NLP systems, but infants routinely accomplish it. We hypothesize that infants use acoustic cues to prosody, which NLP systems typically ignore. To evaluate the utility of prosodic information for phrase discovery, we present an HMM-based unsupervised chunker that learns from only transcribed words and raw acoustic correlates to prosody. Unlike previous work on unsupervised parsing and chunking, we use neither gold standard part-of-speech tags nor punctuation in the input. Evaluated on the Switchboard corpus, our model outperforms several baselines that exploit either lexical or prosodic information alone, and, despite producing a flat structure, performs competitively with a state-of-the-art unsupervised lexicalized parser, with a substantial advantage in precision. Our results support the hypothesis that acoustic-prosodic cues provide useful evidence about syntactic phrases for language-learning infants. 10 page(s)

    Parsing Speech: A Neural Approach to Integrating Lexical and Acoustic-Prosodic Information

    In conversational speech, the acoustic signal provides cues that help listeners disambiguate difficult parses. For automatically parsing spoken utterances, we introduce a model that integrates transcribed text and acoustic-prosodic features using a convolutional neural network over energy and pitch trajectories coupled with an attention-based recurrent neural network that accepts text and prosodic features. We find that different types of acoustic-prosodic features are individually helpful, and together give statistically significant improvements in parse and disfluency detection F1 scores over a strong text-only baseline. For this study with known sentence boundaries, error analyses show that the main benefit of acoustic-prosodic features is in sentences with disfluencies, that attachment decisions are most improved, and that transcription errors obscure gains from prosody. Comment: Accepted in NAACL HLT 201

    Multi-Tier Annotations in the Verbmobil Corpus

    In very large and diverse scientific projects, where groups as different as linguists and engineers work with different intentions on the same signal data or its orthographic transcript and annotate new valuable information, it is not easy to build a homogeneous corpus. We describe how this can be achieved, considering the fact that some of these annotations have not been updated properly, or are based on erroneous or deliberately changed versions of the basis transcription. We used an algorithm similar to dynamic programming to detect differences between the transcription on which the annotation depends and the reference transcription for the whole corpus. These differences are automatically mapped onto a set of repair operations for the transcriptions, such as splitting compound words and merging neighbouring words. On the basis of these operations the correction process in the annotation is carried out. Whether a correction can be carried out automatically or has to be made manually always depends on the type of the annotation as well as on the position and the nature of the difference. Finally, we present an investigation in which we exploit the multi-tier annotations of the Verbmobil corpus to find out how breathing is correlated with prosodic-syntactic boundaries and dialog acts.

    Pauses and the temporal structure of speech

    Natural-sounding speech synthesis requires close control over the temporal structure of the speech flow. This includes a full predictive scheme for the durational structure, in particular the prolongation of final syllables of lexemes, as well as for the pausal structure in the utterance. In this chapter, a description of the temporal structure and a summary of the numerous factors that modify it are presented. In the second part, predictive schemes for the temporal structure of speech ("performance structures") are introduced, and their potential for characterising the overall prosodic structure of speech is demonstrated.

    Recent developments in phonology

    Hardly any other linguistic discipline has undergone as turbulent a development over the last two decades as phonology. In contrast to the classical approach of Generative Grammar, which concentrated on describing strings of phonological segments and their transformation by phonological rules, Nonlinear Phonology has placed prosodic relations in utterances in the foreground. The syllable was rediscovered as a prosodic unit; many studies were devoted to the analysis of syllable structures and sonority relations. Above the syllable, the foot and the phonological word were used as prosodic units relevant for describing stress and intonation structures. Below the syllable, the mora, already known from Classical Philology, was rehabilitated. This article describes, using examples from German and other languages, the two main approaches of Nonlinear Phonology: Autosegmental Phonology and Metrical Phonology. It aims to show that, with these models, some prosodic-phonological phenomena that previously could be described only with great difficulty, or not at all, can be analysed adequately and elegantly.