Search CORE

823 research outputs found

SKOPE: A connectionist/symbolic architecture of spoken Korean processing

Author: Lee Geunbae
Lee Jong-Hyeok
Publication venue
Publication date: 24/04/1995
Field of study

Spoken language processing requires speech and natural language integration. Moreover, spoken Korean calls for unique processing methodology due to its linguistic characteristics. This paper presents SKOPE, a connectionist/symbolic spoken Korean processing engine, which emphasizes that: 1) connectionist and symbolic techniques must be selectively applied according to their relative strength and weakness, and 2) the linguistic characteristics of Korean must be fully considered for phoneme recognition, speech and language integration, and morphological/syntactic processing. The design and implementation of SKOPE demonstrates how connectionist/symbolic hybrid architectures can be constructed for spoken agglutinative language processing. Also SKOPE presents many novel ideas for speech and language processing. The phoneme recognition, morphological analysis, and syntactic analysis experiments show that SKOPE is a viable approach for the spoken Korean processing.Comment: 8 pages, latex, use aaai.sty & aaai.bst, bibfile: nlpsp.bib, to be presented at IJCAI95 workshops on new approaches to learning for natural language processin

arXiv.org e-Print Archive

Crossref

포항공과대학교

Integrating Syntactic and Prosodic Information for the Efficient Detection of Empty Categories

Author: Batliner Anton
Feldhaus Anke
Geissler Stefan
Kiessling Andreas
Kiss Tibor
Kompe Ralf
Noeth Elmar
Publication venue
Publication date: 01/01/1996
Field of study

We describe a number of experiments that demonstrate the usefulness of prosodic information for a processing module which parses spoken utterances with a feature-based grammar employing empty categories. We show that by requiring certain prosodic properties from those positions in the input where the presence of an empty category has to be hypothesized, a derivation can be accomplished more efficiently. The approach has been implemented in the machine translation project VERBMOBIL and results in a significant reduction of the work-load for the parser.Comment: To appear in the Proceedings of Coling 1996, Copenhagen. 6 page

arXiv.org e-Print Archive

Distributed parsing with HPSG grammars

Author: Diagne Abdel Kader
Kasper Walter
Krieger Hans-Ulrich
Publication venue: Sonstige Einrichtungen. DFKI Deutsches Forschungszentrum für Künstliche Intelligenz
Publication date: 01/01/1995
Field of study

Unification-based theories of grammar allow for an integration of different levels of linguistic descriptions in the common framework of typed feature structures. Dependencies among the levels are expressed by coreferences. Though highly attractive theoretically, using such codescriptions for analysis create problems of efficiency. We present an approach to a modular use of codescriptions on the syntactic and semantic level. Grammatical analysis is performed by tightly coupled parsers running in tandem, each using only designated parts of the grammatical description. In the paper we describe the partitioning of grammatical information for the parsers and present results about the performance

Universaar

Acronym

A Survey of Word Reordering in Statistical Machine Translation: Computational Models and Language Phenomena

Author: Bisazza Arianna
Federico Marcello
Publication venue: 'MIT Press - Journals'
Publication date: 14/03/2016
Field of study

Word reordering is one of the most difficult aspects of statistical machine translation (SMT), and an important factor of its quality and efficiency. Despite the vast amount of research published to date, the interest of the community in this problem has not decreased, and no single method appears to be strongly dominant across language pairs. Instead, the choice of the optimal approach for a new translation task still seems to be mostly driven by empirical trials. To orientate the reader in this vast and complex research area, we present a comprehensive survey of word reordering viewed as a statistical modeling challenge and as a natural language phenomenon. The survey describes in detail how word reordering is modeled within different string-based and tree-based SMT frameworks and as a stand-alone task, including systematic overviews of the literature in advanced reordering modeling. We then question why some approaches are more successful than others in different language pairs. We argue that, besides measuring the amount of reordering, it is important to understand which kinds of reordering occur in a given language pair. To this end, we conduct a qualitative analysis of word reordering phenomena in a diverse sample of language pairs, based on a large collection of linguistic knowledge. Empirical results in the SMT literature are shown to support the hypothesis that a few linguistic facts can be very useful to anticipate the reordering characteristics of a language pair and to select the SMT framework that best suits them.Comment: 44 pages, to appear in Computational Linguistic

arXiv.org e-Print Archive

Crossref

Archivio della ricerca - Fondazione Bruno Kessler

UvA-DARE

International Migration, Integration and Social Cohesion online publications

Overview of the SPMRL 2013 shared task: cross-framework evaluation of parsing morphologically rich languages

Author: Candito Marie
Choi Jinho
Farkas Richard
Foster Jennifer
Goenaga Iakes
Gojenola Koldo
Goldberg Yoav
Green Spence
Habash Nizar
Kuhlmann Marco
Kübler Sandra
Maier Wolfgang
Nivre Joakim
Przepiórkowski Adam
Roth Ryan
Seddah Djamé
Seeker Wolfgang
Tsarfaty Reut
Versley Yannick
Villemonte de la Clérgerie Eric
Vincze Veronika
Wolinski Marcin
Wróblewska Alina
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 18/10/2013
Field of study

This paper reports on the first shared task on statistical parsing of morphologically rich languages (MRLs). The task features data sets from nine languages, each available both in constituency and dependency annotation. We report on the preparation of the data sets, on the proposed parsing scenarios, and on the evaluation metrics for parsing MRLs given different representation types. We present and analyze parsing results obtained by the task participants, and then provide an analysis and comparison of the parsers across languages and frameworks, reported for gold input as well as more realistic parsing scenarios

Irish Universities

DCU Online Research Access Service

Overview of the SPMRL 2013 Shared Task: A Cross-Framework Evaluation of Parsing Morphologically Rich Languages

Author: Candito Marie
Choi Jinho D.
Farkas Richárd
Foster Jennifer
Goenaga Iakes
Gojenola Galletebeitia Koldo
Goldberg Yoav
Green Spence
Habash Nizar
Kuhlmann Marco
Kübler Sandra
Maier Wolfgang
Nivre Joakim
PrzepiÓrkowski Adam
Roth Ryan
Seddah Djamé
Seeker Wolfgang
Tsarfaty Reut
Versley Yannick
Villemonte de La Clergerie Éric
Vincze Veronika
Wolińsk Marcin
WrÓblewska Alina
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 18/10/2013
Field of study

International audienceThis paper reports on the first shared task on statistical parsing of morphologically rich lan- guages (MRLs). The task features data sets from nine languages, each available both in constituency and dependency annotation. We report on the preparation of the data sets, on the proposed parsing scenarios, and on the eval- uation metrics for parsing MRLs given dif- ferent representation types. We present and analyze parsing results obtained by the task participants, and then provide an analysis and comparison of the parsers across languages and frameworks, reported for gold input as well as more realistic parsing scenarios

INRIA a CCSD electronic archive server

Hal-Diderot