Search CORE

1,173 research outputs found

A unified representation for morphological, syntactic, semantic, and referential annotations

Author: Hinrichs Erhard
Kübler Sandra
Naumann Karin
Publication venue
Publication date: 01/01/2004
Field of study

This paper reports on the SYN-RA (SYNtax-based Reference Annotation) project, an on-going project of annotating German newspaper texts with referential relations. The project has developed an inventory of anaphoric and coreference relations for German in the context of a unified, XML-based annotation scheme for combining morphological, syntactic, semantic, and anaphoric information. The paper discusses how this unified annotation scheme relates to other formats currently discussed in the literature, in particular the annotation graph model of Bird and Liberman (2001) and the pie-in-thesky scheme for semantic annotation

Crossref

Hochschulschriftenserver - Universität Frankfurt am Main

Reference resolution in multi-modal interaction: Preliminary observations

Author: Nijholt A.
Publication venue: Universidad de Pinar del Rio "Hermanos Saiz Montes de Oca"
Publication date: 01/01/2002
Field of study

In this paper we present our research on multimodal interaction in and with virtual environments. The aim of this presentation is to emphasize the necessity to spend more research on reference resolution in multimodal contexts. In multi-modal interaction the human conversational partner can apply more than one modality in conveying his or her message to the environment in which a computer detects and interprets signals from different modalities. We show some naturally arising problems but do not give general solutions. Rather we decide to perform more detailed research on reference resolution in uni-modal contexts to obtain methods generalizable to multi-modal contexts. Since we try to build applications for a Dutch audience and since hardly any research has been done on reference resolution for Dutch, we give results on the resolution of anaphoric and deictic references in Dutch texts. We hope to be able to extend these results to our multimodal contexts later

University of Twente Research Information

Computational Approach to Anaphora Resolution in Spanish Dialogues

Author: Martinez-Barco P.
Palomar M.
Publication venue: 'AI Access Foundation'
Publication date: 01/01/2001
Field of study

This paper presents an algorithm for identifying noun-phrase antecedents of pronouns and adjectival anaphors in Spanish dialogues. We believe that anaphora resolution requires numerous sources of information in order to find the correct antecedent of the anaphor. These sources can be of different kinds, e.g., linguistic information, discourse/dialogue structure information, or topic information. For this reason, our algorithm uses various different kinds of information (hybrid information). The algorithm is based on linguistic constraints and preferences and uses an anaphoric accessibility space within which the algorithm finds the noun phrase. We present some experiments related to this algorithm and this space using a corpus of 204 dialogues. The algorithm is implemented in Prolog. According to this study, 95.9% of antecedents were located in the proposed space, a precision of 81.3% was obtained for pronominal anaphora resolution, and 81.5% for adjectival anaphora

arXiv.org e-Print Archive

Repositorio Institucional de la Universidad de Alicante

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Utilizing Features of Verbs in Statistical Zero Pronoun Resolution for Japanese Speech

Author: Nagata Masaaki
Yoshida Sen
Publication venue: City University of Hong Kong
Publication date: 01/01/2009
Field of study

PACLIC 23 / City University of Hong Kong / 3-5 December 200

Waseda University Repository

What linguists always wanted to know about german and did not know how to estimate

Author: Hinrichs Erhard
Kübler Sandra
Publication venue
Publication date: 01/01/2006
Field of study

This paper profiles significant differences in syntactic distribution and differences in word class frequencies for two treebanks of spoken and written German: the TüBa-D/S, a treebank of transliterated spontaneous dialogues, and the TüBa-D/Z treebank of newspaper articles published in the German daily newspaper die tageszeitung´(taz). The approach can be used more generally as a means of distinguishing and classifying language corpora of different genres

Hochschulschriftenserver - Universität Frankfurt am Main