1,173 research outputs found
A unified representation for morphological, syntactic, semantic, and referential annotations
This paper reports on the SYN-RA (SYNtax-based Reference Annotation) project, an on-going project of annotating German newspaper texts with referential relations. The project has developed an inventory of anaphoric and coreference relations for German in the context of a unified, XML-based annotation scheme for combining morphological, syntactic, semantic, and anaphoric information. The paper discusses how this unified annotation scheme relates to other formats currently discussed in the literature, in particular the annotation graph model of Bird and Liberman (2001) and the pie-in-thesky scheme for semantic annotation
Reference resolution in multi-modal interaction: Preliminary observations
In this paper we present our research on multimodal interaction in and with virtual environments. The aim of this presentation is to emphasize the necessity to spend more research on reference resolution in multimodal contexts. In multi-modal interaction the human conversational partner can apply more than one modality in conveying his or her message to the environment in which a computer detects and interprets signals from different modalities. We show some naturally arising problems but do not give general solutions. Rather we decide to perform more detailed research on reference resolution in uni-modal contexts to obtain methods generalizable to multi-modal contexts. Since we try to build applications for a Dutch audience and since hardly any research has been done on reference resolution for Dutch, we give results on the resolution of anaphoric and deictic references in Dutch texts. We hope to be able to extend these results to our multimodal contexts later
Computational Approach to Anaphora Resolution in Spanish Dialogues
This paper presents an algorithm for identifying noun-phrase antecedents of
pronouns and adjectival anaphors in Spanish dialogues. We believe that anaphora
resolution requires numerous sources of information in order to find the
correct antecedent of the anaphor. These sources can be of different kinds,
e.g., linguistic information, discourse/dialogue structure information, or
topic information. For this reason, our algorithm uses various different kinds
of information (hybrid information). The algorithm is based on linguistic
constraints and preferences and uses an anaphoric accessibility space within
which the algorithm finds the noun phrase. We present some experiments related
to this algorithm and this space using a corpus of 204 dialogues. The algorithm
is implemented in Prolog. According to this study, 95.9% of antecedents were
located in the proposed space, a precision of 81.3% was obtained for pronominal
anaphora resolution, and 81.5% for adjectival anaphora
Utilizing Features of Verbs in Statistical Zero Pronoun Resolution for Japanese Speech
PACLIC 23 / City University of Hong Kong / 3-5 December 200
What linguists always wanted to know about german and did not know how to estimate
This paper profiles significant differences in syntactic distribution and differences in word class frequencies for two treebanks of spoken and written German: the TüBa-D/S, a treebank of transliterated spontaneous dialogues, and the TüBa-D/Z treebank of newspaper articles published in the German daily newspaper die tageszeitung´(taz). The approach can be used more generally as a means of distinguishing and classifying language corpora of different genres
- …