55 research outputs found
Annotation of temporal information in French texts.
Processing temporal information is crucial for understanding natural-language texts. The TimeML specification language was designed to support the identification and normalization of temporal expressions and events in texts written in English. The aim of the various TimeML projects has been to formulate an annotation scheme that can be applied to free text, such as that found on the Web. Efforts have been made to apply TimeML to languages other than English, notably Chinese, Korean, Italian, Spanish and German. For French, there have been efforts in this direction, but they remain somewhat scattered. In this article, we detail our ongoing work, which aims to develop comprehensive resources for annotating French texts according to TimeML, notably an annotation guide, a reference corpus (Gold Standard) and automatic annotation modules.
Animation Motion in NarrativeML
This paper describes qualitative spatial representations relevant to cartoon motion incorporated into NarrativeML, an annotation scheme intended to capture some of the core aspects of narrative. These representations are motivated by linguistic distinctions drawn from cross-linguistic studies. Motion is modeled in terms of transitions in spatial configurations, using an expressive dynamic logic with the manner and path of motion being derived from a few basic primitives. The manner is elaborated to represent properties of motion that bear on character affect. Such representations can potentially be used to support cartoon narrative summarization and question-answering. The paper discusses annotation challenges, and the use of computer vision to help in annotation. Work is underway on annotating a cartoon corpus in terms of this scheme.
A Pattern-mining Driven Study on Differences of Newspapers in Expressing Temporal Information
This paper studies how different types of newspapers differ in expressing
temporal information, a topic that has received little attention. Techniques
from the fields of temporal processing and pattern mining
are employed to investigate this topic. First, a corpus annotated with temporal
information is created by the author. Then, sequences of temporal information
tags mixed with part-of-speech tags are extracted from the corpus. The TKS
algorithm is used to mine skip-gram patterns from the sequences. With these
patterns, the signatures of the four newspapers are obtained. In order to make
the signatures uniquely characterize the newspapers, we revise the signatures
by removing reference patterns. By examining the number of patterns in the
signatures and revised signatures, the proportion of patterns containing
temporal information tags, and the specific patterns containing those tags,
we find that the newspapers differ in how they express temporal information.
Comment: 19 pages
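The pipeline described in this abstract (tag sequences, skip-gram patterns, per-newspaper signatures, revised signatures) can be illustrated with a minimal Python sketch. It is not the TKS algorithm: a brute-force support counter stands in for it, and the function names, the `max_span` window and the reading of "reference patterns" as patterns mined from a separate reference collection are all assumptions made for illustration, not details taken from the paper.

```python
from collections import Counter
from itertools import combinations


def skip_patterns(sequence, length, max_span):
    """Return the set of ordered tag patterns of `length` items that fit
    inside a window of `max_span` consecutive positions (gaps allowed)."""
    found = set()
    for start in range(len(sequence)):
        # Tags that may follow sequence[start] within the window.
        window = sequence[start + 1:start + max_span]
        for rest in combinations(window, length - 1):
            found.add((sequence[start],) + rest)
    return found


def top_k_patterns(sequences, k, length=2, max_span=3):
    """Brute-force stand-in for a top-k miner: rank patterns by the number
    of sequences that contain them (their support)."""
    support = Counter()
    for seq in sequences:
        support.update(skip_patterns(seq, length, max_span))
    # Ties are broken alphabetically so the result is deterministic.
    ranked = sorted(support.items(), key=lambda item: (-item[1], item[0]))
    return [pattern for pattern, _ in ranked[:k]]


def revised_signature(signature, reference_patterns):
    """Drop patterns that also occur in the (assumed) reference set."""
    reference = set(reference_patterns)
    return [pattern for pattern in signature if pattern not in reference]


if __name__ == "__main__":
    # Hypothetical tag sequences mixing a temporal tag with POS tags.
    newspaper = [["TIMEX3", "NN", "VBD"], ["TIMEX3", "DT", "NN"]]
    reference = [["DT", "NN", "VBD"], ["DT", "NN", "IN"]]
    signature = top_k_patterns(newspaper, k=5)
    print(revised_signature(signature, top_k_patterns(reference, k=5)))
```

In this sketch a signature is just the top-k pattern list for one newspaper, and the revised signature keeps only the patterns absent from the reference list; a real study would mine the patterns with an actual top-k sequential pattern miner.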
USFD at KBP 2011: Entity Linking, Slot Filling and Temporal Bounding
This paper describes the University of Sheffield's entry in the 2011 TAC KBP
entity linking and slot filling tasks. We chose to participate in the
monolingual entity linking task, the monolingual slot filling task and the
temporal slot filling tasks. We set out to build a framework for
experimentation with knowledge base population. This framework was created, and
applied to multiple KBP tasks. We demonstrated that our proposed framework is
effective and suitable for collaborative development efforts, as well as useful
in a teaching environment. Finally, we present results that, while very modest,
improve on our 2010 attempt by an order of magnitude.
Comment: Proc. Text Analysis Conference (2011)