
    A constraint-based approach to noun phrase coreference resolution in German newspaper text

    In this paper, we investigate a wide range of features for their usefulness in the resolution of nominal coreference, both as hard constraints (i.e., completely removing elements from the list of candidate antecedents) and as soft constraints (where an accumulation of soft-constraint violations makes it less likely that a candidate is chosen as the antecedent). We present a state-of-the-art system based on such constraints, with weights estimated by a maximum entropy model, using lexical information to resolve cases of coreferent bridging.
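    The hard/soft constraint distinction described above can be sketched as follows. This is a minimal illustration, not the paper's system: the constraint functions, feature names, and weights are hypothetical, and in the paper the weights would come from a maximum entropy model rather than being set by hand.

    ```python
    # Hypothetical sketch: hard constraints filter candidates outright,
    # soft constraints accumulate weighted penalties, and the candidate
    # with the lowest total penalty is chosen as the antecedent.

    def score_candidate(mention, candidate, soft_constraints, weights):
        """Sum the weights of all violated soft constraints (lower is better)."""
        penalty = 0.0
        for name, violated in soft_constraints.items():
            if violated(mention, candidate):
                penalty += weights[name]
        return penalty

    def resolve(mention, candidates, hard_constraints, soft_constraints, weights):
        """Pick an antecedent for `mention` from `candidates`, or None."""
        # Hard constraints completely remove candidates from the list.
        viable = [c for c in candidates
                  if not any(h(mention, c) for h in hard_constraints)]
        if not viable:
            return None
        # Among survivors, prefer the fewest weighted soft violations.
        return min(viable,
                   key=lambda c: score_candidate(mention, c,
                                                 soft_constraints, weights))
    ```

    For example, with a hard number-agreement constraint and a soft gender-agreement constraint, a plural candidate is discarded entirely, while a gender-mismatched one merely accrues a penalty.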

    Improving coreference resolution by using conversational metadata

    In this paper, we propose the use of metadata contained in documents to improve coreference resolution. Specifically, we quantify the impact of speaker and turn information on the performance of our coreference system, and show that the metadata can be effectively encoded as features of a statistical resolution system, which leads to a statistically significant improvement in performance.
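    Encoding metadata as classifier features, as the abstract describes, might look like the sketch below. The mention fields and feature names here are illustrative assumptions, not the paper's actual feature set.

    ```python
    # Illustrative sketch: speaker and turn metadata encoded as features
    # for a mention-pair coreference classifier. Field names ("speaker",
    # "turn") and feature names are hypothetical.

    def metadata_features(anaphor, antecedent):
        """Return conversational-metadata features for one mention pair."""
        return {
            # Mentions by the same speaker are more likely to corefer
            # (e.g. repeated first-person pronouns).
            "same_speaker": anaphor["speaker"] == antecedent["speaker"],
            # How many turns apart the two mentions occur.
            "turn_distance": abs(anaphor["turn"] - antecedent["turn"]),
            "same_turn": anaphor["turn"] == antecedent["turn"],
        }
    ```

    These booleans and distances would then be fed to a statistical resolver alongside the usual lexical and syntactic features.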

    Crime Analysis Using Self Learning

    We propose an unsupervised algorithm for event extraction. It takes a small number of seed examples and a corpus of text documents as input. We are interested in relationships that may span the entire length of a document: the goal is to extract relations among mentions that lie in different sentences. These relations can be binary, ternary, or even quaternary; in this paper, our algorithm concentrates on extracting a specific binary relation from a tagged data set. We use coreference resolution to solve the relation extraction problem. Earlier approaches corefer identity relations, whereas our approach corefers independent mention pairs based on feature rules. We propose a coreference resolution approach that uses the EM (Expectation Maximization) algorithm to train on the data and relate entities across sentences.

    Dynamic Entity Representations in Neural Language Models

    Understanding a long document requires tracking how entities are introduced and evolve over time. We present a new type of language model, EntityNLM, that can explicitly model entities, dynamically update their representations, and contextually generate their mentions. Our model is generative and flexible; it can model an arbitrary number of entities in context while generating each entity mention at an arbitrary length. In addition, it can be used for several different tasks such as language modeling, coreference resolution, and entity prediction. Experimental results on all these tasks demonstrate that our model consistently outperforms strong baselines and prior work. Comment: EMNLP 2017 camera-ready version.
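    The core idea of dynamically updating an entity's representation at each mention can be sketched as a gated interpolation between the stored entity vector and the current context vector. This is a simplified illustration under assumed notation, not EntityNLM's exact parameterization (the paper learns the gate and operates on neural hidden states).

    ```python
    # Minimal sketch of a dynamic entity-state update: when an entity is
    # mentioned, its stored vector is interpolated toward the current
    # context vector and renormalized to unit length. The fixed `gate`
    # argument stands in for a learned update gate (an assumption here).
    import math

    def update_entity(entity_vec, context_vec, gate):
        """Blend entity_vec toward context_vec; gate in [0, 1] controls
        how strongly the new mention updates the entity state."""
        mixed = [(1.0 - gate) * e + gate * h
                 for e, h in zip(entity_vec, context_vec)]
        # Keep entity representations on the unit sphere.
        norm = math.sqrt(sum(x * x for x in mixed)) or 1.0
        return [x / norm for x in mixed]
    ```

    Repeating this update at every mention is what lets the model's picture of an entity evolve as the document unfolds.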

    Coreference Resolution for French Oral Data: Machine Learning Experiments with ANCOR

    We present CROC (Coreference Resolution for Oral Corpus), the first machine learning system for coreference resolution in French. One specific aspect of the system is that it has been trained on data that come exclusively from transcribed speech, namely ANCOR (ANaphora and Coreference in ORal corpus), the first large-scale French corpus with anaphoric relation annotations. In its current state, the CROC system requires pre-annotated mentions. We detail the features used for the learning algorithms, and we present a set of experiments with these features. The scores we obtain are close to those of state-of-the-art systems for written English.