Search CORE

6 research outputs found

A Dutch coreference resolution system with an evaluation on literary fiction

Author: Cranenburgh van, Andreas
Publication venue
Publication date: 18/12/2019
Field of study

Coreference resolution is the task of identifying descriptions that refer to the same entity. In this paper we consider the task of entity coreference resolution for Dutch with a particular focus on literary texts. We make three main contributions. First, we propose a simplified annotation scheme to reduce annotation effort. This scheme is used for the annotation of a corpus of 107k tokens from 21 contemporary works of literature. Second, we present a rule-based coreference resolution system for Dutch based on the Stanford deterministic multi-sieve coreference architecture and heuristic rules for quote attribution. Our system (dutchcoref) forms a simple but strong baseline and improves on previous systems in shared task evaluations. Finally, we perform an evaluation and error analysis on literary texts which highlights difficult cases of coreference in general, and the literary domain in particular. The code of our system is made available at https://github.com/andreasvc/dutchcoref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

A Dutch coreference resolution system with an evaluation on literary fiction

Author: Cranenburgh van, Andreas
Publication venue
Publication date: 18/12/2019
Field of study

University of Groningen

Visualization, Search, and Error Analysis for Coreference Annotations

Author
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2014
Field of study

Crossref

Digitale Infrastrukturen für die germanistische Forschung

Author
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 21/11/2022
Field of study

Modern research in linguistics is increasingly reliant on digital infrastructure and information systems. This development began at the turn of the millennium and has since accelerated. The volume examines national and European infrastructure networks and the range of language resources in German linguistics that can be discovered, disclosed, and re-applied through digital infrastructure

Directory of Open Access Books (DOAB)

Digitale Infrastrukturen für die germanistische Forschung

Author
Publication venue: 'Walter de Gruyter GmbH'
Publication date
Field of study

OAPEN Library