
    Reliability measurement without limits

    In computational linguistics, a reliability measurement of 0.8 on some statistic such as κ is widely thought to guarantee that hand-coded data are fit for purpose, with lower values suspect. We demonstrate that the main use of such data, machine learning, can tolerate data with low reliability as long as any disagreement among human coders looks like random noise. When it does not, however, data can have a reliability of more than 0.8 and still be unsuitable for use: the disagreement may indicate erroneous patterns that machine learning can learn, and evaluation against test data that contain these same erroneous patterns may lead us to draw wrong conclusions about our machine-learning algorithms. Furthermore, lower reliability values still held as acceptable by many researchers, between 0.67 and 0.8, may even yield inflated performance figures in some circumstances. Although this is a common-sense result, it has implications for how we work that are likely to reach beyond the machine-learning applications we discuss. At the very least, computational linguists should look for any patterns in the disagreement among coders and assess what impact they will have.
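    The κ statistic discussed above compares observed agreement between coders against the agreement expected by chance. As a rough illustration, not drawn from the paper itself, here is a minimal sketch of Cohen's κ in Python; the label data is hypothetical:

    ```python
    from collections import Counter

    def cohens_kappa(coder_a, coder_b):
        """Cohen's kappa: (p_o - p_e) / (1 - p_e), where p_o is the observed
        agreement rate and p_e is the chance agreement rate estimated from
        each coder's own label marginals."""
        n = len(coder_a)
        p_o = sum(a == b for a, b in zip(coder_a, coder_b)) / n
        freq_a, freq_b = Counter(coder_a), Counter(coder_b)
        labels = set(coder_a) | set(coder_b)
        p_e = sum((freq_a[l] / n) * (freq_b[l] / n) for l in labels)
        return (p_o - p_e) / (1 - p_e)

    # Hypothetical annotations from two coders over ten items:
    # they agree on 8/10, but much of that agreement is expected by chance.
    a = ["pos", "pos", "neg", "neg", "pos", "neg", "pos", "pos", "neg", "pos"]
    b = ["pos", "pos", "neg", "pos", "pos", "neg", "pos", "neg", "neg", "pos"]
    print(round(cohens_kappa(a, b), 3))  # raw agreement 0.8, kappa 0.583
    ```

    Note how the raw agreement of 0.8 shrinks once chance agreement (p_e = 0.52 here) is discounted, which is exactly why a chance-corrected coefficient, rather than raw agreement, is the conventional reliability figure.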

    Exploiting `Subjective' Annotations

    Many interesting phenomena in conversation can only be annotated as a subjective task, requiring interpretative judgements from annotators. This leads to data annotated with lower levels of agreement, not only due to errors in the annotation but also due to differences in how annotators interpret conversations. This paper constitutes an attempt to find out how subjective annotations with a low level of agreement can profitably be used for machine-learning purposes. We analyse the (dis)agreements between annotators for two different cases in a multimodal annotated corpus and explicitly relate the results to the way machine-learning algorithms perform on the annotated data. Finally, we present two new concepts, namely `subjective entity' classifiers and `consensus objective' classifiers, and give recommendations for using subjective data in machine-learning applications.

    The use of discourse data in language use research


    Comparando anotações linguísticas na Gramateca: filosofia, ferramentas e exemplos

    This paper presents the general philosophy of Gramateca, an environment for building a corpus-based grammar of Portuguese, by reporting on three studies within it – on conditional connectives, terms for the human body, and emotion language – with emphasis on methodology. It presents in detail the Rêve system for revising and sharing linguistic annotations of the underlying corpora. While describing the different studies, we also report on the successive improvements the Rêve tool has undergone, and discuss the kinds of questions and results already obtainable in very diverse areas.

    Inter-Coder Agreement for Computational Linguistics

    This article is a survey of methods for measuring agreement among corpus annotators. It exposes the mathematics and underlying assumptions of agreement coefficients, covering Krippendorff's alpha as well as Scott's pi and Cohen's kappa; discusses the use of coefficients in several annotation tasks; and argues that weighted, alpha-like coefficients, traditionally less used than kappa-like measures in computational linguistics, may be more appropriate for many corpus annotation tasks—but that their use makes the interpretation of the value of the coefficient even harder.
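    The coefficients surveyed above differ mainly in how they estimate chance agreement: Scott's π pools the label marginals of both coders, while Cohen's κ uses each coder's own marginals. A hedged sketch of that contrast, on hypothetical two-coder nominal data (not an example from the survey):

    ```python
    from collections import Counter

    def observed_agreement(a, b):
        """Fraction of items on which the two coders assign the same label."""
        return sum(x == y for x, y in zip(a, b)) / len(a)

    def scotts_pi(a, b):
        """Scott's pi: chance agreement from marginals pooled over both coders."""
        n = len(a)
        pooled = Counter(a) + Counter(b)
        p_e = sum((c / (2 * n)) ** 2 for c in pooled.values())
        p_o = observed_agreement(a, b)
        return (p_o - p_e) / (1 - p_e)

    def cohens_kappa(a, b):
        """Cohen's kappa: chance agreement from each coder's own marginals."""
        n = len(a)
        fa, fb = Counter(a), Counter(b)
        p_e = sum((fa[l] / n) * (fb[l] / n) for l in set(a) | set(b))
        p_o = observed_agreement(a, b)
        return (p_o - p_e) / (1 - p_e)

    # Hypothetical labels where the coders have skewed and unequal marginals,
    # so the two chance-agreement estimates (and hence the coefficients) differ.
    x = ["y", "y", "y", "y", "n", "n", "y", "y", "y", "n"]
    y = ["y", "y", "n", "y", "n", "y", "y", "y", "y", "y"]
    print(round(scotts_pi(x, y), 3), round(cohens_kappa(x, y), 3))
    ```

    On this data both coefficients start from the same observed agreement (0.7) but π is slightly lower than κ, because pooling the marginals yields a higher chance-agreement estimate; for two coders and nominal data, Krippendorff's α coincides with π up to a small-sample correction.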