Search CORE

6 research outputs found

A Retrospective Analysis of the Fake News Challenge Stance Detection Task

Author: Caspelherr Felix
Chaudhuri Debanjan
Gurevych Iryna
Hanselowski Andreas
Meyer Christian M.
PVS Avinesh
Schiller Benjamin
Publication venue
Publication date: 13/06/2018
Field of study

The 2017 Fake News Challenge Stage 1 (FNC-1) shared task addressed a stance classification task as a crucial first step towards detecting fake news. To date, there is no in-depth analysis paper to critically discuss FNC-1's experimental setup, reproduce the results, and draw conclusions for next-generation stance classification methods. In this paper, we provide such an in-depth analysis for the three top-performing systems. We first find that FNC-1's proposed evaluation metric favors the majority class, which can be easily classified, and thus overestimates the true discriminative power of the methods. Therefore, we propose a new F1-based metric yielding a changed system ranking. Next, we compare the features and architectures used, which leads to a novel feature-rich stacked LSTM model that performs on par with the best systems, but is superior in predicting minority classes. To understand the methods' ability to generalize, we derive a new dataset and perform both in-domain and cross-domain experiments. Our qualitative and quantitative study helps interpreting the original FNC-1 scores and understand which features help improving performance and why. Our new dataset and all source code used during the reproduction study are publicly available for future research

arXiv.org e-Print Archive

TUbiblio

A Corpus Factory for many languages

Author: Adam Kilgarriff
Avinesh Pvs
Jan Pomikálek
Siva Reddy
Publication venue
Publication date: 01/01/2010
Field of study

For many languages there are no large, general-language corpora available. Until the web, all but the richest institutions could do little but shake their heads in dismay as corpus-building was long, slow and expensive. But with the advent of the Web it can be highly automated and thereby fast and inexpensive. We have developed a `corpus factory ' where we build large corpora. In this paper we describe the method we use, and how it has worked, and how various problems wer

CiteSeerX

ThamizhiMorph: A morphological parser for the Tamil language

Author: A Shanmugadas
AH Arden
B Premjith
C Allauzen
E Annamalai
GU Pope
HF Schiffman
K Graul
K Paramasivam
K Sarveswaran
K Sarveswaran
KR Beesley
L Krister
L Lisker
M Butt
M Kumar
M Kumar
MA Nuhman
P Bhattacharyya
PJ Antony
PVS Avinesh
R Caldwell
R Crouch
S Agesthialingom
S Rajaram
Senavaraiyar
SN Thesikar
T Bögel
T Lehmann
T Lehmann
T Mokanarangan
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref