Search CORE

20 research outputs found

SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020)

Author: Atanasova Pepa
Coltekin Cagri
Derczynski Leon
Karadzhov Georgi
Mubarak Hamdy
Nakov Preslav
Pitenis Zeses
Rosenthal Sara
Zampieri Marcos
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 17/07/2020
Field of study

We present the results and main findings of SemEval-2020 Task 12 on Multilingual Offensive Language Identification in Social Media (OffensEval 2020). The task involves three subtasks corresponding to the hierarchical taxonomy of the OLID schema (Zampieri et al., 2019a) from OffensEval 2019. The task featured five languages: English, Arabic, Danish, Greek, and Turkish for Subtask A. In addition, English also featured Subtasks B and C. OffensEval 2020 was one of the most popular tasks at SemEval-2020 attracting a large number of participants across all subtasks and also across all languages. A total of 528 teams signed up to participate in the task, 145 teams submitted systems during the evaluation period, and 70 submitted system description papers.Comment: Proceedings of the International Workshop on Semantic Evaluation (SemEval-2020

arXiv.org e-Print Archive

The IT University of Copenhagen's Repository

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Recommended from our members

"Cause" and affect : evaluative and emotive parameters of meaning among the periphrastic causative verb in English

Author: Childers Zachary Witter
Publication venue
Publication date: 17/05/2017
Field of study

This dissertation investigates the so-called periphrastic causative verbs in English – verbs such as cause, make, have, force, and let – and distinguishes them with respect to their selectional behavior and inferential properties. I suggest that these verbs are primarily differentiated in terms of the evaluative and affective dispositions of participants in the speech act and the caused eventuality. The empirical basis for this claim incorporates corpora as well as experimental elicitation and judgment tasks. Based on these findings, it is proposed that the selection of periphrastic causative verb in the expression of a directive causative event is governed by the evaluative stance of the patient of the causative verb. I argue that the English verb cause in particular is less general than has previously been assumed, that it has at least two different senses, and that its primary sense is restricted to cases of negative speaker sentiment.Linguistic

Texas ScholarWorks

Tune your brown clustering, please

Author: Bøgh K.S.
Chester S.
Derczynski L.
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2015
Field of study

Brown clustering, an unsupervised hierarchical clustering technique based on ngram mutual information, has proven useful in many NLP applications. However, most uses of Brown clustering employ the same default configuration; the appropriateness of this configuration has gone predominantly unexplored. Accordingly, we present information for practitioners on the behaviour of Brown clustering in order to assist hyper-parametre tuning, in the form of a theoretical model of Brown clustering utility. This model is then evaluated empirically in two sequence labelling tasks over two text types. We explore the dynamic between the input corpus size, chosen number of classes, and quality of the resulting clusters, which has an impact for any approach using Brown clustering. In every scenario that we examine, our results reveal that the values most commonly used for the clustering are sub-optimal

White Rose Research Online

Mining for Parsing Failures

Author: de Kok Daniël
van Noord Gerardus
Publication venue: College Publications
Publication date: 01/01/2017
Field of study

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Low-Resource Unsupervised NMT:Diagnosing the Problem and Providing a Linguistically Motivated Solution

Author: Edman Lukas
Noord van, Gertjan
Toral Ruiz Antonio
Publication venue
Publication date: 01/01/2020
Field of study

ARTS repository - University of Groningen

Low-Resource Unsupervised NMT:Diagnosing the Problem and Providing a Linguistically Motivated Solution

Author: Edman Lukas
Noord van, Gertjan
Toral Ruiz Antonio
Publication venue
Publication date: 01/01/2020
Field of study

Unsupervised Machine Translation hasbeen advancing our ability to translatewithout parallel data, but state-of-the-artmethods assume an abundance of mono-lingual data. This paper investigates thescenario where monolingual data is lim-ited as well, finding that current unsuper-vised methods suffer in performance un-der this stricter setting. We find that theperformance loss originates from the poorquality of the pretrained monolingual em-beddings, and we propose using linguis-tic information in the embedding train-ing scheme. To support this, we look attwo linguistic features that may help im-prove alignment quality: dependency in-formation and sub-word information. Us-ing dependency-based embeddings resultsin a complementary word representationwhich offers a boost in performance ofaround 1.5 BLEU points compared to stan-dardWORD2VECwhen monolingual datais limited to 1 million sentences per lan-guage. We also find that the inclusion ofsub-word information is crucial to improv-ing the quality of the embedding

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen