Search CORE

102 research outputs found

Grammatical error correction using hybrid systems and type filtering

Author: Andersen ØE
Felice M
Kochmar E
Yannakoudakis H
Yuan Z
Publication venue: CoNLL 2014 - 18th Conference on Computational Natural Language Learning, Proceedings of the Shared Task
Publication date: 01/01/2014
Field of study

This paper describes our submission to the CoNLL 2014 shared task on grammatical error correction using a hybrid approach, which includes both a rule-based and an SMT system augmented by a large webbased language model. Furthermore, we demonstrate that correction type estimation can be used to remove unnecessary corrections, improving precision without harming recall. Our best hybrid system achieves state of-the-art results, ranking first on the original test set and second on the test set with alternative annotations.[We would like to thank] Cambridge English Language Assessment, a division of Cambridge Assessment, for supporting this research

CiteSeerX

Crossref

Apollo (Cambridge)

Evaluating Word Embeddings in Multi-label Classification Using Fine-grained Name Typing

Author: Kann Katharina
Schütze Hinrich
Yaghoobzadeh Yadollah
Publication venue
Publication date: 01/01/2018
Field of study

Embedding models typically associate each word with a single real-valued vector, representing its different properties. Evaluation methods, therefore, need to analyze the accuracy and completeness of these properties in embeddings. This requires fine-grained analysis of embedding subspaces. Multi-label classification is an appropriate way to do so. We propose a new evaluation method for word embeddings based on multi-label classification given a word embedding. The task we use is fine-grained name typing: given a large corpus, find all types that a name can refer to based on the name embedding. Given the scale of entities in knowledge bases, we can build datasets for this task that are complementary to the current embedding evaluation datasets in: they are very large, contain fine-grained classes, and allow the direct evaluation of embeddings without confounding factors like sentence contextComment: 6 pages, The 3rd Workshop on Representation Learning for NLP (RepL4NLP @ ACL2018

arXiv.org e-Print Archive

Crossref

Argumentation Mining in User-Generated Web Discourse

Author: Gurevych Iryna
Habernal Ivan
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2015
Field of study

The goal of argumentation mining, an evolving research field in computational linguistics, is to design methods capable of analyzing people's argumentation. In this article, we go beyond the state of the art in several ways. (i) We deal with actual Web data and take up the challenges given by the variety of registers, multiple domains, and unrestricted noisy user-generated Web discourse. (ii) We bridge the gap between normative argumentation theories and argumentation phenomena encountered in actual data by adapting an argumentation model tested in an extensive annotation study. (iii) We create a new gold standard corpus (90k tokens in 340 documents) and experiment with several machine learning methods to identify argument components. We offer the data, source codes, and annotation guidelines to the community under free licenses. Our findings show that argumentation mining in user-generated Web discourse is a feasible but challenging task.Comment: Cite as: Habernal, I. & Gurevych, I. (2017). Argumentation Mining in User-Generated Web Discourse. Computational Linguistics 43(1), pp. 125-17

arXiv.org e-Print Archive

TUbiblio

Crossref

Directory of Open Access Journals

TUdatalib Repository (TU Darmstadt)

Ask, and shall you receive?: Understanding Desire Fulfillment in Natural Language Text

Author: Chaturvedi Snigdha
Daume III Hal
Goldwasser Dan
Publication venue
Publication date: 30/11/2015
Field of study

The ability to comprehend wishes or desires and their fulfillment is important to Natural Language Understanding. This paper introduces the task of identifying if a desire expressed by a subject in a given short piece of text was fulfilled. We propose various unstructured and structured models that capture fulfillment cues such as the subject's emotional state and actions. Our experiments with two different datasets demonstrate the importance of understanding the narrative and discourse structure to address this task

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Adapting state-of-the-art deep language models to clinical information extraction systems: Potentials, challenges, and solutions

Author: Gedeon Tom
Suominen Hanna
Zhou Liyuan
Publication venue: 'JMIR Publications Inc.'
Publication date: 25/04/2019
Field of study

University of Canberra Research Repository