Search CORE

11,109 research outputs found

Unsupervised improvement of named entity extraction in short informal context using disambiguation clues

Author: Habib Mena B.
Keulen Maurice van
Publication venue: CEUR-WS.org
Publication date: 01/01/2012
Field of study

Short context messages (like tweets and SMS’s) are a potentially rich source of continuously and instantly updated information. Shortness and informality of such messages are challenges for Natural Language Processing tasks. Most efforts done in this direction rely on machine learning techniques which are expensive in terms of data collection and training. In this paper we present an unsupervised Semantic Web-driven approach to improve the extraction process by using clues from the disambiguation process. For extraction we used a simple Knowledge-Base matching technique combined with a clustering-based approach for disambiguation. Experimental results on a self-collected set of tweets (as an example of short context messages) show improvement in extraction results when using unsupervised feedback from the disambiguation process

CiteSeerX

Maastricht University Research Portal

University of Twente Research Information

Synapse at CAp 2017 NER challenge: Fasttext CRF

Author: Alexandra J. Weisberg (4234153)
Briana S. Bullington (4234156)
Eric R. Moore (4234150)
Jeff Chang (228277)
Kimberly H. Halsey (208487)
Yuan Jiang (296541)
Publication venue
Publication date: 01/01/2017
Field of study

We present our system for the CAp 2017 NER challenge which is about named entity recognition on French tweets. Our system leverages unsupervised learning on a larger dataset of French tweets to learn features feeding a CRF model. It was ranked first without using any gazetteer or structured external data, with an F-measure of 58.89\%. To the best of our knowledge, it is the first system to use fasttext embeddings (which include subword representations) and an embedding-based sentence representation for NER

arXiv.org e-Print Archive

Directory of Open Access Journals

FigShare