Search CORE

3,625 research outputs found

Fine-grained Dutch named entity recognition

Author: Desmet Bart
Hoste Veronique
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

This paper describes the creation of a fine-grained named entity annotation scheme and corpus for Dutch, and experiments on automatic main type and subtype named entity recognition. We give an overview of existing named entity annotation schemes, and motivate our own, which describes six main types (persons, organizations, locations, products, events and miscellaneous named entities) and finer-grained information on subtypes and metonymic usage. This was applied to a one-million-word subset of the Dutch SoNaR reference corpus. The classifier for main type named entities achieves a micro-averaged F-score of 84.91 %, and is publicly available, along with the corpus and annotations

Ghent University Academic Bibliography

Synapse at CAp 2017 NER challenge: Fasttext CRF

Author: Alexandra J. Weisberg (4234153)
Briana S. Bullington (4234156)
Eric R. Moore (4234150)
Jeff Chang (228277)
Kimberly H. Halsey (208487)
Yuan Jiang (296541)
Publication venue
Publication date: 01/01/2017
Field of study

We present our system for the CAp 2017 NER challenge which is about named entity recognition on French tweets. Our system leverages unsupervised learning on a larger dataset of French tweets to learn features feeding a CRF model. It was ranked first without using any gazetteer or structured external data, with an F-measure of 58.89\%. To the best of our knowledge, it is the first system to use fasttext embeddings (which include subword representations) and an embedding-based sentence representation for NER

arXiv.org e-Print Archive

Directory of Open Access Journals

FigShare