NeuralREG: An end-to-end approach to referring expression generation

Author: Ferreira Thiago Castro
Krahmer Emiel
Kádár Ákos
Moussallem Diego
Wubben Sander
Publication date: 01/01/2018

Traditionally, Referring Expression Generation (REG) models first decide on the form and then on the content of references to discourse entities in text, typically relying on features such as salience and grammatical function. In this paper, we present a new approach (NeuralREG), relying on deep neural networks, which makes decisions about form and content in one go without explicit feature extraction. Using a delexicalized version of the WebNLG corpus, we show that the neural model substantially improves over two strong baselines. Data and models are publicly available. Comment: Accepted for presentation at ACL 201

arXiv.org e-Print Archive

Crossref

Tilburg University Repository

An automatically built named entity lexicon for Arabic

Author: Attia Mohammed
Monachini Monica
Toral Antonio
Tounsi Lamia
van Genabith Josef
Publication venue: European Language Resources Association
Publication date: 01/01/2010

We have successfully adapted and extended the automatic Multilingual, Interoperable Named Entity Lexicon approach to Arabic, using Arabic WordNet (AWN) and Arabic Wikipedia (AWK). First, we extract AWN’s instantiable nouns and identify the corresponding categories and hyponym subcategories in AWK. Then, we exploit Wikipedia inter-lingual links to locate correspondences between articles in ten different languages in order to identify Named Entities (NEs). We apply keyword search on AWK abstracts to provide for Arabic articles that do not have a correspondence in any of the other languages. In addition, we perform a post-processing step to fetch further NEs from AWK not reachable through AWN. Finally, we investigate diacritization using matching with geonames databases, MADA-TOKAN tools and different heuristics for restoring vowel marks of Arabic NEs. Using this methodology, we have extracted approximately 45,000 Arabic NEs and built, to the best of our knowledge, the largest, most mature and well-structured Arabic NE lexical resource to date. We have stored and organised this lexicon following the Lexical Markup Framework (LMF) ISO standard. We conduct a quantitative and qualitative evaluation of the lexicon against a manually annotated gold standard and achieve precision scores from 95.83% (with 66.13% recall) to 99.31% (with 61.45% recall) according to different values of a threshold.

CiteSeerX

Irish Universities

DCU Online Research Access Service

Information extraction

Author: Hoede C.
Zhang Lei
Publication venue: University of Twente, Department of Applied Mathematics
Publication date: 01/01/2002

In this paper we present a new approach to extracting relevant information from natural language text by means of knowledge graphs. We give a multi-level model based on knowledge graphs for describing template information, and investigate the concept of partial structural parsing. Moreover, we point out that expansion of concepts plays an important role in thinking, so we study the expansion of knowledge graphs to use context information for reasoning and merging of templates.

University of Twente Research Information

Artequakt: Generating tailored biographies from automatically annotated fragments from the web

Author: Alani Harith
Hall Wendy
Kim Sanghee
Lewis Paul
Millard David
Shadbolt Nigel
Weal Mark
Publication date: 01/01/2002

The Artequakt project seeks to automatically generate narrative biographies of artists from knowledge that has been extracted from the Web and maintained in a knowledge base. An overview of the system architecture is presented here and the three key components of that architecture are explained in detail, namely knowledge extraction, information management and biography construction. Conclusions are drawn from the initial experiences of the project and future progress is detailed.

Southampton (e-Prints Soton)

Open Research Online (The Open University)