
2,705 research outputs found

NERD: Evaluating Named Entity Recognition Tools in the Web of Data

Author: Rizzo G.
Troncy R.
Publication date: 01/01/2011

EURECOM Repository

PORTO Publications Open Repository TOrino

Journal Name Extraction from Japanese Scientific News Articles

Author: collins
manning
nadeau
nakano
riloff
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 11/06/2019

In Japanese scientific news articles, research results are described clearly, but the articles' sources tend to go uncited. This makes it difficult for readers to find the details of the research. In this paper, we address the task of extracting journal names from Japanese scientific news articles. We hypothesize that a journal name is likely to occur in a specific context. To support the hypothesis, we construct a character-based method that uses only the left and right context features of journal names, and we extract journal names with it. The extraction results suggest that the distribution hypothesis plays an important role in identifying journal names. Comment: The Asia-Pacific Signal and Information Processing Association Annual Summit and Conference 2018 (APSIPA ASC 2018)
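The abstract above mentions left and right context features at the character level. A minimal sketch of what such features could look like follows; the function name `context_features`, the `window` parameter, the feature keys, and the example sentence are all assumptions made for illustration, not the authors' implementation.

```python
def context_features(text, start, end, window=3):
    """Return character features around the candidate span text[start:end].

    Hypothetical helper: keys L1, L2, ... count leftwards from the span,
    keys R1, R2, ... count rightwards from it.
    """
    left = text[max(0, start - window):start]
    right = text[end:end + window]
    features = {}
    # Characters to the left, nearest first.
    for i, ch in enumerate(reversed(left), start=1):
        features[f"L{i}"] = ch
    # Characters to the right, nearest first.
    for i, ch in enumerate(right, start=1):
        features[f"R{i}"] = ch
    return features


# Illustrative sentence (an assumption, not taken from the paper's data).
sentence = "研究成果は科学誌「Nature」に掲載された。"
start = sentence.index("Nature")
end = start + len("Nature")
print(context_features(sentence, start, end, window=2))
```

Features of this kind could then be fed to any classifier that decides whether a candidate span is a journal name; the abstract does not say which one is used.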

arXiv.org e-Print Archive

Crossref

Efficient deep processing of Japanese

Author: Bender Emily M.
Siegel Melanie
Publication date: 01/01/2002

We present a broad-coverage Japanese grammar written in the HPSG formalism with MRS semantics. The grammar is created for use in real-world applications, so robustness and performance issues play an important role. It is connected to a POS tagging and word segmentation tool. This grammar is being developed in a multilingual context, requiring MRS structures that are easily comparable across languages.

arXiv.org e-Print Archive

Crossref

Publications at Bielefeld University

Hochschulschriftenserver - Universität Frankfurt am Main

Indonesian Named-entity Recognition for 15 Classes Using Ensemble Supervised Learning

Author: Purwarianti Ayu
Wibawa Aditya Satrya
Publication venue: The Author(s). Published by Elsevier B.V.
Publication date: 31/12/2016

Here, we describe our effort to build an Indonesian Named Entity Recognition (NER) system for newspaper articles with 15 classes, a larger number of class types than existing Indonesian NER systems use. We employed supervised machine learning and conducted experiments to find the attribute combination and the algorithm with the highest accuracy. We compared attributes at the word level, sentence level, and document level. For the algorithm, we compared several single machine learning algorithms and an ensemble. Using 457 news articles, the best accuracy was achieved by the ensemble technique, in which the results of several machine learning algorithms were used as features for one machine learning algorithm.
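The ensemble described in the abstract above, where the outputs of several learners become the features of one further learner, is commonly called stacking. A minimal sketch under strong assumptions follows: the two rule-based base taggers, the lookup-table meta-learner, the function names, and the toy data are all hypothetical stand-ins, since the abstract does not name the algorithms used.

```python
from collections import Counter, defaultdict


def by_capital(token):
    """Hypothetical base tagger: capitalized tokens are entities."""
    return "ENT" if token[:1].isupper() else "O"


def by_gazetteer(token):
    """Hypothetical base tagger: look the token up in a tiny place list."""
    return "LOC" if token in {"Jakarta", "Bandung"} else "O"


def train_meta(base_models, tokens, labels):
    """Fit a lookup-table meta-learner on the base models' predictions."""
    votes = defaultdict(Counter)
    for token, label in zip(tokens, labels):
        # The tuple of base predictions is the meta-learner's feature vector.
        key = tuple(model(token) for model in base_models)
        votes[key][label] += 1
    table = {key: counts.most_common(1)[0][0] for key, counts in votes.items()}
    fallback = Counter(labels).most_common(1)[0][0]

    def predict(token):
        key = tuple(model(token) for model in base_models)
        # Unseen combinations of base predictions fall back to a frequent label.
        return table.get(key, fallback)

    return predict


# Toy training data (an assumption, not from the paper's 457 articles).
tokens = ["Jakarta", "Ayu", "pergi", "Bandung", "ke"]
labels = ["LOC", "PER", "O", "LOC", "O"]
predict = train_meta([by_capital, by_gazetteer], tokens, labels)
print(predict("Jakarta"), predict("Ayu"), predict("dan"))
```

With trained base learners, the meta-learner would normally be fitted on held-out predictions to avoid leaking training labels; the fixed rules here sidestep that step.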

Elsevier - Publisher Connector

New approach for Arabic named entity recognition on social media based on feature selection using genetic algorithm

Author: Benali Brahim Ait
El Bazi Ismail
Laachfoubi Nabil
Mihi Soukaina
Publication venue: Institute of Advanced Engineering and Science
Publication date: 01/04/2021

Many features can be extracted from the massive volume of data of different types now available on social media. The growing demand for multimedia applications was an essential factor in this regard, particularly in the case of text data. Often, using the full feature set for each of these activities can be time-consuming and can also negatively impact performance. Because the number of features is large, it is challenging to find a subset that is useful for a given task. In this paper, we employ a feature selection approach based on a genetic algorithm to identify the optimized feature set. The best combination from the optimal feature set is then used to identify and classify Arabic named entities (NEs) based on support vector. Experimental results show that our system reaches state-of-the-art performance for Arabic NER on social media and significantly outperforms previous systems.
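Feature selection with a genetic algorithm, as mentioned in the abstract above, can be sketched as a search over bit masks in which each bit switches one feature on or off. The sketch below is a generic textbook variant (truncation selection, single-point crossover, bit-flip mutation, elitism), not the authors' configuration. The function names, the parameters, and the toy fitness are assumptions; in a real system the fitness would presumably be the validation score of the NER classifier trained on the selected features.

```python
import random


def select_features(n_features, fitness, pop_size=20, generations=30,
                    mutation_rate=0.1, seed=0):
    """Search for a 0/1 feature mask with high fitness (needs n_features >= 2)."""
    rng = random.Random(seed)
    population = [[rng.randint(0, 1) for _ in range(n_features)]
                  for _ in range(pop_size)]
    for _ in range(generations):
        ranked = sorted(population, key=fitness, reverse=True)
        parents = ranked[:pop_size // 2]        # truncation selection
        children = [ranked[0][:]]               # elitism: keep the best mask
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n_features)  # single-point crossover
            child = a[:cut] + b[cut:]
            # Bit-flip mutation, applied independently to every position.
            child = [bit ^ 1 if rng.random() < mutation_rate else bit
                     for bit in child]
            children.append(child)
        population = children
    return max(population, key=fitness)


USEFUL = {0, 3, 5}  # hypothetical set of informative feature indices


def toy_fitness(mask):
    """Reward useful features and penalize the rest (stand-in for a real score)."""
    gain = sum(bit for i, bit in enumerate(mask) if i in USEFUL)
    cost = sum(bit for i, bit in enumerate(mask) if i not in USEFUL)
    return gain - 0.5 * cost


best = select_features(8, toy_fitness)
print(best, toy_fitness(best))
```

The selected mask would then decide which feature columns are passed to the downstream classifier.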

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Institute of Advanced Engineering and Science