Named Entity Recognizer for Telugu language using Hybrid approach

Author: Dr. M. Humera Khanam, Miss. P. Sindhu Sree
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 31/01/2016

The main goal of Named Entity Recognition (NER) is to classify all Named Entities (NE) in a document into predefined classes such as Person name, Location name, Organization name, and Miscellaneous. This paper outlines a Named Entity Recognizer that uses a hybrid approach, i.e., a combination of a rule-based approach and a machine learning technique, Conditional Random Fields (CRF). For the rule-based approach, we have prepared gazetteer lists of person, location, and organization names, some suffix and prefix features, and a dictionary of 350,266 words to recognize the category of named entities. If ambiguity arises while using the rule-based approach, we use the machine learning technique, i.e., CRF, to improve the accuracy.
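The hybrid flow described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the gazetteer entries, the suffix list, and the `fallback` callable (standing in for a trained CRF) are all hypothetical, and the choice to label unmatched tokens `O` is an assumption.

```python
from typing import Callable, Dict, List, Set, Tuple

# Hypothetical, romanized sample data; the paper's actual lists are not
# given in the abstract. "tirupati" is deliberately listed twice so that
# the rules disagree on it.
GAZETTEERS: Dict[str, Set[str]] = {
    "PERSON": {"sindhu", "tirupati"},
    "LOCATION": {"hyderabad", "tirupati"},
    "ORGANIZATION": {"infosys"},
}
# Illustrative suffix features for place names (an assumption).
LOCATION_SUFFIXES: Tuple[str, ...] = ("puram", "palli", "nagar")

Fallback = Callable[[List[str], int], str]


def rule_labels(token: str) -> Set[str]:
    """Collect every label that the gazetteers or suffix rules propose."""
    word = token.lower()
    labels = {label for label, names in GAZETTEERS.items() if word in names}
    if word.endswith(LOCATION_SUFFIXES):
        labels.add("LOCATION")
    return labels


def tag(tokens: List[str], fallback: Fallback) -> List[Tuple[str, str]]:
    """Use the rules when they agree; defer to the fallback when they conflict."""
    tagged = []
    for i, token in enumerate(tokens):
        labels = rule_labels(token)
        if not labels:
            tagged.append((token, "O"))  # assumption: no rule fired, not an entity
        elif len(labels) == 1:
            tagged.append((token, next(iter(labels))))
        else:
            # Ambiguous case: a statistical model (CRF in the paper) decides.
            tagged.append((token, fallback(tokens, i)))
    return tagged
```

Passing the statistical model in as a callable keeps the sketch free of any particular CRF library; in practice `fallback` would wrap a trained sequence model.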

International Journal on Recent and Innovation Trends in Computing and Communication

Hybrid Approach to English-Hindi Name Entity Transliteration

Author: Mathur Shruti, Saxena Varun Prakash
Publication date: 28/03/2014

Machine translation (MT) research in Indian languages is still in its infancy. Not much work has been done on proper transliteration of named entities in this domain. In this paper we address this issue. We used the English-Hindi language pair for our experiments and followed a hybrid approach. First, we processed English words using a rule-based approach that extracts individual phonemes from the words; then we applied a statistical approach that converts each English phoneme into its equivalent Hindi phoneme and, in turn, the corresponding Hindi word. Through this approach we attained 83.40% accuracy. Comment: Proceedings of IEEE Students' Conference on Electrical, Electronics and Computer Sciences 201
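The two stages described here (rule-based splitting into phoneme-like units, then a statistical choice of the Hindi equivalent) might look something like the sketch below. The unit inventory, the candidate table, and every probability in it are invented for illustration; the abstract does not describe the authors' actual rules or model.

```python
from typing import Dict, List

# Hypothetical inventory of English units, longest first so "sh" beats "s".
UNITS: List[str] = sorted(["sh", "aa", "r", "a", "m", "k"], key=len, reverse=True)

# Hypothetical candidate table: P(Hindi unit | English unit).
CANDIDATES: Dict[str, Dict[str, float]] = {
    "r": {"र": 1.0},
    "m": {"म": 0.9, "ं": 0.1},
    "k": {"क": 1.0},
    "a": {"ा": 0.6, "अ": 0.4},
    "aa": {"ा": 0.8, "आ": 0.2},
    "sh": {"श": 0.7, "ष": 0.3},
}


def segment(word: str) -> List[str]:
    """Rule-based step: greedy longest-match split into known units."""
    word = word.lower()
    units, i = [], 0
    while i < len(word):
        # Fall back to the single character when no unit matches.
        match = next((u for u in UNITS if word.startswith(u, i)), word[i])
        units.append(match)
        i += len(match)
    return units


def transliterate(word: str) -> str:
    """Statistical step: pick the most probable Hindi unit for each English unit."""
    out = []
    for unit in segment(word):
        options = CANDIDATES.get(unit)
        # Unknown units are passed through unchanged.
        out.append(max(options, key=options.get) if options else unit)
    return "".join(out)
```

Choosing each unit independently is the simplest possible statistical step; a real system would normally score whole sequences so that context (for example, a vowel at the start of a word versus after a consonant) can change the choice.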

arXiv.org e-Print Archive

Crossref

Application of semantic web technologies for automatic multimedia annotation

Author: Mannens Erik, Poppe Chris, Van de Walle Rik, Van Deursen Davy, Verborgh Ruben
Publication venue: Future Technology Research Association International (FTRA)
Publication date: 01/01/2010

Ghent University Academic Bibliography

Rule Based Transliteration Scheme for English to Punjabi

Author: Bhalla Deepti, Joshi Nisheeth, Mathur Iti
Publication date: 01/04/2013

Machine transliteration has emerged as an important research area within machine translation. Transliteration aims to preserve the phonological structure of words. Proper transliteration of named entities plays a significant role in improving the quality of machine translation. In this paper we perform machine transliteration for the English-Punjabi language pair using a rule-based approach. We have constructed rules for syllabification, the process of separating a word into its syllables. We calculate probabilities for named entities (proper names and locations). For words that are not named entities, separate probabilities are calculated by relative frequency using the statistical machine translation toolkit MOSES. Using these probabilities, we transliterate the input text from English to Punjabi.
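As a rough illustration of the two ingredients named in this abstract, the sketch below pairs a toy syllabification rule with a relative-frequency estimate, P(target | source) = count(source, target) / count(source). The single regular-expression rule is an assumption made for the example and is not one of the authors' rules; the probability function is a plain Python stand-in, not a call into MOSES.

```python
import re
from collections import Counter
from typing import Dict, List, Tuple

# Toy rule (an assumption): a syllable is optional consonants followed by
# vowels; consonants left at the end of the word join the last syllable.
_SYLLABLE = re.compile(r"[^aeiou]*[aeiou]+(?:[^aeiou]+$)?")


def syllabify(word: str) -> List[str]:
    """Split a romanized word into syllable-like chunks."""
    word = word.lower()
    # A word without vowels is returned as a single chunk.
    return _SYLLABLE.findall(word) or [word]


def relative_frequencies(
    pairs: List[Tuple[str, str]]
) -> Dict[Tuple[str, str], float]:
    """Estimate P(target | source) from aligned (source, target) syllable pairs."""
    pair_counts = Counter(pairs)
    source_counts = Counter(source for source, _ in pairs)
    return {
        (source, target): count / source_counts[source]
        for (source, target), count in pair_counts.items()
    }
```

Given aligned syllable pairs from a training list, the resulting table can be used to pick the most frequent Punjabi rendering for each English syllable.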

arXiv.org e-Print Archive

CogPrints Cognitive Sciences Eprint Archive