144 research outputs found

Concept Mining and Inner Relationship Discovery from Text

Authors: Jiayu Zhou; Shi Wang
Publication venue: IntechOpen
Publication date: 01/02/2010

An Efficient Information Extraction Mechanism with Page Ranking and a Classification Strategy based on Similarity Learning of Web Text Documents

Authors: Koti B. Raja; Kumar G V S Raj; Kumar K. Naveen; Thota Sunil Kumar
Publication venue: Auricle Global Society of Education and Research
Publication date: 20/09/2023

Users have recently gained more access to information thanks to the growth of the World Wide Web. In this setting, search engines have become an essential tool for finding information in a large space. Handling this wealth of knowledge grows more difficult every day. Although search engines are crucial for information gathering, many of the results they return are not what the user needs, because they are ranked according to string matches with the user's query. As a result, there are semantic disparities between the terms used in the user's query and the importance assigned to key phrases in the results. The problem of grouping relevant information into categories of related topics has not been solved. This work proposes a ranking-based similarity learning approach and an SVM-based classification framework for web text that estimates the semantic similarity between words in order to improve information extraction. The experimental results suggest an improvement, with more relevant results retrieved.

International Journal on Recent and Innovation Trends in Computing and Communication

Word vs. Class-Based Word Sense Disambiguation

Authors: Izquierdo Beviá Rubén; Rigau Claramunt German; Suárez Cueto Armando
Publication venue: AI Access Foundation
Publication date: 01/01/2015

As empirically demonstrated by the Word Sense Disambiguation (WSD) tasks of the last SensEval/SemEval exercises, assigning the appropriate meaning to words in context has resisted all attempts to be successfully addressed. Many authors argue that one possible reason could be the use of inappropriate sets of word meanings. In particular, WordNet has been used as a de facto standard repository of word meanings in most of these tasks. Thus, instead of using the word senses defined in WordNet, some approaches have derived semantic classes representing groups of word senses. However, the meanings represented by WordNet have only been used for WSD at a very fine-grained sense level or at a very coarse-grained semantic class level (also called SuperSenses). We suspect that an appropriate level of abstraction could lie in between both levels. The contributions of this paper are manifold. First, we propose a simple method to automatically derive semantic classes at intermediate levels of abstraction covering all nominal and verbal WordNet meanings. Second, we empirically demonstrate that our automatically derived semantic classes outperform classical approaches based on word senses and more coarse-grained sense groupings. Third, we also demonstrate that our supervised WSD system benefits from using these new semantic classes as additional semantic features while reducing the number of training examples. Finally, we also demonstrate the robustness of our supervised semantic class-based WSD system when tested on an out-of-domain corpus. This work has been partially supported by the NewsReader project (ICT-2011-316404), the Spanish project SKaTer (TIN2012-38584-C06-02)

Repositorio Institucional de la Universidad de Alicante

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Learning to Build a Semantic Thesaurus from Free Text Corpora without External Help

Author: Katia Lida Kermanidis
Publication venue: IntechOpen
Publication date: 01/01/2009

IntechOpen

News Text Classification Based on an Improved Convolutional Neural Network

Authors: Dan Chang; Wenjing Tao
Publication venue: Mechanical Engineering Faculty in Slavonski Brod
Publication date: 01/01/2019

With the explosive growth in Internet news media and the disorganized status of news texts, this paper puts forward an automatic classification model for news based on a Convolutional Neural Network (CNN). In the model, Word2vec is first merged with Latent Dirichlet Allocation (LDA) to generate an effective text feature representation. Then, when an attention mechanism is combined with the proposed model, higher attention probability values are given to key features to achieve an accurate judgment. The results show that the precision rate, the recall rate and the F1 value of the model in this paper reach 96.4%, 95.9% and 96.2% respectively, which indicates that the improved CNN, through a unique framework, can extract deep semantic features of the text and provide strong support for establishing an efficient and accurate news text classification model.

HRČAK - Portal of Croatian Scientific and Professional Journals

Hrčak - Portal of scientific journals of Croatia

Extracting and Visualizing Semantic Relationships from Chinese Biomedical Text

Authors: Meng Yao; Miao Qingliang; Yu Hao; Zhang Bo; Zhang Shu
Publication venue: Faculty of Computer Science, Universitas Indonesia
Publication date: 01/01/2012

Waseda University Repository

Cross-language Ontology Learning: Incorporating and Exploiting Cross-language Data in the Ontology Learning Process

Author: Hjelm Hans
Publication date: 01/01/2009

Hans Hjelm. Cross-language Ontology Learning: Incorporating and Exploiting Cross-language Data in the Ontology Learning Process. NEALT Monograph Series, Vol. 1 (2009), 159 pages. © 2009 Hans Hjelm. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/10126

Publikationer från Stockholms universitet

Digitala Vetenskapliga Arkivet - Academic Archive On-line

DSpace at Tartu University Library

Quran Ontology: Review On Recent Development And Open Research Issues

Authors: Azmi Mohd Sanusi; Suryana Nanna; Utomo Fandy Setyo
Publication venue: JATIT & LLS
Publication date: 01/01/2018

The Quran is the holy book of Muslims and contains the commandments and words of Allah. The Quran provides instructions and guidance to humankind in achieving happiness in life in the world and the hereafter. As a holy book, the Quran contains rich knowledge and scientific facts. However, humans have difficulty understanding the Quran's content, because the meaning of the searched message content depends on interpretation. An ontology is able to store the knowledge representation of the Holy Quran. This paper studies recent ontology research on the Holy Quran. We investigate the current trends and technologies being applied. The investigation covers several aspects, such as the outcomes of previous studies, the languages used in ontology development, the coverage area of Quran ontologies, datasets, tools for ontology development, ontology population techniques, approaches used to integrate the knowledge of the Quran and other resources into an ontology, ontology testing techniques, and the limitations of previous research. This review has identified four major issues involved in Quran ontology, i.e. the availability of Quran ontologies in various translations, ontology resources, the automated process of meronymy relationship extraction, and instance classification. The review of existing studies will give future researchers broad and useful background knowledge on the primary and essential aspects of this research field.

Universiti Teknikal Malaysia Melaka (UTeM) Repository