Search CORE

6 research outputs found

Importance of Similarity Measures in Effective Web Information Retrieval

Author: Shagun Giridhar, Kanika Bhutani
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 30/08/2018
Field of study

Information Retrieval (IR) manages recovering and showing data inside the WWW and online databases and furthermore looks through the web reports The quick development of site pages accessible on the Internet as of late, seeking applicable and coming data has turned into a pivotal issue. Data recovery is a standout amongst the most essential segments in web crawlers and their improvement would greatly affect enhancing the looking productivity because of dynamic nature of web it turns out to be much hard to discover applicable and late data. That is the reason an ever increasing number of individuals began to utilize centered crawler to get correct data in their uncommon fields today. The information retrieval field mainly deals with the grouping of similar documents to retrieve required information to the user from huge amount of data. The researchers proposed different types of similarity measures and models in information retrieval to determine the similarity between the texts and for document clustering. This research intends the study of genetic algorithm based information retrieval using similarity measures like cosine coefficient, jaccard coefficient, dice coefficient

International Journal on Recent and Innovation Trends in Computing and Communication

Evaluating the Performance of Similarity Measures in Effective Web Information Retrieval

Author: Rajesh Kr. Tejwani Mohit Mishra, Amit Kumar
Publication venue: Auricle Global Society of Education and Research
Publication date: 31/08/2016
Field of study

International Journal on Future Revolution in Computer Science & Communication Engineering

Web search engine based semantic similarity measure between words using pattern retrieval algorithm

Author: Patnaik L.M.
Pushpa C.N.
Thriveni J.
Venugopal K.R.
Publication venue: 'Academy and Industry Research Collaboration Center (AIRCC)'
Publication date: 18/02/2013
Field of study

Semantic Similarity measures plays an important role in information retrieval, natural language processing and various tasks on web such as relation extraction, community mining, document clustering, and automatic meta-data extraction. In this paper, we have proposed a Pattern Retrieval Algorithm [PRA] to compute the semantic similarity measure between the words by combining both page count method and web snippets method. Four association measures are used to find semantic similarity between words in page count method using web search engines. We use a Sequential Minimal Optimization (SMO) support vector machines (SVM) to find the optimal combination of page counts-based similarity scores and top-ranking patterns from the web snippets method. The SVM is trained to classify synonymous word-pairs and nonsynonymous word-pairs. The proposed approach aims to improve the Correlation values, Precision, Recall, and F-measures, compared to the existing methods. The proposed algorithm outperforms by 89.8% of correlation value

ePrints@Bangalore University

A Survey on Important Aspects of Information Retrieval

Author: Gupta Y.
Saini A.
Saxena A.K.
Publication venue: UTeM Press Website
Publication date: 31/12/2013
Field of study

Information retrieval has become an important field of study and research under computer science due to the explosive growth of information available in the form of full text, hypertext, administrative text, directory, numeric or bibliographic text. The research work is going on various aspects of information retrieval systems so as to improve its efficiency and reliability. This paper presents a comprehensive survey discussing not only the emergence and evolution of information retrieval but also include different information retrieval models and some important aspects such as document representation, similarity measure and query expansion

Universiti Teknikal Malaysia Melaka: UTeM Open Journal System