
3 research outputs found

Measuring Generality of Documents

Authors: Dennis Mcleod, Eduard Hovy, Hyun Woong Shin, Larry Pryor
Publication date: 02/04/2008

Most traditional Information Retrieval (IR) systems, including web search engines, operationalize “relevance” as the frequency with which a set of keywords occurs in a document. Because of this limitation, traditional IR systems frequently retrieve irrelevant documents in response to a user’s request. In this paper, we propose a new criterion, “generality,” that provides an additional basis on which to rank retrieved documents. Generality is a level of abstraction, so results can be retrieved at the level of generality appropriate for a user’s knowledge and interests. We compared the output of our generality quantification algorithm with human judges’ weightings and found that the two are significantly correlated.

CiteSeerX

A NEW CRITERION FOR MEASURING GENERALITY OF DOCUMENTS

Authors: Eduard Hovy, Hyun Woong Shin
Publication date: 03/04/2008

Most information retrieval systems, including Web search engines, use similarity ranking algorithms based on a vector space model to find relevant information in response to a user’s request. However, the retrieved information is frequently irrelevant, because most current systems rely on index terms or other techniques that are variants of term frequency. In this paper, we propose a new criterion, “generality,” that provides an additional basis on which to rank retrieved documents. We compared the output of our generality quantification algorithm with human judges’ weightings and found that the two are significantly correlated.

CiteSeerX

Generality: A New Criterion for Measuring Generality of Documents

Authors: Dennis Mcleod, Eduard Hovy, Hyun Woong Shin, Larry Pryor
Publication date: 15/03/2012

Most information retrieval systems, including Web search engines, use similarity ranking algorithms based on a vector space model to find relevant information in response to a user’s request. However, the retrieved information is frequently irrelevant, because most current systems rely on index terms or other techniques that are variants of term frequency. In this paper, we propose a new criterion, “generality,” that provides an additional basis on which to rank retrieved documents. We compared the output of our generality quantification algorithm with human judges’ weightings and found that the two are significantly correlated.

CiteSeerX