Article thumbnail
Location of Repository

proximity on query expansion

By  and Olga VechtomovaYing Wang and Olga Vechtomova

Abstract

Query expansion terms are often used to enhance original query formulations in document retrieval. Such terms are usually selected from the entire documents or from windows or passages surrounding query term occurrences. Arguably, the semantic relatedness between terms weakens with the increase in the distance separating them. In this paper we report a study that was conducted to systematically evaluate different distance functions for selecting query expansion terms. We propose a distance factor that can be effectively combined with the statistical term association measure of mutual information for selecting query expansion terms. Evaluation of the TREC collection shows that distanceweighted mutual information is more effective than mutual information alone in selecting terms for query expansion

Topics: information retrieval, query expansion, term proximity, word collocation, mutual information
Year: 2005
OAI identifier: oai:CiteSeerX.psu:10.1.1.409.6094
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • http://www.cc.gatech.edu/~zha/... (external link)
  • http://www.cc.gatech.edu/~zha/... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.