Skip to main content
Article thumbnail
Location of Repository

Twenty-One at TREC-7: Ad-hoc and Cross-Language Track

By Djoerd Hiemstra and Wessel Kraaij

Abstract

This paper describes the o cial runs of the Twenty-One group for TREC-7. The Twenty-One group participated in the ad-hoc and the cross-language track and made the following accomplishments: We developed a new weighting algorithm, which outperforms the popular Cornell version of BM25 on the ad-hoc collection. For the CLIR task we developed a fuzzy matching algorithm to recover from missing translations and spelling variants of proper names. Also for CLIR we investigated translation strategies that make extensive use of information from our dictionaries by identifying preferred translations, main translations and synonym translations, by de ning weights of possible translations and by experimenting with probabilistic boolean matching strategies

Year: 1999
OAI identifier: oai:CiteSeerX.psu:10.1.1.134.8427
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • http://ciir.cs.umass.edu/ircha... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.