11 research outputs found

    Using Wikipedia and Wiktionary in Domain-Specific Information Retrieval

    No full text
    The main objective of our experiments in the domain-specific track at CLEF 2008 is utilizing semantic knowledge from collaborative knowledge bases such as Wikipedia and Wiktionary to improve the effectiveness of information retrieval. While Wikipedia has already been used in IR, the application of Wiktionary in this task is new. We evaluate two retrieval models, i.e. SR-Text and SR-Word, based on semantic relatedness by comparing their performance to a statistical model as implemented by Lucene. When Lucene is combined with the semantic models the mean average precision increases by 14 % for German, 9 % for English, and 16 % for Russian. In the bilingual task, we translate the English topics into the document language, i.e. German, by using machine translation. For SR-Text, we alternatively perform the translation process by using cross-language links in Wikipedia, whereby the terms are directly mapped to concept vectors in the target language. The evaluation shows that the latter approach especially improves the retrieval performance in cases where the machine translation system incorrectly translates query terms. When Lucene is combined with SR-Text, the mean average precision increases by 34%

    Evaluation of IR Strategies for Polish

    No full text

    CLOSED WINDOWS, OPEN DOORS: GEOPOLITICS and POST-1949 MAINLAND CHINESE IMMIGRATION TO CANADA

    No full text
    corecore