Search CORE

2 research outputs found

Document re-ranking using cluster validation and label propagation

Author: Donghong Ji
Guodong Zhou
Guozheng Xiao
Lingpeng Yang
Yu Nie
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2006
Field of study

This paper proposes a novel document re-ranking approach in information retrieval, which is done by a label propagation-based semi-supervised learning algorithm to utilize the intrinsic structure underlying in the large document data. Since no labeled relevant or irrelevant documents are generally available in IR, our approach tries to extract some pseudo labeled documents from the ranking list of the initial retrieval. For pseudo relevant documents, we determine a cluster of documents from the top ones via cluster validation-based k-means clustering; for pseudo irrelevant ones, we pick a set of documents from the bottom ones. Then the ranking of the documents can be conducted via label propagation. Evaluation on benchmark corpora shows that the approach can achieve significant improvement over standard baselines and performs better than other related approaches

CiteSeerX

Crossref

Chinese information retrieval based on terms and relevant terms

Author: Bear J.
Bouman A. C.
Chien L. F.
Dash M.
Ji D. H.
Ji Donghong
Kamps J.
Kishida K.
Kwok K. L.
Li P.
Luk R. W. P.
Mitra M.
Nie J. Y.
Niu Z. Y.
Niu Zhengyu
Palmer D.
Qu Y. L.
Robertson S. E.
Salton G.
Schutze H.
Tang Li
Xu J.
Yang L. P.
Yang L. P.
Yang Lingpeng
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref