Users often issue vague queries. When we cannot predict users ’ intentions, a natural solution is to improve user satisfaction by diversifying search results. Such an area, usually called “result diversification”, lacks a systematic approach to construct a test collection, by which we can evaluate how search systems perform. In this paper, we propose leveraging the user contributed data in Wikipedia 1 to build up a test collection for ambiguous queries. A preliminary experiment shows promising results. 1
To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.