research

Online Ranking: Discrete Choice, Spearman Correlation and Other Feedback

Abstract

Given a set VV of nn objects, an online ranking system outputs at each time step a full ranking of the set, observes a feedback of some form and suffers a loss. We study the setting in which the (adversarial) feedback is an element in VV, and the loss is the position (0th, 1st, 2nd...) of the item in the outputted ranking. More generally, we study a setting in which the feedback is a subset UU of at most kk elements in VV, and the loss is the sum of the positions of those elements. We present an algorithm of expected regret O(n3/2Tk)O(n^{3/2}\sqrt{Tk}) over a time horizon of TT steps with respect to the best single ranking in hindsight. This improves previous algorithms and analyses either by a factor of either Ω(k)\Omega(\sqrt{k}), a factor of Ω(logn)\Omega(\sqrt{\log n}) or by improving running time from quadratic to O(nlogn)O(n\log n) per round. We also prove a matching lower bound. Our techniques also imply an improved regret bound for online rank aggregation over the Spearman correlation measure, and to other more complex ranking loss functions

    Similar works

    Full text

    thumbnail-image

    Available Versions