Rank Aggregation for Non-stationary Data Streams

Abstract

The problem of learning over non-stationary ranking streams arises naturally, particularly in recommender systems. The rankings represent the preferences of a population, and non-stationarity means that the distribution of preferences changes over time. We propose an algorithm that learns the current distribution of rankings in an online manner. The bottleneck of this process is a rank aggregation problem. We propose a generalization of the Borda algorithm for non-stationary ranking streams. As our main result, we bound the minimum number of samples required to output the ground truth with high probability. We also show how to set the optimal parameters. We then generalize the whole family of weighted voting rules (the family to which Borda belongs) to situations in which some rankings are more reliable than others. We show that, under mild assumptions, this generalization can solve the problem of rank aggregation over non-stationary data streams.

This work is partially funded by the Industrial Chair “Data science & Artificial Intelligence for Digitalized Industry & Services” from Telecom Paris (France), the Basque Government through the BERC 2018–2021 and the Elkartek program (KK-2018/00096, KK-2020/00049), and by the Spanish Government excellence accreditation Severo Ochoa SEV-2013-0323 (MICIU) and the project TIN2017-82626-R (MINECO). J. Del Ser also acknowledges funding support from the Basque Government (Consolidated Research Group MATHMODE, IT1294-19).
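To make the idea of a Borda rule in which some rankings count more than others concrete, the following is a minimal sketch and not the algorithm proposed in the paper. It assumes that each ranking in the stream receives a weight that decays exponentially with its age, controlled by a hypothetical forgetting parameter `rho`. The function names `weighted_borda` and `decayed_borda` are likewise illustrative assumptions.

```python
from typing import Hashable, List, Sequence


def weighted_borda(rankings: Sequence[Sequence[Hashable]],
                   weights: Sequence[float]) -> List[Hashable]:
    """Aggregate complete rankings with a weighted Borda count.

    Each ranking lists the same items from most to least preferred.
    An item in position p of a ranking over n items earns (n - 1 - p)
    points, multiplied by the weight of that ranking.
    """
    n = len(rankings[0])
    scores = {item: 0.0 for item in rankings[0]}
    for ranking, weight in zip(rankings, weights):
        for position, item in enumerate(ranking):
            scores[item] += weight * (n - 1 - position)
    # Highest score first; ties are broken by item order for determinism.
    return sorted(scores, key=lambda item: (-scores[item], item))


def decayed_borda(stream: Sequence[Sequence[Hashable]],
                  rho: float) -> List[Hashable]:
    """Weighted Borda where older rankings are down-weighted.

    ASSUMPTION: the weight of the i-th ranking (oldest first) is
    rho ** (age), with age 0 for the most recent ranking. rho = 1
    recovers the classical, unweighted Borda count.
    """
    t = len(stream)
    weights = [rho ** (t - 1 - i) for i in range(t)]
    return weighted_borda(stream, weights)


# The population first prefers a > b > c, then drifts to c > b > a.
stream = [["a", "b", "c"]] * 3 + [["c", "b", "a"]] * 2

consensus_plain = decayed_borda(stream, rho=1.0)   # all samples count equally
consensus_recent = decayed_borda(stream, rho=0.5)  # recent samples dominate
```

With `rho=1.0` the three older rankings outvote the two recent ones, so the consensus still reflects the old preference. With `rho=0.5` the recent rankings carry most of the weight and the consensus follows the drift.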
