Search CORE

143,334 research outputs found

Settling the Minimax Regret in Online Learning to Rank with Top-k Feedback

Author: Zhang Mingyuan
Publication venue
Publication date: 01/04/2018
Field of study

HonorsStatisticsUniversity of Michiganhttp://deepblue.lib.umich.edu/bitstream/2027.42/162648/1/mingyuaz.pd

Deep Blue Documents at the University of Michigan

Policy-Aware Unbiased Learning to Rank for Top-k Rankings

Author: de Rijke Maarten
Oosterhuis Harrie
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2020
Field of study

Counterfactual Learning to Rank (LTR) methods optimize ranking systems using logged user interactions that contain interaction biases. Existing methods are only unbiased if users are presented with all relevant items in every ranking. There is currently no existing counterfactual unbiased LTR method for top-k rankings. We introduce a novel policy-aware counterfactual estimator for LTR metrics that can account for the effect of a stochastic logging policy. We prove that the policy-aware estimator is unbiased if every relevant item has a non-zero probability to appear in the top-k ranking. Our experimental results show that the performance of our estimator is not affected by the size of k: for any k, the policy-aware estimator reaches the same retrieval performance while learning from top-k feedback as when learning from feedback on the full ranking. Lastly, we introduce novel extensions of traditional LTR methods to perform counterfactual LTR and to optimize top-k metrics. Together, our contributions introduce the first policy-aware unbiased LTR approach that learns from top-k feedback and optimizes top-k metrics. As a result, counterfactual LTR is now applicable to the very prevalent top-k ranking setting in search and recommendation.Comment: SIGIR 2020 full conference pape

arXiv.org e-Print Archive

International Migration, Integration and Social Cohesion online publications

UvA-DARE

How to Improve Teaching Using Blended Learning

Author: B.Tabrizi
M.Driscoll
M.Oliver
Publication venue: 'IGI Global'
Publication date: 01/01/2019
Field of study

Bond University Research Portal

Crossref

How to Improve Teaching Using Blended Learning

Author: M.Driscoll
M.Oliver
B.Tabrizi
Publication venue: 'IGI Global'
Publication date: 01/01/1999
Field of study

Bond University Research Portal

Crossref

Controlling Fairness and Bias in Dynamic Learning-to-Rank

Author: Abdollahpouri Himan
Ensign Danielle
Horvitz Daniel G
Joachims T.
Kingma Diederik P
Salganik M. J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 29/05/2020
Field of study

Rankings are the primary interface through which many online platforms match users to items (e.g. news, products, music, video). In these two-sided markets, not only the users draw utility from the rankings, but the rankings also determine the utility (e.g. exposure, revenue) for the item providers (e.g. publishers, sellers, artists, studios). It has already been noted that myopically optimizing utility to the users, as done by virtually all learning-to-rank algorithms, can be unfair to the item providers. We, therefore, present a learning-to-rank approach for explicitly enforcing merit-based fairness guarantees to groups of items (e.g. articles by the same publisher, tracks by the same artist). In particular, we propose a learning algorithm that ensures notions of amortized group fairness, while simultaneously learning the ranking function from implicit feedback data. The algorithm takes the form of a controller that integrates unbiased estimators for both fairness and utility, dynamically adapting both as more data becomes available. In addition to its rigorous theoretical foundation and convergence guarantees, we find empirically that the algorithm is highly practical and robust.Comment: First two authors contributed equally. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval 202

arXiv.org e-Print Archive

Crossref