Double Clipping: Less-Biased Variance Reduction in Off-Policy Evaluation

Buchholz, Alexander; Di Benedetto, Giuseppe; Lichtenberg, Jan Malte; London, Ben; Ruffini, Matteo

Double Clipping: Less-Biased Variance Reduction in Off-Policy Evaluation

Authors: Alexander Buchholz
Giuseppe Di Benedetto
Jan Malte Lichtenberg
Ben London
Matteo Ruffini
Publication date: 3 September 2023
Publisher

Abstract

"Clipping" (a.k.a. importance weight truncation) is a widely used variance-reduction technique for counterfactual off-policy estimators. Like other variance-reduction techniques, clipping reduces variance at the cost of increased bias. However, unlike other techniques, the bias introduced by clipping is always a downward bias (assuming non-negative rewards), yielding a lower bound on the true expected reward. In this work we propose a simple extension, called

\textit{double clipping}

, which aims to compensate this downward bias and thus reduce the overall bias, while maintaining the variance reduction properties of the original estimator.Comment: Presented at CONSEQUENCES '23 workshop at RecSys 2023 conference in Singapor

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2309.01120

Last time updated on 12/09/2023