Search CORE

2 research outputs found

Recommendation System-based Upper Confidence Bound for Online Advertising

Author: Khawam Kinda
Lohan Elena,
Marinca Dana
Martin Steven
Nguyen-Thanh Nhan
Quadri Dominique
Rohde David
Vasile Flavian
Publication venue: HAL CCSD
Publication date: 09/09/2019
Field of study

International audienceIn this paper, the method UCB-RS, which resorts to recommendation system (RS) for enhancing the upper-confidence bound algorithm UCB, is presented. The proposed method is used for dealing with non-stationary and large-state spaces multi-armed bandit problems. The proposed method has been targeted to the problem of the product recommendation in the online advertising. Through extensive testing with RecoGym, an OpenAI Gym-based reinforcement learning environment for the product recommendation in online advertising, the proposed method outperforms the widespread reinforcement learning schemes such as Epsilon-Greedy, Upper Confidence (UCB1) and Exponential Weights for Exploration and Exploitation (EXP3)

HAL-CentraleSupelec

arXiv.org e-Print Archive

HAL-Rennes 1