9 research outputs found

Reducing Offline Evaluation Bias in Recommendation Systems

Author: De Myttenaere Arnaud
Golden Boris
Le Grand Bénédicte
Rossi Fabrice
Publication date: 06/06/2014

Recommendation systems have been integrated into the majority of large online systems. They tailor those systems to individual users by filtering and ranking information according to user profiles. This adaptation process influences the way users interact with the system and, as a consequence, makes it harder to evaluate a recommendation algorithm with historical data (via offline evaluation). This paper analyses this evaluation bias and proposes a simple item weighting solution that reduces its impact. The efficiency of the proposed solution is evaluated on real-world data extracted from the Viadeo professional social network.
Comment: 23rd annual Belgian-Dutch Conference on Machine Learning (Benelearn 2014), Bruxelles, Belgium (2014).

arXiv.org e-Print Archive

HAL-Paris1

Reducing offline evaluation bias of collaborative filtering algorithms

Author: De Myttenaere Arnaud
Golden Boris
Le Grand Bénédicte
Rossi Fabrice
Publication date: 22/04/2015

Recommendation systems have been integrated into the majority of large online systems to filter and rank information according to user profiles. They thus influence the way users interact with the system and, as a consequence, bias the evaluation of the performance of a recommendation algorithm computed using historical data (via offline evaluation). This paper presents a new application of a weighted offline evaluation to reduce this bias for collaborative filtering algorithms.
Comment: European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), Apr 2015, Bruges, Belgium. pp. 137-142, 2015, Proceedings of the 23rd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN 2015).

arXiv.org e-Print Archive

HAL-Paris1

Using the Mean Absolute Percentage Error for Regression Models

Author: De Myttenaere Arnaud
Golden Boris
Le Grand Bénédicte
Rossi Fabrice
Publication date: 22/04/2015

We study in this paper the consequences of using the Mean Absolute Percentage Error (MAPE) as a measure of quality for regression models. We show that finding the best model under the MAPE is equivalent to doing weighted Mean Absolute Error (MAE) regression. We also show that universal consistency of Empirical Risk Minimization remains possible using the MAPE instead of the MAE.
Comment: European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), Apr 2015, Bruges, Belgium. 2015, Proceedings of the 23rd European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN 2015).

arXiv.org e-Print Archive

HAL-Paris1

Consistance de la minimisation du risque empirique pour l'optimisation de l'erreur relative moyenne

Author: de Myttenaere Arnaud
Le Grand Bénédicte
Rossi Fabrice
Publication venue: HAL CCSD
Publication date: 01/06/2015

We study in this paper the consequences of using the Mean Absolute Percentage Error (MAPE), also called the mean relative error, as a measure of quality for regression models. We show that finding the best model under the MAPE is equivalent to doing weighted Mean Absolute Error (MAE) regression. We also show that, under some assumptions, universal consistency of Empirical Risk Minimization remains possible using the MAPE.

HAL-Paris1

Évaluation hors-ligne d'un modèle prédictif : application aux algorithmes de recommandation et à la minimisation de l'erreur relative moyenne

Author: De Myttenaere Arnaud
Publication venue: HAL CCSD
Publication date: 04/11/2016

Offline evaluation makes it possible to estimate the quality of a predictive model using historical data before deploying the model in production. To be effective, the data used to compute the offline evaluation must be representative of real data. In this thesis we describe the case when the historical data is biased. Through experiments done at Viadeo (a French professional social network) we suggest a new offline evaluation procedure to estimate the quality of a recommendation algorithm when the data is biased. Then we introduce the concept of Explanatory Shift, which is a particular case of bias, and we suggest a new approach to build an efficient model under Explanatory Shift. In the second part of this thesis we discuss the importance of the loss function used to select a model using the empirical risk minimization method (ERM), and we study in detail the particular case of the Mean Absolute Percentage Error (MAPE). First we analyze necessary conditions to ensure that the risk is well defined. Then we show that the model obtained by ERM is consistent under some assumptions.
French abstract (translated): Offline evaluation estimates the quality of a predictive model from historical data. In practice, this approach estimates the quality of a model before it is deployed in production, without interacting with customers or users. For an offline evaluation to be relevant, the data used must be unbiased, that is, representative of the behavior observed once the model is in production. In this thesis, we address the case where the available data is biased. Based on experiments carried out at Viadeo, we propose a new offline evaluation procedure for a recommendation algorithm. This new approach reduces the influence of the bias on the results of the offline evaluation.
We then introduce the Explanatory Shift setting, which corresponds to a situation in which the bias lies in the distribution of the target variable. Experiments on data from the e-commerce site Cdiscount and on the Newsgroup dataset show that, under certain assumptions, it is possible to infer the distribution of the target variable in order to correct for the non-representativeness of the available training sample. On a more theoretical level, we then examine the role of the loss function used to select a model by empirical risk minimization. More precisely, we detail the particular case of minimizing the mean relative error and introduce the concept of MAPE (Mean Absolute Percentage Error) regression. The work carried out in this setting concerns the consistency of the empirical risk minimization estimator for MAPE regression, and regularized MAPE regression in practice. Experiments on simulated data and on data extracted from the Viadeo professional social network show the advantages of MAPE regression and illustrate theoretical properties of the resulting estimator.

Thèses en Ligne

HAL-Paris1

Study of a bias in the offline evaluation of a recommendation algorithm

Author: de Myttenaere Arnaud
Golden Boris
Le Grand Bénédicte
Rossi Fabrice
Publication venue: Ibai Publishing
Publication date: 11/07/2015

Recommendation systems have been integrated into the majority of large online systems to filter and rank information according to user profiles. They thus influence the way users interact with the system and, as a consequence, bias the evaluation of the performance of a recommendation algorithm computed using historical data (via offline evaluation). This paper describes this bias and discusses the relevance of a weighted offline evaluation to reduce this bias for different classes of recommendation algorithms.

arXiv.org e-Print Archive

HAL-Paris1

Mean Absolute Percentage Error for Regression Models

Author: Arnaud De Myttenaere
Boris Golden
Bénédicte Le Grand
Fabrice Rossi
Publication date: 05/03/2020

We study in this paper the consequences of using the Mean Absolute Percentage Error (MAPE) as a measure of quality for regression models. We prove the existence of an optimal MAPE model and we show the universal consistency of Empirical Risk Minimization based on the MAPE. We also show that finding the best model under the MAPE is equivalent to doing weighted Mean Absolute Error (MAE) regression, and we apply this weighting strategy to kernel regression. The behavior of the MAPE kernel regression is illustrated on simulated data.

CiteSeerX

Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries

Author: A Nicholas
A Ricardo
A Scott
A Torralba
Abbe Mowshowitz
Abdulfatai Popoola
Abhay Sukumaran
Adam Fourney
Adam Mann
Adam N Joinson
Aditya Pal
Akshay Java
Alan Mislove
Alex Rosenblat
Alexandra Olteanu
Alice E Marwick
Amit Sharma
Andre Oboler
Andrew Schwartz
Andrew Yates
Andrés Michael S Bernstein
Aniko Hannak
Anne Archambault
Anne Aula
Anne Bowser
Anne Oeldorf-Hirsch
Arnaud De Myttenaere
Arvind Narayanan
Asaf Beasley
Axel Bruns
Aylin Caliskan-Islam
Barbara Poblete
Bernd Carsten Stahl
Bimal Viswanath
Boyd Danah
Brendan Daniel M Romero
Brent Hecht
Bryce Goodman
Caitlin Mclaughlin
Carlos Castillo
Chenliang Li
Chris Anderson
Chris Drummond
Chris Preist
Christian Reuter
Christian Sandvig
Christo Wilson
Claire Cain Miller
Claudia Wagner
Cliff Lampe
Cristian Danescu-Niculescu-Mizil
Cynthia Dwork
Cynthia Rudin
D I Adam
D Mitja
D-P. Nguyen
Daniel Gayo Avello
Daniel Gayo-Avello
Daniel Preoţiuc-Pietro
Daniele Fanelli
Daphne Chang
Dario Amodei
David John Hughes
David Jurgens
David Lazer
David Silverman
Davide Proserpio
Deen Freelon
Delia Mocanu
Delip Rao
Derek Ruths
Diego Saez-Trumper
Dimitar Nikolov
Dirk Hovy
Dong Nguyen
Dongyuan Bang Hui Lim
Dos Virgile Landeiro
Doug Schuler
Eduardo Ruiz
Edward Newell
Elad Yom-Tov
Emilio Ferrara
Emilio Zagheni
Emre Kıcıman
Emre Munmun De Choudhury
Erhard Rahm
Eric Gilbert
Eric Horvitz
Erin Kenneally
Eszter Hargittai
Ethan Cohen-Cole
Eunil Sang Jib Kwon
Eyal Carmi
Eytan Bakshy
Fabrizio Silvestri
Felix Ming Fai Wong
Fernando Diaz
Filip Radlinski
Florian Tramer
Fons Wijnhoven
Fred Morstatter
Gang Wang
Gary King
Giovanni Quattrone
Guy Shani
H Wallach
Hai Liang
Hamid Ekbia
Hannah Jean Miller
Hassan Saif
Hazim Almuhimedi
Hyunwoo Chun
Hüseyin Oktay
I Ian
Ilknur Celik
Ingmar Venkata Rama Kiran Garimella
Ingmar Weber
Isaac Johnson
J Christopher
J Sean
Jacob Metcalf
Jacob Ratkiewicz
Jaime Teevan
James Grimmelmann
James Howison
James Matthew
James Mccorriston
Janne Lindqvist
Jeremy Ginsberg
Jesús Bobadilla
Jiang Yang
Jie Tang
Jilin Chen
Jim Maddock
Joan Dimicco
Johan Ugander
Jonathan Cinnamon
Joseph Konstan
Josh Terrell
José Van Dijck
Juhi Kulshrestha
Julia D Fraustino
Julia Schwarz
Jure Leskovec
Justin Cheng
Justin Cranshaw
Justin Sampson
Kalev Leetaru
Kashmir Hill
Kate Crawford
Kate Ehrlich
Kathy Charmaz
Katrin Weller
Kaveri Subrahmanyam
Kenneth Joseph
Kevin Macg
Kien Pham
Kira Radinsky
Kiri Wagstaff
Kj Ryan
Kristina Lerman
Kurt Thomas
L Daniel
L Norbert
L S Cynthia
Lars Backstrom
Laura Reed
Lauren Kirchner
Lev Muchnik
Lichan Hong
Lindsay Poirier
Linna Li
Liza Potts
Loizos Michael
Lucia Specia
Lucie Flekova
Luke Hutton
M Meredith
M Momin
Maeve Duggan
Marina Sokolova
Mark Dredze
Mark Graham
Markus Eni Mustafaraj
Martin Shelton
Matthew J Salganik
Matthew Richardson
Matthias R James W Pennebaker
Mattias Rost
Mauro Coletto
Meeyoung Cha
Meredith Ringel Munmun De Choudhury
Michael Zimmer
Michail Vlachos
Michal Kosinski
Michele Starnini
Mike Dave D&apos
Miles Osborne
Miller Mcpherson
Moira Burke
Monica Anderson
Mor Naaman
Moritz Hardt
Mossaab Bagdouri
Mrinal Kumar
Muhammad Bilal Zafar
Nan Lin
Nasir Naveed
Nicholas Diakopoulos
Nicole B Boyd
Nir Grinberg
Nitin Jindal
Norah Abokhodair
O Matthew
O&apos
Olteanu
Pablo Barbera
Paolo Giardullo
Parantapa Bhattacharya
Patrick Meier
Paul Bennett
Paul Ohm
Paul R Rosenbaum
Paul Resnick
Pedro Calais Guerra
R Colin
Ralph Gross
Raphael Ottoni
Raviv Cohen
Ricardo Baeza
Ricardo Baeza-Yates
Richard Mccreadie
Rishabh Mehrotra
Rohilla Cosma
Russell Lyons
Ruth Garcia-Gavilanes
Ryen White
Sai Teja Peddinti
Salvatore Scellato
Sam Burnett
Sandra González-Bailón
Sara Hajian
Sarah Vieweg
Sarita Yardi Schoenebeck
Sauvik Das
Scott Counts
Scott Munmun De Choudhury
Shaomei Wu
Sharad Goel
Sherlock Campbell
Shirin Nilizadeh
Shuang-Hong Yang
Sinan Aral
Sitaram Asur
Sofiane Abbar
Solon Barocas
Sophie Chou
Subhasree Isaac L Johnson
Susan Dumais
Symeon Papadopoulos
Taha Yasseri
Tarleton Gillespie
Tatjana Scheffler
Tehila Minkus
Tim Harford
Tolga Bolukbasi
Tom Gruber
U N Ocha
Umashanthi Pavalanathan
Us White House
Valentina Grasso
Virgile Landeiro
W Ryen
Wai-Tat Vera Liao
Walid Magdy
Wanita Sherchan
Wei Gao
Wei Gong
William M Trochim
Winter Mason
Xin Yan
Yabing Liu
Yana Volkovich
Yang Wang
Yelena Mejova
Yishi Haji Mohammad Saleem
Yu Ru Munmun De Choudhury
Yu-Ru Lin
Yusuke Yamamoto
Yuxiao Dong
Zengbin Zhang
Zeynep Tufekci
Zoltan Gyongyi
Publication venue: Elsevier BV
Publication date: 01/01/2016

Crossref