All you need is ratings: A clustering approach to synthetic rating datasets generation

Monti, DIEGO MICHELE; Morisio, Maurizio; Rizzo, Giuseppe

All you need is ratings: A clustering approach to synthetic rating datasets generation

Authors: DIEGO MICHELE Monti
Maurizio Morisio
Giuseppe Rizzo
Publication date: 2 September 2019
Publisher: Organizers of REVEAL 2019

Abstract

The public availability of collections containing user preferences is of vital importance for performing offline evaluations in the field of recommender systems. However, the number of rating datasets is limited because of the costs required for their creation and the fear of violating the privacy of the users by sharing them. For this reason, numerous research attempts investigated the creation of synthetic collections of ratings using generative approaches. Nevertheless, these datasets are usually not reliable enough for conducting an evaluation campaign. In this paper, we propose a method for creating synthetic datasets with a configurable number of users that mimic the characteristics of already existing ones. We empirically validated the proposed approach by exploiting the synthetic datasets for evaluating different recommenders and by comparing the results with the ones obtained using real datasets

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

oai:iris.polito.it:11583/27492...

Last time updated on 30/10/2019