Search CORE

271 research outputs found

Safe Collaborative Filtering

Author: Morimura Tetsuro
Ohsaka Naoto
Oka Tatsushi
Togashi Riku
Publication venue
Publication date: 08/06/2023
Field of study

Excellent tail performance is crucial for modern machine learning tasks, such as algorithmic fairness, class imbalance, and risk-sensitive decision making, as it ensures the effective handling of challenging samples within a dataset. Tail performance is also a vital determinant of success for personalised recommender systems to reduce the risk of losing users with low satisfaction. This study introduces a "safe" collaborative filtering method that prioritises recommendation quality for less-satisfied users rather than focusing on the average performance. Our approach minimises the conditional value at risk (CVaR), which represents the average risk over the tails of users' loss. To overcome computational challenges for web-scale recommender systems, we develop a robust yet practical algorithm that extends the most scalable method, implicit alternating least squares (iALS). Empirical evaluation on real-world datasets demonstrates the excellent tail performance of our approach while maintaining competitive computational efficiency

arXiv.org e-Print Archive

Trust and recommendations

Author: A. Constantinople
A. Jøsang
A. Jøsang
C. Hess
C. Ziegler
C. Ziegler
D. Artz
D. McAllister
G. Adomavicius
J. Cacioppo
J. Herlocker
J. O’Donovan
J. Priester
J. Schafer
L.A. Zadeh
M. Ginsberg
M. Papagelis
N. Lathia
P. Massa
P. Resnick
P. Victor
P. Victor
R. Lewicki
R. Mayer
R. Moskovitch
R. Petty
W. Tang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Crossref

Ghent University Academic Bibliography

Recommended from our members

Hybrid-Parallel Parameter Estimation for Frequentist and Bayesian Models

Author: Raman Parameswaran
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

Distributed algorithms in machine learning follow two main flavors: horizontal partitioning, where the data is distributed across multiple slaves and vertical partitioning, where the model parameters are partitioned across multiple machines. The main drawback of the former strategy is that the model parameters need to be replicated on every machine. This is problematic when the number of parameters is very large, and hence cannot fit in a single machine. This drawback of the latter strategy is that the data needs to be replicated on each machine, thus failing to scale to massive datasets.The goal of this thesis is to achieve the best of both worlds by partitioning both - the data as well as the model parameters, thus enabling the training of more sophisticated models on massive datasets. In order to do so, we exploit a structure that is observed in several machine learning models, which we term as \textit{Double-Separability}. Double-Separability basically means that the objective function of the model can be decomposed into independent sub-functions which can be computed independently. For distributed machine learning, this implies that both data and model parameters can partitioned across machines and stochastic updates for parameters can be carried out independently and without any locking. Furthermore, double-separability naturally lends itself to developing efficient asynchronous algorithms which enable computation and communication to happen in parallel, offering further speedup. Some machine learning models such as Matrix Factorization directly exhibit double-separability in their objective function, however the majority of models do not. My work explores techniques to reformulate the objective function of such models to cast them into double-separable form. Often this involves introducing additional auxiliary variables that have nice interpretations. In this direction, I have developed Hybrid Parallel algorithms for machine learning tasks that include {\it Latent Collaborative Retrieval}, {\it Multinomial Logistic Regression}, {\it Variational Inference for Mixture of Exponential Families} and {\it Factorization Machines}. The software resulting from this work are available for public use under an open-source license

eScholarship - University of California