Search CORE

5,871 research outputs found

Scalable and interpretable product recommendations via overlapping co-clustering

Author: Dünner Celestine
Heckel Reinhard
Parnell Thomas
Vlachos Michail
Publication venue
Publication date: 01/04/2017
Field of study

We consider the problem of generating interpretable recommendations by identifying overlapping co-clusters of clients and products, based only on positive or implicit feedback. Our approach is applicable on very large datasets because it exhibits almost linear complexity in the input examples and the number of co-clusters. We show, both on real industrial data and on publicly available datasets, that the recommendation accuracy of our algorithm is competitive to that of state-of-art matrix factorization techniques. In addition, our technique has the advantage of offering recommendations that are textually and visually interpretable. Finally, we examine how to implement our technique efficiently on Graphical Processing Units (GPUs).Comment: In IEEE International Conference on Data Engineering (ICDE) 201

arXiv.org e-Print Archive

Serveur académique lausannois

On Sampling Strategies for Neural Network-based Collaborative Filtering

Author: Bordes Antoine
Chen Tianqi
Chen Ting
Collobert Ronan
den Oord Aaron Van
Gopalan Prem K
Kingma Diederik
Krizhevsky Alex
Lin Xiao
Mikolov T
Reddi Sashank J
Rendle Steffen
Salakhutdinov Ruslan
Zhang Xiang
Zhao Peilin
Zheng Yin
Publication venue
Publication date: 23/06/2017
Field of study

Recent advances in neural networks have inspired people to design hybrid recommendation algorithms that can incorporate both (1) user-item interaction information and (2) content information including image, audio, and text. Despite their promising results, neural network-based recommendation algorithms pose extensive computational costs, making it challenging to scale and improve upon. In this paper, we propose a general neural network-based recommendation framework, which subsumes several existing state-of-the-art recommendation algorithms, and address the efficiency issue by investigating sampling strategies in the stochastic gradient descent training for the framework. We tackle this issue by first establishing a connection between the loss functions and the user-item interaction bipartite graph, where the loss function terms are defined on links while major computation burdens are located at nodes. We call this type of loss functions "graph-based" loss functions, for which varied mini-batch sampling strategies can have different computational costs. Based on the insight, three novel sampling strategies are proposed, which can significantly improve the training efficiency of the proposed framework (up to

\times 30

times speedup in our experiments), as well as improving the recommendation performance. Theoretical analysis is also provided for both the computational cost and the convergence. We believe the study of sampling strategies have further implications on general graph-based loss functions, and would also enable more research under the neural network-based recommendation framework.Comment: This is a longer version (with supplementary attached) of the KDD'17 pape

arXiv.org e-Print Archive

Crossref

Clustering and Latent Semantic Indexing Aspects of the Nonnegative Matrix Factorization

Author: Mirzal Andri
Publication venue
Publication date: 16/12/2011
Field of study

This paper provides a theoretical support for clustering aspect of the nonnegative matrix factorization (NMF). By utilizing the Karush-Kuhn-Tucker optimality conditions, we show that NMF objective is equivalent to graph clustering objective, so clustering aspect of the NMF has a solid justification. Different from previous approaches which usually discard the nonnegativity constraints, our approach guarantees the stationary point being used in deriving the equivalence is located on the feasible region in the nonnegative orthant. Additionally, since clustering capability of a matrix decomposition technique can sometimes imply its latent semantic indexing (LSI) aspect, we will also evaluate LSI aspect of the NMF by showing its capability in solving the synonymy and polysemy problems in synthetic datasets. And more extensive evaluation will be conducted by comparing LSI performances of the NMF and the singular value decomposition (SVD), the standard LSI method, using some standard datasets.Comment: 28 pages, 5 figure

arXiv.org e-Print Archive

CiteSeerX