8 research outputs found
A novel Boolean kernels family for categorical data
Kernel based classifiers, such as SVM, are considered state-of-the-art algorithms and are widely used on many classification tasks. However, this kind of methods are hardly interpretable and for this reason they are often considered as black-box models. In this paper, we propose a new family of Boolean kernels for categorical data where features correspond to propositional formulas applied to the input variables. The idea is to create human-readable features to ease the extraction of interpretation rules directly from the embedding space. Experiments on artificial and benchmark datasets show the effectiveness of the proposed family of kernels with respect to established ones, such as RBF, in terms of classification accuracy
Deep Item-based Collaborative Filtering for Top-N Recommendation
Item-based Collaborative Filtering(short for ICF) has been widely adopted in
recommender systems in industry, owing to its strength in user interest
modeling and ease in online personalization. By constructing a user's profile
with the items that the user has consumed, ICF recommends items that are
similar to the user's profile. With the prevalence of machine learning in
recent years, significant processes have been made for ICF by learning item
similarity (or representation) from data. Nevertheless, we argue that most
existing works have only considered linear and shallow relationship between
items, which are insufficient to capture the complicated decision-making
process of users.
In this work, we propose a more expressive ICF solution by accounting for the
nonlinear and higher-order relationship among items. Going beyond modeling only
the second-order interaction (e.g. similarity) between two items, we
additionally consider the interaction among all interacted item pairs by using
nonlinear neural networks. Through this way, we can effectively model the
higher-order relationship among items, capturing more complicated effects in
user decision-making. For example, it can differentiate which historical
itemsets in a user's profile are more important in affecting the user to make a
purchase decision on an item. We treat this solution as a deep variant of ICF,
thus term it as DeepICF. To justify our proposal, we perform empirical studies
on two public datasets from MovieLens and Pinterest. Extensive experiments
verify the highly positive effect of higher-order item interaction modeling
with nonlinear neural networks. Moreover, we demonstrate that by more
fine-grained second-order interaction modeling with attention network, the
performance of our DeepICF method can be further improved.Comment: 25 pages, submitted to TOI
Boolean kernels for collaborative filtering in top-N item recommendation
In many personalized recommendation problems available data consists only of
positive interactions (implicit feedback) between users and items. This problem
is also known as One-Class Collaborative Filtering (OC-CF). Linear models
usually achieve state-of-the-art performances on OC-CF problems and many
efforts have been devoted to build more expressive and complex representations
able to improve the recommendations. Recent analysis show that collaborative
filtering (CF) datasets have peculiar characteristics such as high sparsity and
a long tailed distribution of the ratings. In this paper we propose a boolean
kernel, called Disjunctive kernel, which is less expressive than the linear one
but it is able to alleviate the sparsity issue in CF contexts. The embedding of
this kernel is composed by all the combinations of a certain arity d of the
input variables, and these combined features are semantically interpreted as
disjunctions of the input variables. Experiments on several CF datasets show
the effectiveness and the efficiency of the proposed kernel.Comment: 24 pages, 28 figures, 2 table