28 research outputs found
An Algorithm for Density Enrichment of Sparse Collaborative Filtering Datasets Using Robust Predictions as Derived Ratings
Collaborative filtering algorithms formulate personalized recommendations for a user, first by analysing already entered ratings to identify other users with similar tastes to the user (termed as near neighbours), and then using the opinions of the near neighbours to predict which items the target user would like. However, in sparse datasets, too few near neighbours can be identified, resulting in low accuracy predictions and even a total inability to formulate personalized predictions. This paper addresses the sparsity problem by presenting an algorithm that uses robust predictions, that is predictions deemed as highly probable to be accurate, as derived ratings. Thus, the density of sparse datasets increases, and improved rating prediction coverage and accuracy are achieved. The proposed algorithm, termed as CFDR, is extensively evaluated using (1) seven widely-used collaborative filtering datasets, (2) the two most widely-used correlation metrics in collaborative filtering research, namely the Pearson correlation coefficient and the cosine similarity, and (3) the two most widely-used error metrics in collaborative filtering, namely the mean absolute error and the root mean square error. The evaluation results show that, by successfully increasing the density of the datasets, the capacity of collaborative filtering systems to formulate personalized and accurate recommendations is considerably improved
Improving Collaborative Filtering's Rating Prediction Quality by Considering Shifts in Rating Practices
Users that populate ratings databases, such as IMDB, follow different
marking practices, in the sense that some are stricter, while others are
more lenient. This aspect has been captured by the most widely used
similarity metrics in collaborative filtering, namely the Pearson
Correlation and the Adjusted Cosine Similarity, which adjust each
individual rating by the mean of the ratings entered by the specific
user, when computing similarities. However, relying on the mean value
presumes that the users' marking practices remain constant over time; in
practice though, it is possible that a user's marking practices change
over time, i.e. a user could start as strict and subsequently become
lenient, or vice versa. In this work, we propose an approach to take
into account marking practices shifts by (1) introducing the concept of
dynamic user rating averages which follow the users' marking practices
shifts, (2) presenting two alternative algorithms for computing a user's
dynamic averages and (3) performing a comparative evaluation among these
two algorithms and the classic static average (unique mean value) that
the Pearson Correlation uses
Exploiting Rating Abstention Intervals for Addressing Concept Drift in Social Network Recommender Systems
One of the major problems that social networks face is the continuous production of successful, user-targeted information in the form of recommendations, which are produced exploiting technology from the field of recommender systems. Recommender systems are based on information about users’ past behavior to formulate recommendations about their future actions. However, as time goes by, social network users may change preferences and likings: they may like different types of clothes, listen to different singers or even different genres of music and so on. This phenomenon has been termed as concept drift. In this paper: (1) we establish that when a social network user abstains from rating submission for a long time, it is a strong indication that concept drift has occurred and (2) we present a technique that exploits the abstention interval concept, to drop from the database ratings that do not reflect the current social network user’s interests, thus improving prediction quality
Improving Collaborative Filtering's Rating Prediction Accuracy by Considering Users' Rating Variability
When rating predictions are computed in user-user collaborative
filtering, each individual rating is typically adjusted by the mean of
the ratings entered by the specific user. This practice takes into
account the fact that users follow different rating practices, in the
sense that some are stricter when rating items, while others are more
lenient. However, users’ rating practices may also differ in rating
variability, in the sense that some user may be entering ratings close
to her mean, while another user may be entering more extreme ratings,
close to the limits of the rating scale. In this work, we (1) propose an
algorithm that considers users’ ratings variability in the rating
prediction computation process, aiming to improve rating prediction
quality and (2) evaluate the proposed algorithm against seven widely
used datasets considering three widely used variability measures and two
user similarity metrics. The proposed algorithm, using the “mean
absolute deviation around the mean” variability measure, has been
found to introduce considerable gains in rating prediction accuracy, in
every dataset and under both user similarity metrics tested
Editorial for the Special Issue on “Modern Recommender Systems: Approaches, Challenges and Applications”
Recommender systems are nowadays an indispensable part of most personalized systems implementing information access and content delivery, supporting a great variety of user activities [...
Improving Collaborative Filtering's Rating Prediction Coverage in Sparse Datasets by Exploiting User Dissimilarity
Collaborative filtering systems analyze the ratings databases to
identify users with similar likings and preferences, termed as near
neighbors, and then generate rating predictions for a user by examining
the ratings of his near neighbors for items that the user has not yet
rated; based on rating predictions, recommendations are then formulated.
However, these systems are known to exhibit the “gray sheep”
problem, i.e. the situation where no near neighbors can be identified
for a number of users, and hence no recommendation can be formulated for
them. This problem is more intense in sparse datasets, i.e. datasets
with relatively small number of ratings, compared to the number of users
and items. In this work, we propose a method for alleviating this
problem by exploiting user dissimilarity, under the assumption that if
some users have exhibited opposing preferences in the past, they are
likely to do so in the future. The proposed method has been evaluated
against seven widely used datasets and has been proven to be
particularly effective in increasing the percentage of users for which
personalized recommendations can be formulated in the context of sparse
datasets, while at the same time maintaining or slightly improving
rating prediction quality
Using Time Clusters for Following Users’ Shifts in Rating Practices
Users that enter ratings for items follow different rating practices, in the sense that, when rating items, some users are more lenient, while others are stricter. This aspect is taken into account by the most widely used similarity metric in user-user collaborative filtering, namely, the Pearson Correlation, which adjusts each individual user rating by the mean value of the ratings entered by the specific user, when computing similarities. However, a user’s rating practices change over time, i.e. a user could start as strict and subsequently become lenient or vice versa. In that sense, the practice of using a single mean value for adjusting users’ ratings is inadequate, since it fails to follow such shifts in users’ rating practices, leading to decreased rating prediction accuracy. In this work, we address this issue by using the concept of dynamic averages introduced earlier and we extend earlier work by (1) introducing the concept of rating time clusters and (2) presenting a novel algorithm for calculating dynamic user averages and exploiting them in user-user collaborative, filtering implementations. The proposed algorithm incorporates the aforementioned concept and is able to follow more successfully shifts in users’ rating practices. It has been evaluated using numerous datasets, and has been found to introduce significant gains in rating prediction accuracy, while outperforming the dynamic average computation approaches that are presented earlier
On Replacement Service Selection in WS-BPEL Scenario Adaptation
WS-BPEL scenario execution adaptation has been proposed by numerous
researchers as a response to the need of users to tailor the WS-BPEL
scenario execution to their individual preferences; these preferences
are typically expressed through Quality of Service (QoS) policies, which
the adaptation mechanism considers in order to select the services that
will ultimately be invoked to realize the desired business process. In
this paper, we consider a number of issues related to WS-BPEL scenario
adaptation, aiming to enhance adaptation quality and improve the QoS
offered to end users. More specifically, with the goal of broadening the
service selection pool we (a) discuss the identification of potential
services that can be used to realize a functionality used in the WS-BPEL
scenario and (b) elaborate on transactional semantics that invocations
to multiple services offered by the same provider may bear. We also
describe and validate an architecture for realizing the proposed
enhancements
On Exploiting Rating Prediction Accuracy Features in Dense Collaborative Filtering Datasets
One of the typical goals of collaborative filtering algorithms is to produce rating predictions with values very close to what real users would give to an item. Afterward, the items having the largest rating prediction values will be recommended to the users by the recommender system. Collaborative filtering algorithms can be applied to both sparse and dense datasets, and each of these dataset categories involves different kinds of risks. As far as the dense collaborative filtering datasets are concerned, where the rating prediction coverage is, most of the time, very high, we usually face large rating prediction times, issues concerning the selection of a user’s near neighbours, etc. Although collaborative filtering algorithms usually achieve better results when applied to dense datasets, there is still room for improvement, since in many cases, the rating prediction error is relatively high, which leads to unsuccessful recommendations and hence to recommender system unreliability. In this work, we explore rating prediction accuracy features, although in a broader context, in dense collaborative filtering datasets. We conduct an extensive evaluation, using dense datasets, widely used in collaborative filtering research, in order to find the associations between these features and the rating prediction accuracy
A Hybrid Framework for WS-BPEL Scenario Execution Adaptation, Using Monitoring and Feedback Data
In this paper, we present a framework which provides runtime adaptation
for BPEL scenarios. The adaptation is based on (a) quality of service
parameters of available web services (b) quality of service policies
specified by users (c) collaborative filtering techniques, allowing
clients to further refine the adaptation process by considering service
selections made by other clients, (d) monitoring, in order to follow the
variations of QoS attribute values and (e) on users’ opinions services
they have used