3,157 research outputs found
Synthetic sequence generator for recommender systems - memory biased random walk on sequence multilayer network
Personalized recommender systems rely on each user's personal usage data in
the system, in order to assist in decision making. However, privacy policies
protecting users' rights prevent these highly personal data from being publicly
available to a wider researcher audience. In this work, we propose a memory
biased random walk model on multilayer sequence network, as a generator of
synthetic sequential data for recommender systems. We demonstrate the
applicability of the synthetic data in training recommender system models for
cases when privacy policies restrict clickstream publishing.Comment: The new updated version of the pape
User's Privacy in Recommendation Systems Applying Online Social Network Data, A Survey and Taxonomy
Recommender systems have become an integral part of many social networks and
extract knowledge from a user's personal and sensitive data both explicitly,
with the user's knowledge, and implicitly. This trend has created major privacy
concerns as users are mostly unaware of what data and how much data is being
used and how securely it is used. In this context, several works have been done
to address privacy concerns for usage in online social network data and by
recommender systems. This paper surveys the main privacy concerns, measurements
and privacy-preserving techniques used in large-scale online social networks
and recommender systems. It is based on historical works on security,
privacy-preserving, statistical modeling, and datasets to provide an overview
of the technical difficulties and problems associated with privacy preserving
in online social networks.Comment: 26 pages, IET book chapter on big data recommender system
On content-based recommendation and user privacy in social-tagging systems
Recommendation systems and content filtering approaches based on annotations and ratings, essentially rely on users expressing their preferences and interests through their actions, in order to provide personalised content. This activity, in which users engage collectively has been named social tagging, and it is one of the most popular in which users engage online, and although it has opened new possibilities for application interoperability on the semantic web, it is also posing new privacy threats. It, in fact, consists of describing online or offline resources by using free-text labels (i.e. tags), therefore exposing the user profile and activity to privacy attacks. Users, as a result, may wish to adopt a privacy-enhancing strategy in order not to reveal their interests completely. Tag forgery is a privacy enhancing technology consisting of generating tags for categories or resources that do not reflect the user's actual preferences. By modifying their profile, tag forgery may have a negative impact on the quality of the recommendation system, thus protecting user privacy to a certain extent but at the expenses of utility loss. The impact of tag forgery on content-based recommendation is, therefore, investigated in a real-world application scenario where different forgery strategies are evaluated, and the consequent loss in utility is measured and compared.Peer ReviewedPostprint (author’s final draft
An Accuracy-Assured Privacy-Preserving Recommender System for Internet Commerce
Recommender systems, tool for predicting users' potential preferences by
computing history data and users' interests, show an increasing importance in
various Internet applications such as online shopping. As a well-known
recommendation method, neighbourhood-based collaborative filtering has
attracted considerable attention recently. The risk of revealing users' private
information during the process of filtering has attracted noticeable research
interests. Among the current solutions, the probabilistic techniques have shown
a powerful privacy preserving effect. When facing Nearest Neighbour attack,
all the existing methods provide no data utility guarantee, for the
introduction of global randomness. In this paper, to overcome the problem of
recommendation accuracy loss, we propose a novel approach, Partitioned
Probabilistic Neighbour Selection, to ensure a required prediction accuracy
while maintaining high security against NN attack. We define the sum of
neighbours' similarity as the accuracy metric alpha, the number of user
partitions, across which we select the neighbours, as the security metric
beta. We generalise the Nearest Neighbour attack to beta k Nearest
Neighbours attack. Differing from the existing approach that selects neighbours
across the entire candidate list randomly, our method selects neighbours from
each exclusive partition of size with a decreasing probability. Theoretical
and experimental analysis show that to provide an accuracy-assured
recommendation, our Partitioned Probabilistic Neighbour Selection method yields
a better trade-off between the recommendation accuracy and system security.Comment: replacement for the previous versio
Guiding Us Throughout a Sea of Data - A Survey on Recommender Systems and Its Privacy Challenges
Over the past decades, the Internet has served as the backbone connecting people to others, places and things. With the sheer volume of information generated everyday, people can feel overwhelmed when having to make a selection among the multiple options that normally come up after a search or application request. For example, when searching for news articles regarding a particular topic, the search engine will present a number of results to you. When looking for some product on shopping websites, there are usually several pages of results that match the keywords. It can be very challenging for people to find their most expected information in the era of big data. A recommender system is a program that utilizes algorithms to learn users’ preferences from historical data, and predict their future interests. Recommender systems are employed everywhere in the cyberspace. Many websites including Amazon, eBay, YouTube, Facebook, Netflix, and others, have integrated automatic personalized recommendation techniques into their systems, in order to help users find their most desired information. While recommender systems have become a common feature on most web applications and sites, one of the major issues around its use is privacy concerns. A regular recommender system requires the users to share their online behavior data, such as their past shopping records, browsing history, visited places, so that it can learn their preferences. This can potentially deter people from using the system because these data are considered as users’ privacy and many do not feel comfortable sharing the information with other parties. In this research, we studied several recommendation algorithms, and compared their performance as well as prediction accuracy on real-world datasets. We also proposed a novel nonnegative matrix factorization (NMF) based privacy-preserving point-of-interest recommendation framework, in which the latent factors in NMF are learned on user group preference instead of individual user preference. Recommendations are made by personalizing the group preference on user’s local devices. There are no location or check-in data collected from the users anywhere throughout the learning and recommendation processes. Some preliminary results on a regular recommender system were established and two GUI applications were developed. The on-going research focuses on integrating the privacy-preserving framework into the system and verifying the effectiveness as well as the recommendation accuracy of the proposed model
- …