3,157 research outputs found

    Synthetic sequence generator for recommender systems - memory biased random walk on sequence multilayer network

    Full text link
    Personalized recommender systems rely on each user's personal usage data in the system, in order to assist in decision making. However, privacy policies protecting users' rights prevent these highly personal data from being publicly available to a wider researcher audience. In this work, we propose a memory biased random walk model on multilayer sequence network, as a generator of synthetic sequential data for recommender systems. We demonstrate the applicability of the synthetic data in training recommender system models for cases when privacy policies restrict clickstream publishing.Comment: The new updated version of the pape

    User's Privacy in Recommendation Systems Applying Online Social Network Data, A Survey and Taxonomy

    Full text link
    Recommender systems have become an integral part of many social networks and extract knowledge from a user's personal and sensitive data both explicitly, with the user's knowledge, and implicitly. This trend has created major privacy concerns as users are mostly unaware of what data and how much data is being used and how securely it is used. In this context, several works have been done to address privacy concerns for usage in online social network data and by recommender systems. This paper surveys the main privacy concerns, measurements and privacy-preserving techniques used in large-scale online social networks and recommender systems. It is based on historical works on security, privacy-preserving, statistical modeling, and datasets to provide an overview of the technical difficulties and problems associated with privacy preserving in online social networks.Comment: 26 pages, IET book chapter on big data recommender system

    On content-based recommendation and user privacy in social-tagging systems

    Get PDF
    Recommendation systems and content filtering approaches based on annotations and ratings, essentially rely on users expressing their preferences and interests through their actions, in order to provide personalised content. This activity, in which users engage collectively has been named social tagging, and it is one of the most popular in which users engage online, and although it has opened new possibilities for application interoperability on the semantic web, it is also posing new privacy threats. It, in fact, consists of describing online or offline resources by using free-text labels (i.e. tags), therefore exposing the user profile and activity to privacy attacks. Users, as a result, may wish to adopt a privacy-enhancing strategy in order not to reveal their interests completely. Tag forgery is a privacy enhancing technology consisting of generating tags for categories or resources that do not reflect the user's actual preferences. By modifying their profile, tag forgery may have a negative impact on the quality of the recommendation system, thus protecting user privacy to a certain extent but at the expenses of utility loss. The impact of tag forgery on content-based recommendation is, therefore, investigated in a real-world application scenario where different forgery strategies are evaluated, and the consequent loss in utility is measured and compared.Peer ReviewedPostprint (author’s final draft

    An Accuracy-Assured Privacy-Preserving Recommender System for Internet Commerce

    Full text link
    Recommender systems, tool for predicting users' potential preferences by computing history data and users' interests, show an increasing importance in various Internet applications such as online shopping. As a well-known recommendation method, neighbourhood-based collaborative filtering has attracted considerable attention recently. The risk of revealing users' private information during the process of filtering has attracted noticeable research interests. Among the current solutions, the probabilistic techniques have shown a powerful privacy preserving effect. When facing kk Nearest Neighbour attack, all the existing methods provide no data utility guarantee, for the introduction of global randomness. In this paper, to overcome the problem of recommendation accuracy loss, we propose a novel approach, Partitioned Probabilistic Neighbour Selection, to ensure a required prediction accuracy while maintaining high security against kkNN attack. We define the sum of kk neighbours' similarity as the accuracy metric alpha, the number of user partitions, across which we select the kk neighbours, as the security metric beta. We generalise the kk Nearest Neighbour attack to beta k Nearest Neighbours attack. Differing from the existing approach that selects neighbours across the entire candidate list randomly, our method selects neighbours from each exclusive partition of size kk with a decreasing probability. Theoretical and experimental analysis show that to provide an accuracy-assured recommendation, our Partitioned Probabilistic Neighbour Selection method yields a better trade-off between the recommendation accuracy and system security.Comment: replacement for the previous versio

    Guiding Us Throughout a Sea of Data - A Survey on Recommender Systems and Its Privacy Challenges

    Get PDF
    Over the past decades, the Internet has served as the backbone connecting people to others, places and things. With the sheer volume of information generated everyday, people can feel overwhelmed when having to make a selection among the multiple options that normally come up after a search or application request. For example, when searching for news articles regarding a particular topic, the search engine will present a number of results to you. When looking for some product on shopping websites, there are usually several pages of results that match the keywords. It can be very challenging for people to find their most expected information in the era of big data. A recommender system is a program that utilizes algorithms to learn users’ preferences from historical data, and predict their future interests. Recommender systems are employed everywhere in the cyberspace. Many websites including Amazon, eBay, YouTube, Facebook, Netflix, and others, have integrated automatic personalized recommendation techniques into their systems, in order to help users find their most desired information. While recommender systems have become a common feature on most web applications and sites, one of the major issues around its use is privacy concerns. A regular recommender system requires the users to share their online behavior data, such as their past shopping records, browsing history, visited places, so that it can learn their preferences. This can potentially deter people from using the system because these data are considered as users’ privacy and many do not feel comfortable sharing the information with other parties. In this research, we studied several recommendation algorithms, and compared their performance as well as prediction accuracy on real-world datasets. We also proposed a novel nonnegative matrix factorization (NMF) based privacy-preserving point-of-interest recommendation framework, in which the latent factors in NMF are learned on user group preference instead of individual user preference. Recommendations are made by personalizing the group preference on user’s local devices. There are no location or check-in data collected from the users anywhere throughout the learning and recommendation processes. Some preliminary results on a regular recommender system were established and two GUI applications were developed. The on-going research focuses on integrating the privacy-preserving framework into the system and verifying the effectiveness as well as the recommendation accuracy of the proposed model
    • …
    corecore