885 research outputs found

    Comparison of group recommendation algorithms

    Get PDF
    In recent years recommender systems have become the common tool to handle the information overload problem of educational and informative web sites, content delivery systems, and online shops. Although most recommender systems make suggestions for individual users, in many circumstances the selected items (e.g., movies) are not intended for personal usage but rather for consumption in groups. This paper investigates how effective group recommendations for movies can be generated by combining the group members' preferences (as expressed by ratings) or by combining the group members' recommendations. These two grouping strategies, which convert traditional recommendation algorithms into group recommendation algorithms, are combined with five commonly used recommendation algorithms to calculate group recommendations for different group compositions. The group recommendations are not only assessed in terms of accuracy, but also in terms of other qualitative aspects that are important for users such as diversity, coverage, and serendipity. In addition, the paper discusses the influence of the size and composition of the group on the quality of the recommendations. The results show that the grouping strategy which produces the most accurate results depends on the algorithm that is used for generating individual recommendations. Therefore, the paper proposes a combination of grouping strategies which outperforms each individual strategy in terms of accuracy. Besides, the results show that the accuracy of the group recommendations increases as the similarity between members of the group increases. Also the diversity, coverage, and serendipity of the group recommendations are to a large extent dependent on the used grouping strategy and recommendation algorithm. Consequently for (commercial) group recommender systems, the grouping strategy and algorithm have to be chosen carefully in order to optimize the desired quality metrics of the group recommendations. The conclusions of this paper can be used as guidelines for this selection process

    A survey of data mining techniques for social media analysis

    Get PDF
    Social network has gained remarkable attention in the last decade. Accessing social network sites such as Twitter, Facebook LinkedIn and Google+ through the internet and the web 2.0 technologies has become more affordable. People are becoming more interested in and relying on social network for information, news and opinion of other users on diverse subject matters. The heavy reliance on social network sites causes them to generate massive data characterised by three computational issues namely; size, noise and dynamism. These issues often make social network data very complex to analyse manually, resulting in the pertinent use of computational means of analysing them. Data mining provides a wide range of techniques for detecting useful knowledge from massive datasets like trends, patterns and rules [44]. Data mining techniques are used for information retrieval, statistical modelling and machine learning. These techniques employ data pre-processing, data analysis, and data interpretation processes in the course of data analysis. This survey discusses different data mining techniques used in mining diverse aspects of the social network over decades going from the historical techniques to the up-to-date models, including our novel technique named TRCM. All the techniques covered in this survey are listed in the Table.1 including the tools employed as well as names of their authors

    Intelligent techniques for recommender systems

    Full text link
    This thesis focuses on the data sparsity issue and the temporal dynamic issue in the context of collaborative filtering, and addresses them with imputation techniques, low-rank subspace techniques and optimizations techniques from the machine learning perspective. A comprehensive survey on the development of collaborative filtering techniques is also included

    Context-Aware Recommendation Systems in Mobile Environments

    Get PDF
    Nowadays, the huge amount of information available may easily overwhelm users when they need to take a decision that involves choosing among several options. As a solution to this problem, Recommendation Systems (RS) have emerged to offer relevant items to users. The main goal of these systems is to recommend certain items based on user preferences. Unfortunately, traditional recommendation systems do not consider the user’s context as an important dimension to ensure high-quality recommendations. Motivated by the need to incorporate contextual information during the recommendation process, Context-Aware Recommendation Systems (CARS) have emerged. However, these recent recommendation systems are not designed with mobile users in mind, where the context and the movements of the users and items may be important factors to consider when deciding which items should be recommended. Therefore, context-aware recommendation models should be able to effectively and efficiently exploit the dynamic context of the mobile user in order to offer her/him suitable recommendations and keep them up-to-date.The research area of this thesis belongs to the fields of context-aware recommendation systems and mobile computing. We focus on the following scientific problem: how could we facilitate the development of context-aware recommendation systems in mobile environments to provide users with relevant recommendations? This work is motivated by the lack of generic and flexible context-aware recommendation frameworks that consider aspects related to mobile users and mobile computing. In order to solve the identified problem, we pursue the following general goal: the design and implementation of a context-aware recommendation framework for mobile computing environments that facilitates the development of context-aware recommendation applications for mobile users. In the thesis, we contribute to bridge the gap not only between recommendation systems and context-aware computing, but also between CARS and mobile computing.<br /

    Improvement of Recommender System by common rated movie similarity of users

    Get PDF
    ν•™μœ„λ…Όλ¬Έ (석사)-- μ„œμšΈλŒ€ν•™κ΅ λŒ€ν•™μ› : 산업곡학과, 2014. 8. μ‘°μ„±μ€€.μΆ”μ²œ μ‹œμŠ€ν…œμ€ μ „μ„Έκ³„μ μœΌλ‘œ 많이 쓰이고 있고 μ€‘μš”ν•œ 뢄야이닀. μ˜ν™”, μŒμ•… λ˜λŠ” μ±… λ“±μ˜ ν’ˆλͺ©μ΄ 점점 λ§Žμ•„μ§€κ³  인터넷이 λ°œλ‹¬ν•¨μ— 따라 μ‚¬μš©μžλ“€μ€ λ”μš± 더 λ§Žμ€ μ½˜ν…μΈ λ“€μ— μ ‘κ·Όν•  수 μžˆλ‹€. ν•˜μ§€λ§Œ μ‹€μ œλ‘œ μ†ŒλΉ„ν•  수 μžˆλŠ” μ½˜ν…μΈ λŠ” ν•œμ •μ μ΄λ‹€. λ”°λΌμ„œ μ†ŒλΉ„μžλ“€μ€ 본인의 μ·¨ν–₯에 μ ν•©ν•œ μ½˜ν…μΈ λ₯Ό κ³ λ₯΄κ³  μ‹Άμ–΄ν•œλ‹€. λ•Œλ¬Έμ— μΆ”μ²œμ‹œμŠ€ν…œμ€ μ€‘μš”ν•˜κ³  μœ μš©ν•˜κ²Œ μ‚¬μš©λ  수 μžˆλ‹€. μ‹€μ œλ‘œ Amazon μ΄λ‚˜ Netflixλ“±μ˜ μ„œλΉ„μŠ€λŠ” 이미 μ΄λŸ¬ν•œ μ‹œμŠ€ν…œμ„ λ„μž…ν•˜μ—¬ μ†ŒλΉ„μžλ“€μ΄ 직접 μ½˜ν…μΈ λ₯Ό 찾지 μ•Šμ•„λ„ κ°œλ³„ μ‚¬μš©μžμ—κ²Œ μ•Œλ§žμ€ μ½˜ν…μΈ λ“€μ„ μΆ”μ²œν•΄μ€€λ‹€. ν˜‘λ ₯ 필터링은 μΆ”μ²œμ‹œμŠ€ν…œμ— κ°€μž₯ 널리 쓰이고 κ°€μž₯ 많이 μ—°κ΅¬λ˜λŠ” λ°©λ²•μ€‘μ˜ ν•˜λ‚˜μ΄λ‹€. κ·Έ 쀑 μ‚¬μš©μžκΈ°λ°˜ ν˜‘λ ₯ 필터링은 νŠΉμ • μ‚¬μš©μžκ°€ ν₯λ―Έλ‘œμ›Œ ν•  수 μžˆλŠ” μ•„μ΄ν…œμ„ μ°ΎκΈ° μœ„ν•΄ 그와 μœ μ‚¬ν•œ μ‚¬μš©μžλ“€μ„ μ°Ύμ•„ κ·Έλ“€μ˜ 정보λ₯Ό μ΄μš©ν•˜μ—¬ μΆ”μ²œν•  μ•„μ΄ν…œμ„ μ°ΎλŠ”λ‹€. 이 λ•Œ, μ‚¬μš©μžλ“€μ˜ μœ μ‚¬λ„λ₯Ό κ΅¬ν•˜λŠ” 것은 맀우 μ€‘μš”ν•œ 과정이닀. λ³Έ μ—°κ΅¬μ—μ„œλŠ” μ‚¬μš©μžλ“€μ˜ 평점을 μ΄μš©ν•˜μ—¬ μ˜ν™”λ₯Ό μΆ”μ²œν•  λ•Œ μ‚¬μš©μžλ“€μ˜ μœ μ‚¬λ„λ₯Ό κ΅¬ν•˜λŠ” μƒˆλ‘œμš΄ 방법을 μ œμ•ˆν•œλ‹€. λ¨Όμ € μ‚¬μš©μžλ“€μ˜ 평가가 μžˆλŠ” μ˜ν™”λ“€μ€ μ‚¬μš©μžλ“€μ΄ 과거에 μ„ νƒν•΄μ„œ κ΄€λžŒν•œ μ˜ν™”λ“€μ΄λ‹€. 즉 이 μ˜ν™”λ“€μ€ μ‚¬μš©μžλ“€μ΄ μžμ‹ μ˜ μ·¨ν–₯에 적합할 것이라 μƒκ°ν•˜κ³  μ„ νƒν•˜μ˜€κΈ° λ•Œλ¬Έμ— μ„ νƒν•˜μ§€ μ•Šμ€ μ˜ν™”μ— λΉ„ν•΄ μ€‘μš”ν•˜λ‹€. λ”°λΌμ„œ 두 μ‚¬μš©μžκ°€ μ–Όλ§ŒνΌμ˜ μ˜ν™”λ₯Ό κ³΅ν†΅μœΌλ‘œ μ„ νƒν•˜μ—¬ ν‰κ°€ν•˜μ˜€λŠ”μ§€λ₯Ό κ³ λ €ν•˜μ˜€λ‹€. λ˜ν•œ κ³΅ν†΅μœΌλ‘œ μ„ νƒν•œ μ˜ν™”μ— λŒ€ν•˜μ—¬ μ‚¬μš©μžλ“€μ΄ μ–Όλ§ˆλ‚˜ λ§Œμ‘±ν•˜μ˜€λŠ”μ§€ κ³ λ €ν•˜μ˜€λ‹€. μ‚¬μš©μžκ°€ μ„ νƒν•œ ν•­λͺ©λ“€μ˜ 평균보닀 λ§Œμ‘±μŠ€λŸ¬μ› λŠ”μ§€, λΆˆλ§Œμ‘±μŠ€λŸ¬μ› λŠ”μ§€ 두 μ‚¬μš©μžκ°„μ˜ μ΄λŸ¬ν•œ 편ν–₯성이 μΌμΉ˜ν•˜λŠ”μ§€ κ³ λ €ν•˜μ—¬ μœ μ‚¬λ„λ₯Ό κ³„μ‚°ν•˜μ˜€λ‹€. λ³Έ μ—°κ΅¬μ—μ„œ μ œμ•ˆν•œ μœ μ‚¬λ„λ₯Ό μ΄μš©ν•˜μ—¬ ν˜‘λ ₯ 필터링을 μˆ˜ν–‰ν•œ κ²°κ³Ό 기쑴의 μœ μ‚¬λ„λ₯Ό κ΅¬ν•˜λŠ” 방법인 ν”Όμ–΄μŠ¨ 상관관계, μžμΉ΄λ“œ μœ μ‚¬λ„λ³΄λ‹€ 더 높은 예츑λ ₯을 λ³΄μ΄λŠ” 것을 μ•Œ 수 μžˆμ—ˆλ‹€. λ˜ν•œ 적은 μ΄μ›ƒλ§Œμ„ κ³ λ €ν•˜μ—¬ μ˜ˆμΈ‘ν–ˆμ„ λ•Œμ—λ„ μ„±λŠ₯이 쒋은 것을 ν™•μΈν•˜μ˜€λ‹€. μ œμ•ˆν•œ 방법을 μ‚¬μš©ν–ˆμ„ λ•Œ, 예츑였차 κ°’μ˜ 평균 MAE(Mean Absolue Error) 값은 0.7929둜 기쑴의 μœ μ‚¬λ„ 보닀 적은 값을 λ³΄μ˜€λ‹€.제 1 μž₯ μ„œ λ‘  1 제 2 μž₯ κΈ°μ‘΄ 연ꡬ 5 제 1 절 ν”Όμ–΄μŠ¨ 상관관계 8 제 2 절 μžμΉ΄λ“œ μœ μ‚¬λ„ 10 제 3 μž₯ 연ꡬ 방법 11 제 1 절 곡톡 평가 μ˜ν™” 12 제 2 절 편ν–₯μ„± 16 제 3 절 곡톡 평가 μ˜ν™”μ™€ 편ν–₯μ„± 19 제 4 μž₯ μ‹€ν—˜ μ„€μ • 및 κ²°κ³Ό 20 제 1 절 데이터 μ„€λͺ… 21 제 2 절 μ‹€ν—˜ μ„€μ • 23 제 3 절 μ‹€ν—˜ κ²°κ³Ό 26 제 5 μž₯ κ²° λ‘  33 제 6 μž₯ ν–₯ν›„ 연ꡬ λ°©ν–₯ 34 μ°Έκ³ λ¬Έν—Œ 35 Abstract 38Maste

    New accurate, explainable, and unbiased machine learning models for recommendation with implicit feedback.

    Get PDF
    Recommender systems have become ubiquitous Artificial Intelligence (AI) tools that play an important role in filtering online information in our daily lives. Whether we are shopping, browsing movies, or listening to music online, AI recommender systems are working behind the scene to provide us with curated and personalized content, that has been predicted to be relevant to our interest. The increasing prevalence of recommender systems has challenged researchers to develop powerful algorithms that can deliver recommendations with increasing accuracy. In addition to the predictive accuracy of recommender systems, recent research has also started paying attention to their fairness, in particular with regard to the bias and transparency of their predictions. This dissertation contributes to advancing the state of the art in fairness in AI by proposing new Machine Learning models and algorithms that aim to improve the user\u27s experience when receiving recommendations, with a focus that is positioned at the nexus of three objectives, namely accuracy, transparency, and unbiasedness of the predictions. In our research, we focus on state-of-the-art Collaborative Filtering (CF) recommendation approaches trained on implicit feedback data. More specifically, we address the limitations of two established deep learning approaches in two distinct recommendation settings, namely recommendation with user profiles and sequential recommendation. First, we focus on a state of the art pairwise ranking model, namely Bayesian Personalized Ranking (BPR), which has been found to outperform pointwise models in predictive accuracy in the recommendation with the user profiles setting. Specifically, we address two limitations of BPR: (1) BPR is a black box model that does not explain its outputs, thus limiting the user\u27s trust in the recommendations, and the analyst\u27s ability to scrutinize a model\u27s outputs; and (2) BPR is vulnerable to exposure bias due to the data being Missing Not At Random (MNAR). This exposure bias usually translates into an unfairness against the least popular items because they risk being under-exposed by the recommender system. We propose a novel explainable loss function and a corresponding model called Explainable Bayesian Personalized Ranking (EBPR) that generates recommendations along with item-based explanations. Then, we theoretically quantify the additional exposure bias resulting from the explainability, and use it as a basis to propose an unbiased estimator for the ideal EBPR loss. This being done, we perform an empirical study on three real-world benchmarking datasets that demonstrate the advantages of our proposed models, compared to existing state of the art techniques. Next, we shift our attention to sequential recommendation systems and focus on modeling and mitigating exposure bias in BERT4Rec, which is a state-of-the-art recommendation approach based on bidirectional transformers. The bi-directional representation capacity in BERT4Rec is based on the Cloze task, a.k.a. Masked Language Model, which consists of predicting randomly masked items within the sequence, assuming that the true interacted item is the most relevant one. This results in an exposure bias, where non-interacted items with low exposure propensities are assumed to be irrelevant. Thus far, the most common approach to mitigating exposure bias in recommendation has been Inverse Propensity Scoring (IPS), which consists of down-weighting the interacted predictions in the loss function in proportion to their propensities of exposure, yielding a theoretically unbiased learning. We first argue and prove that IPS does not extend to sequential recommendation because it fails to account for the sequential nature of the problem. We then propose a novel propensity scoring mechanism, that we name Inverse Temporal Propensity Scoring (ITPS), which is used to theoretically debias the Cloze task in sequential recommendation. We also rely on the ITPS framework to propose a bidirectional transformer-based model called ITPS-BERT4Rec. Finally, we empirically demonstrate the debiasing capabilities of our proposed approach and its robustness to the severity of exposure bias. Our proposed explainable approach in recommendation with user profiles, EBPR, showed an increase in ranking accuracy of about 4% and an increase in explainability of about 7% over the baseline BPR model when performing experiments on real-world recommendation datasets. Moreover, experiments on a real-world unbiased dataset demonstrated the importance of coupling explainability and exposure debiasing in capturing the true preferences of the user with a significant improvement of 1% over the baseline unbiased model UBPR. Furthermore, coupling explainability with exposure debiasing was also empirically proven to mitigate popularity bias with an improvement in popularity debiasing metrics of over 10% on three real-world recommendation tasks over the unbiased UBPR model. These results demonstrate the viability of our proposed approaches in recommendation with user profiles and their capacity to improve the user\u27s experience in recommendation by better capturing and modeling their true preferences, improving the explainability of the recommendations, and presenting them with more diverse recommendations that span a larger portion of the item catalog. On the other hand, our proposed approach in sequential recommendation ITPS-BERT4Rec has demonstrated a significant increase of 1% in terms of modeling the true preferences of the user in a semi-synthetic setting over the state-of-the-art sequential recommendation model BERT4Rec while also being unbiased in terms of exposure. Similarly, ITPS-BERT4Rec showed an average increase of 8.7% over BERT4Rec in three real-world recommendation settings. Moreover, empirical experiments demonstrated the robustness of our proposed ITPS-BERT4Rec model to increasing levels of exposure bias and its stability in terms of variance. Furthermore, experiments on popularity debiasing showed a significant advantage of our proposed ITPS-BERT4Rec model for both the short and long term sequences. Finally, ITPS-BERT4Rec showed respective improvements of around 60%, 470%, and 150% over vanilla BERT4Rec in capturing the temporal dependencies between the items within the sequences of interactions for three different evaluation metrics. These results demonstrate the potential of our proposed unbiased estimator to improve the user experience in the context of sequential recommendation by presenting them with more accurate and diverse recommendations that better match their true preferences and the sequential dependencies between the recommended items
    • …
    corecore