Search CORE

21,712 research outputs found

Collaborative filtering with diffusion-based similarity on tripartite graphs

Author: Adomavicius
Belkin
Brin
Broder
Cattuto
Herlocker
Jaccard
Kleinberg
Lee
Liben-Nowell
Liu
Ming-Sheng Shang
Ou
Ren
Resnick
Salton
Shang
Sun
Tao Zhou
Tso
Wang
Weimer
Yi-Cheng Zhang
Zhang
Zhang
Zhang
Zhou
Zhou
Zi-Ke Zhang
Publication venue: 'Elsevier BV'
Publication date: 17/10/2009
Field of study

Collaborative tags are playing more and more important role for the organization of information systems. In this paper, we study a personalized recommendation model making use of the ternary relations among users, objects and tags. We propose a measure of user similarity based on his preference and tagging information. Two kinds of similarities between users are calculated by using a diffusion-based process, which are then integrated for recommendation. We test the proposed method in a standard collaborative filtering framework with three metrics: ranking score, Recall and Precision, and demonstrate that it performs better than the commonly used cosine similarity.Comment: 8 pages, 4 figures, 1 tabl

arXiv.org e-Print Archive

Crossref

Ultra accurate collaborative information filtering via directed user similarity

Author: Guo Qiang
Liu Jian-Guo
Song Wen-Jun
Publication venue: 'IOP Publishing'
Publication date: 30/06/2014
Field of study

A key challenge of the collaborative filtering (CF) information filtering is how to obtain the reliable and accurate results with the help of peers' recommendation. Since the similarities from small-degree users to large-degree users would be larger than the ones opposite direction, the large-degree users' selections are recommended extensively by the traditional second-order CF algorithms. By considering the users' similarity direction and the second-order correlations to depress the influence of mainstream preferences, we present the directed second-order CF (HDCF) algorithm specifically to address the challenge of accuracy and diversity of the CF algorithm. The numerical results for two benchmark data sets, MovieLens and Netflix, show that the accuracy of the new algorithm outperforms the state-of-the-art CF algorithms. Comparing with the CF algorithm based on random-walks proposed in the Ref.7, the average ranking score could reach 0.0767 and 0.0402, which is enhanced by 27.3\% and 19.1\% for MovieLens and Netflix respectively. In addition, the diversity, precision and recall are also enhanced greatly. Without relying on any context-specific information, tuning the similarity direction of CF algorithms could obtain accurate and diverse recommendations. This work suggests that the user similarity direction is an important factor to improve the personalized recommendation performance.Comment: 6 pages, 4 figure

arXiv.org e-Print Archive

EDP Sciences OAI-PMH repository (1.2.0)

Personal Recommendation via Modified Collaborative Filtering

Author: Adomavicius
Balabanovi
Bing-Hong Wang
Broder
Chun-Xiao Jia
Duo Sun
Faloutsos
Goldberg
Herlocker
Holme
Huang
Konstan
Leicht
Liu
Maslov
Pazzani
Ren
Run-Ran Liu
Schafer
Sorenson
Tao Zhou
Zhang
Zhou
Zhou
Publication venue: 'Elsevier BV'
Publication date: 27/07/2008
Field of study

In this paper, we propose a novel method to compute the similarity between congeneric nodes in bipartite networks. Different from the standard Person correlation, we take into account the influence of node's degree. Substituting this new definition of similarity for the standard Person correlation, we propose a modified collaborative filtering (MCF). Based on a benchmark database, we demonstrate the great improvement of algorithmic accuracy for both user-based MCF and object-based MCF.Comment: 7 pages, 8 figures and 1 tabl

arXiv.org e-Print Archive

Crossref

RERO DOC Digital Library

Relevance feedback for best match term weighting algorithms in information retrieval

Author: Hiemstra D.
Robertson S.E.
Publication venue: European Research Consortium for Informatics and Mathematics
Publication date: 01/01/2001
Field of study

Personalisation in full text retrieval or full text filtering implies reweighting of the query terms based on some explicit or implicit feedback from the user. Relevance feedback inputs the user's judgements on previously retrieved documents to construct a personalised query or user profile. This paper studies relevance feedback within two probabilistic models of information retrieval: the first based on statistical language models and the second based on the binary independence probabilistic model. The paper shows the resemblance of the approaches to relevance feedback of these models, introduces new approaches to relevance feedback for both models, and evaluates the new relevance feedback algorithms on the TREC collection. The paper shows that there are no significant differences between simple and sophisticated approaches to relevance feedback

CiteSeerX

Radboud Repository

University of Twente Research Information

Learning Tree-based Deep Model for Recommender Systems

Author: Gai Kun
He Jie
Li Guozheng
Li Han
Li Xiang
Zhang Pengye
Zhu Han
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 20/12/2018
Field of study

Model-based methods for recommender systems have been studied extensively in recent years. In systems with large corpus, however, the calculation cost for the learnt model to predict all user-item preferences is tremendous, which makes full corpus retrieval extremely difficult. To overcome the calculation barriers, models such as matrix factorization resort to inner product form (i.e., model user-item preference as the inner product of user, item latent factors) and indexes to facilitate efficient approximate k-nearest neighbor searches. However, it still remains challenging to incorporate more expressive interaction forms between user and item features, e.g., interactions through deep neural networks, because of the calculation cost. In this paper, we focus on the problem of introducing arbitrary advanced models to recommender systems with large corpus. We propose a novel tree-based method which can provide logarithmic complexity w.r.t. corpus size even with more expressive models such as deep neural networks. Our main idea is to predict user interests from coarse to fine by traversing tree nodes in a top-down fashion and making decisions for each user-node pair. We also show that the tree structure can be jointly learnt towards better compatibility with users' interest distribution and hence facilitate both training and prediction. Experimental evaluations with two large-scale real-world datasets show that the proposed method significantly outperforms traditional methods. Online A/B test results in Taobao display advertising platform also demonstrate the effectiveness of the proposed method in production environments.Comment: Accepted by KDD 201

arXiv.org e-Print Archive

Crossref

On the Impact of Entity Linking in Microblog Real-Time Filtering

Author: Berardi G.
Han Z.
Ounis I.
Robertson S.
Soboroff I.
Zhang Y.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 10/11/2016
Field of study

Microblogging is a model of content sharing in which the temporal locality of posts with respect to important events, either of foreseeable or unforeseeable nature, makes applica- tions of real-time filtering of great practical interest. We propose the use of Entity Linking (EL) in order to improve the retrieval effectiveness, by enriching the representation of microblog posts and filtering queries. EL is the process of recognizing in an unstructured text the mention of relevant entities described in a knowledge base. EL of short pieces of text is a difficult task, but it is also a scenario in which the information EL adds to the text can have a substantial impact on the retrieval process. We implement a start-of-the-art filtering method, based on the best systems from the TREC Microblog track realtime adhoc retrieval and filtering tasks , and extend it with a Wikipedia-based EL method. Results show that the use of EL significantly improves over non-EL based versions of the filtering methods.Comment: 6 pages, 1 figure, 1 table. SAC 2015, Salamanca, Spain - April 13 - 17, 201

arXiv.org e-Print Archive

Crossref

A study into annotation ranking metrics in geo-tagged image corpora

Author: Hughes Mark
Jones Gareth J.F.
O'Connor Noel E.
Publication venue
Publication date: 24/10/2012
Field of study

Community contributed datasets are becoming increasingly common in automated image annotation systems. One important issue with community image data is that there is no guarantee that the associated metadata is relevant. A method is required that can accurately rank the semantic relevance of community annotations. This should enable the extracting of relevant subsets from potentially noisy collections of these annotations. Having relevant, non heterogeneous tags assigned to images should improve community image retrieval systems, such as Flickr, which are based on text retrieval methods. In the literature, the current state of the art approach to ranking the semantic relevance of Flickr tags is based on the widely used tf-idf metric. In the case of datasets containing landmark images, however, this metric is inefficient due to the high frequency of common landmark tags within the data set and can be improved upon. In this paper, we present a landmark recognition framework, that provides end-to-end automated recognition and annotation. In our study into automated annotation, we evaluate 5 alternate approaches to tf-idf to rank tag relevance in community contributed landmark image corpora. We carry out a thorough evaluation of each of these ranking metrics and results of this evaluation demonstrate that four of these proposed techniques outperform the current commonly-used tf-idf approach for this task

Irish Universities

DCU Online Research Access Service