Search CORE

56,552 research outputs found

A Harmonic Extension Approach for Collaborative Ranking

Author: Bertozzi Andrea
Kuang Da
Osher Stanley
Shi Zuoqiang
Publication venue
Publication date: 16/02/2016
Field of study

We present a new perspective on graph-based methods for collaborative ranking for recommender systems. Unlike user-based or item-based methods that compute a weighted average of ratings given by the nearest neighbors, or low-rank approximation methods using convex optimization and the nuclear norm, we formulate matrix completion as a series of semi-supervised learning problems, and propagate the known ratings to the missing ones on the user-user or item-item graph globally. The semi-supervised learning problems are expressed as Laplace-Beltrami equations on a manifold, or namely, harmonic extension, and can be discretized by a point integral method. We show that our approach does not impose a low-rank Euclidean subspace on the data points, but instead minimizes the dimension of the underlying manifold. Our method, named LDM (low dimensional manifold), turns out to be particularly effective in generating rankings of items, showing decent computational efficiency and robust ranking quality compared to state-of-the-art methods

arXiv.org e-Print Archive

eScholarship - University of California

An LSH Index for Computing Kendall's Tau over Top-k Lists

Author: Michel Sebastian
Pal Koninika
Publication venue
Publication date: 01/01/2014
Field of study

We consider the problem of similarity search within a set of top-k lists under the Kendall's Tau distance function. This distance describes how related two rankings are in terms of concordantly and discordantly ordered items. As top-k lists are usually very short compared to the global domain of possible items to be ranked, creating an inverted index to look up overlapping lists is possible but does not capture tight enough the similarity measure. In this work, we investigate locality sensitive hashing schemes for the Kendall's Tau distance and evaluate the proposed methods using two real-world datasets.Comment: 6 pages, 8 subfigures, presented in Seventeenth International Workshop on the Web and Databases (WebDB 2014) co-located with ACM SIGMOD201

arXiv.org e-Print Archive

MPG.PuRe

ANTIQUE: A Non-Factoid Question Answering Benchmark

Author: Aliannejadi Mohammad
Croft W. Bruce
Hashemi Helia
Zamani Hamed
Publication venue
Publication date: 19/08/2019
Field of study

Considering the widespread use of mobile and voice search, answer passage retrieval for non-factoid questions plays a critical role in modern information retrieval systems. Despite the importance of the task, the community still feels the significant lack of large-scale non-factoid question answering collections with real questions and comprehensive relevance judgments. In this paper, we develop and release a collection of 2,626 open-domain non-factoid questions from a diverse set of categories. The dataset, called ANTIQUE, contains 34,011 manual relevance annotations. The questions were asked by real users in a community question answering service, i.e., Yahoo! Answers. Relevance judgments for all the answers to each question were collected through crowdsourcing. To facilitate further research, we also include a brief analysis of the data as well as baseline results on both classical and recently developed neural IR models

arXiv.org e-Print Archive

Crossref

International Migration, Integration and Social Cohesion online publications

Learning to Rank based on Analogical Reasoning

Author: Fahandar Mohsen Ahmadi
Hüllermeier Eyke
Publication venue
Publication date: 28/11/2017
Field of study

Object ranking or "learning to rank" is an important problem in the realm of preference learning. On the basis of training data in the form of a set of rankings of objects represented as feature vectors, the goal is to learn a ranking function that predicts a linear order of any new set of objects. In this paper, we propose a new approach to object ranking based on principles of analogical reasoning. More specifically, our inference pattern is formalized in terms of so-called analogical proportions and can be summarized as follows: Given objects

A,B,C,D

, if object

A

is known to be preferred to

B

, and

C

relates to

D

A

relates to

B

, then

C

is (supposedly) preferred to

D

. Our method applies this pattern as a main building block and combines it with ideas and techniques from instance-based learning and rank aggregation. Based on first experimental results for data sets from various domains (sports, education, tourism, etc.), we conclude that our approach is highly competitive. It appears to be specifically interesting in situations in which the objects are coming from different subdomains, and which hence require a kind of knowledge transfer.Comment: Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18), 8 page

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Greedy Randomized Adaptive Search and Variable Neighbourhood Search for the minimum labelling spanning tree problem

Author: Avis
Brüggemann
Cerulli
Chang
Demšar
Duin
Feo
Friedman
Glover
Goldberg
Hansen
Hansen
Hollander
Iman
J.A. Moreno Pérez
K. Darby-Dowman
Krumke
Mladenović
N. Mladenović
Resende
S. Consoli
Tanenbaum
Van-Nes
Voß
Voß
Wan
Xiong
Xiong
Xiong
Publication venue: 'Elsevier BV'
Publication date: 01/07/2009
Field of study

This paper studies heuristics for the minimum labelling spanning tree (MLST) problem. The purpose is to find a spanning tree using edges that are as similar as possible. Given an undirected labelled connected graph, the minimum labelling spanning tree problem seeks a spanning tree whose edges have the smallest number of distinct labels. This problem has been shown to be NP-hard. A Greedy Randomized Adaptive Search Procedure (GRASP) and a Variable Neighbourhood Search (VNS) are proposed in this paper. They are compared with other algorithms recommended in the literature: the Modified Genetic Algorithm and the Pilot Method. Nonparametric statistical tests show that the heuristics based on GRASP and VNS outperform the other algorithms tested. Furthermore, a comparison with the results provided by an exact approach shows that we may quickly obtain optimal or near-optimal solutions with the proposed heuristics

Crossref

Brunel University Research Archive