Search CORE

42,149 research outputs found

Matrix completion with queries

Author: Crovella Mark
Ruchansky Natali
Terzi Evimaria
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 30/04/2017
Field of study

In many applications, e.g., recommender systems and traffic monitoring, the data comes in the form of a matrix that is only partially observed and low rank. A fundamental data-analysis task for these datasets is matrix completion, where the goal is to accurately infer the entries missing from the matrix. Even when the data satisfies the low-rank assumption, classical matrix-completion methods may output completions with significant error -- in that the reconstructed matrix differs significantly from the true underlying matrix. Often, this is due to the fact that the information contained in the observed entries is insufficient. In this work, we address this problem by proposing an active version of matrix completion, where queries can be made to the true underlying matrix. Subsequently, we design Order&Extend, which is the first algorithm to unify a matrix-completion approach and a querying strategy into a single algorithm. Order&Extend is able identify and alleviate insufficient information by judiciously querying a small number of additional entries. In an extensive experimental evaluation on real-world datasets, we demonstrate that our algorithm is efficient and is able to accurately reconstruct the true matrix while asking only a small number of queries.Comment: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Minin

arXiv.org e-Print Archive

CiteSeerX

Near-optimal asymmetric binary matrix partitions

Author: A Ghosh
B Lehmann
GA Akerlof
I Caragiannis
I Caragiannis
J Cremer
J Cremer
J Levin
N Alon
P Milgrom
PR Milgrom
PR Milgrom
S Athanassopoulos
S Athanassopoulos
S Khot
U Feige
V Crawford
Publication venue
Publication date: 08/04/2015
Field of study

We study the asymmetric binary matrix partition problem that was recently introduced by Alon et al. (WINE 2013) to model the impact of asymmetric information on the revenue of the seller in take-it-or-leave-it sales. Instances of the problem consist of an

n \times m

binary matrix

A

and a probability distribution over its columns. A partition scheme

B=(B_1,...,B_n)

consists of a partition

B_i

for each row

i

A

. The partition

B_i

acts as a smoothing operator on row

i

that distributes the expected value of each partition subset proportionally to all its entries. Given a scheme

B

that induces a smooth matrix

A^B

, the partition value is the expected maximum column entry of

A^B

. The objective is to find a partition scheme such that the resulting partition value is maximized. We present a

9/10

-approximation algorithm for the case where the probability distribution is uniform and a

(1-1/e)

-approximation algorithm for non-uniform distributions, significantly improving results of Alon et al. Although our first algorithm is combinatorial (and very simple), the analysis is based on linear programming and duality arguments. In our second result we exploit a nice relation of the problem to submodular welfare maximization.Comment: 17 page

arXiv.org e-Print Archive

Crossref

Open-Vocabulary Semantic Parsing with both Distributional Statistics and Formal Knowledge

Author: Gardner Matt
Krishnamurthy Jayant
Publication venue
Publication date: 28/11/2016
Field of study

Traditional semantic parsers map language onto compositional, executable queries in a fixed schema. This mapping allows them to effectively leverage the information contained in large, formal knowledge bases (KBs, e.g., Freebase) to answer questions, but it is also fundamentally limiting---these semantic parsers can only assign meaning to language that falls within the KB's manually-produced schema. Recently proposed methods for open vocabulary semantic parsing overcome this limitation by learning execution models for arbitrary language, essentially using a text corpus as a kind of knowledge base. However, all prior approaches to open vocabulary semantic parsing replace a formal KB with textual information, making no use of the KB in their models. We show how to combine the disparate representations used by these two approaches, presenting for the first time a semantic parser that (1) produces compositional, executable representations of language, (2) can successfully leverage the information contained in both a formal KB and a large corpus, and (3) is not limited to the schema of the underlying KB. We demonstrate significantly improved performance over state-of-the-art baselines on an open-domain natural language question answering task.Comment: Re-written abstract and intro, other minor changes throughout. This version published at AAAI 201

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications