Search CORE

25 research outputs found

Information Recovery from Pairwise Measurements

Author: Chen Yuxin
Goldsmith Andrea J.
Publication venue
Publication date: 02/08/2016
Field of study

A variety of information processing tasks in practice involve recovering

n

objects from single-shot graph-based measurements, particularly those taken over the edges of some measurement graph

\mathcal{G}

. This paper concerns the situation where each object takes value over a group of

M

different values, and where one is interested to recover all these values based on observations of certain pairwise relations over

\mathcal{G}

. The imperfection of measurements presents two major challenges for information recovery: 1)

\textit{inaccuracy}

: a (dominant) portion

1-p

of measurements are corrupted; 2)

\textit{incompleteness}

: a significant fraction of pairs are unobservable, i.e.

\mathcal{G}

can be highly sparse. Under a natural random outlier model, we characterize the

\textit{minimax recovery rate}

, that is, the critical threshold of non-corruption rate

p

below which exact information recovery is infeasible. This accommodates a very general class of pairwise relations. For various homogeneous random graph models (e.g. Erdos Renyi random graphs, random geometric graphs, small world graphs), the minimax recovery rate depends almost exclusively on the edge sparsity of the measurement graph

\mathcal{G}

irrespective of other graphical metrics. This fundamental limit decays with the group size

M

at a square root rate before entering a connectivity-limited regime. Under the Erdos Renyi random graph, a tractable combinatorial algorithm is proposed to approach the limit for large

M

(

M=n^{\Omega(1)}

), while order-optimal recovery is enabled by semidefinite programs in the small

M

regime. The extended (and most updated) version of this work can be found at (http://arxiv.org/abs/1504.01369).Comment: This version is no longer updated -- please find the latest version at (arXiv:1504.01369

arXiv.org e-Print Archive

CiteSeerX

Clustering from Sparse Pairwise Measurements

Author: Krzakala Florent
Lelarge Marc
Saade Alaa
Zdeborová Lenka
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/05/2016
Field of study

We consider the problem of grouping items into clusters based on few random pairwise comparisons between the items. We introduce three closely related algorithms for this task: a belief propagation algorithm approximating the Bayes optimal solution, and two spectral algorithms based on the non-backtracking and Bethe Hessian operators. For the case of two symmetric clusters, we conjecture that these algorithms are asymptotically optimal in that they detect the clusters as soon as it is information theoretically possible to do so. We substantiate this claim for one of the spectral approaches we introduce

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

HAL-CEA

Hal-Diderot

Fundamental Limits on Data Acquisition: Trade-offs between Sample Complexity and Query Difficulty

Author: Chung Hye Won
Hero Alfred O.
Lee Ji Oon
Publication venue
Publication date: 02/01/2018
Field of study

We consider query-based data acquisition and the corresponding information recovery problem, where the goal is to recover

k

binary variables (information bits) from parity measurements of those variables. The queries and the corresponding parity measurements are designed using the encoding rule of Fountain codes. By using Fountain codes, we can design potentially limitless number of queries, and corresponding parity measurements, and guarantee that the original