Search CORE

490 research outputs found

Spectral Clustering of Graphs with the Bethe Hessian

Author: Krzakala Florent
Saade Alaa
Zdeborová Lenka
Publication venue
Publication date: 08/09/2014
Field of study

Spectral clustering is a standard approach to label nodes on a graph by studying the (largest or lowest) eigenvalues of a symmetric real matrix such as e.g. the adjacency or the Laplacian. Recently, it has been argued that using instead a more complicated, non-symmetric and higher dimensional operator, related to the non-backtracking walk on the graph, leads to improved performance in detecting clusters, and even to optimal performance for the stochastic block model. Here, we propose to use instead a simpler object, a symmetric real matrix known as the Bethe Hessian operator, or deformed Laplacian. We show that this approach combines the performances of the non-backtracking operator, thus detecting clusters all the way down to the theoretical limit in the stochastic block model, with the computational, theoretical and memory advantages of real symmetric matrices.Comment: 8 pages, 2 figure

arXiv.org e-Print Archive

HAL-CEA

Matrix Completion from Fewer Entries: Spectral Detectability and Rank Estimation

Author: Krzakala Florent
Saade Alaa
Zdeborová Lenka
Publication venue
Publication date: 01/01/2015
Field of study

The completion of low rank matrices from few entries is a task with many practical applications. We consider here two aspects of this problem: detectability, i.e. the ability to estimate the rank

r

reliably from the fewest possible random entries, and performance in achieving small reconstruction error. We propose a spectral algorithm for these two tasks called MaCBetH (for Matrix Completion with the Bethe Hessian). The rank is estimated as the number of negative eigenvalues of the Bethe Hessian matrix, and the corresponding eigenvectors are used as initial condition for the minimization of the discrepancy between the estimated matrix and the revealed entries. We analyze the performance in a random matrix setting using results from the statistical mechanics of the Hopfield neural network, and show in particular that MaCBetH efficiently detects the rank

r

of a large

n\times m

matrix from

C(r)r\sqrt{nm}

entries, where

C(r)

is a constant close to

1

. We also evaluate the corresponding root-mean-square error empirically and show that MaCBetH compares favorably to other existing approaches.Comment: NIPS Conference 201

arXiv.org e-Print Archive

CiteSeerX

HAL-CEA

Clustering from Sparse Pairwise Measurements

Author: Krzakala Florent
Lelarge Marc
Saade Alaa
Zdeborová Lenka
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 19/05/2016
Field of study

We consider the problem of grouping items into clusters based on few random pairwise comparisons between the items. We introduce three closely related algorithms for this task: a belief propagation algorithm approximating the Bayes optimal solution, and two spectral algorithms based on the non-backtracking and Bethe Hessian operators. For the case of two symmetric clusters, we conjecture that these algorithms are asymptotically optimal in that they detect the clusters as soon as it is information theoretically possible to do so. We substantiate this claim for one of the spectral approaches we introduce

arXiv.org e-Print Archive

Crossref

INRIA a CCSD electronic archive server

HAL-CEA

Hal-Diderot

Performance of a community detection algorithm based on semidefinite programming

Author: Javanmard Adel
Montanari Andrea
RICCI TERSENGHI Federico
Publication venue: 'IOP Publishing'
Publication date: 01/01/2016
Field of study

The problem of detecting communities in a graph is maybe one the most studied inference problems, given its simplicity and widespread diffusion among several disciplines. A very common benchmark for this problem is the stochastic block model or planted partition problem, where a phase transition takes place in the detection of the planted partition by changing the signal-to-noise ratio. Optimal algorithms for the detection exist which are based on spectral methods, but we show these are extremely sensible to slight modification in the generative model. Recently Javanmard, Montanari and Ricci-Tersenghi [1] have used statistical physics arguments, and numerical simulations to show that finding communities in the stochastic block model via semidefinite programming is quasi optimal. Further, the resulting semidefinite relaxation can be solved efficiently, and is very robust with respect to changes in the generative model. In this paper we study in detail several practical aspects of this new algorithm based on semidefinite programming for the detection of the planted partition. The algorithm turns out to be very fast, allowing the solution of problems with O(105) variables in few second on a laptop computer

arXiv.org e-Print Archive

Archivio della ricerca- Università di Roma La Sapienza