922 research outputs found

Generalized Separable Nonnegative Matrix Factorization

Author: Gillis Nicolas
Pan Junjun
Publication date: 01/01/2019

Nonnegative matrix factorization (NMF) is a linear dimensionality reduction technique for nonnegative data with applications such as image analysis, text mining, audio source separation and hyperspectral unmixing. Given a data matrix $M$ and a factorization rank $r$, NMF looks for a nonnegative matrix $W$ with $r$ columns and a nonnegative matrix $H$ with $r$ rows such that $M \approx WH$. NMF is NP-hard to solve in general. However, it can be computed efficiently under the separability assumption, which requires that the basis vectors appear as data points, that is, that there exists an index set $\mathcal{K}$ such that $W = M(:,\mathcal{K})$. In this paper, we generalize the separability assumption: we only require that for each rank-one factor $W(:,k)H(k,:)$ for $k=1,2,\dots,r$, either $W(:,k) = M(:,j)$ for some $j$ or $H(k,:) = M(i,:)$ for some $i$. We refer to the corresponding problem as generalized separable NMF (GS-NMF). We discuss some properties of GS-NMF and propose a convex optimization model which we solve using a fast gradient method. We also propose a heuristic algorithm inspired by the successive projection algorithm. To verify the effectiveness of our methods, we compare them with several state-of-the-art separable NMF algorithms on synthetic, document and image data sets.

Comment: 31 pages, 12 figures, 4 tables. We have added discussions about the identifiability of the model, we have modified the first synthetic experiment, we have clarified some aspects of the contributio
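For readers unfamiliar with the separability assumption, the following is a minimal sketch of the classical successive projection algorithm (SPA) for ordinary separable NMF, not the GS-NMF method proposed in the paper. The function name `spa`, the list-of-rows matrix layout, and the toy data are assumptions made for illustration only. The sketch repeatedly picks the column with the largest Euclidean norm and projects all columns onto the orthogonal complement of the chosen one.

```python
from math import sqrt


def spa(M, r):
    """Classical SPA sketch: return up to r column indices of M.

    M is a list of rows (hypothetical layout chosen for this example).
    Under separability, the returned indices point at the columns of M
    that play the role of the basis matrix W = M(:, K).
    """
    m, n = len(M), len(M[0])
    # Residual matrix, stored column-wise so that each column is one list.
    R = [[M[i][j] for i in range(m)] for j in range(n)]
    selected = []
    for _ in range(r):
        # Squared Euclidean norm of every residual column.
        norms = [sum(x * x for x in col) for col in R]
        j = max(range(n), key=norms.__getitem__)
        if norms[j] == 0.0:
            break  # nothing left to extract
        selected.append(j)
        # Unit vector along the selected residual column.
        u = [x / sqrt(norms[j]) for x in R[j]]
        # Project every column onto the orthogonal complement of u.
        for col in R:
            dot = sum(a * b for a, b in zip(u, col))
            for i in range(m):
                col[i] -= dot * u[i]
    return selected


# Toy separable data: three "pure" columns and two convex mixtures of them.
w1, w2, w3 = [3.0, 0.0, 1.0], [0.0, 2.0, 1.0], [1.0, 1.0, 3.0]
mix_a = [0.5 * a + 0.5 * b for a, b in zip(w1, w2)]
mix_b = [0.25 * a + 0.25 * b + 0.5 * c for a, b, c in zip(w1, w2, w3)]
columns = [mix_a, w1, mix_b, w3, w2]          # pure columns sit at 1, 3, 4
M = [[c[i] for c in columns] for i in range(3)]

selected = spa(M, 3)
print(sorted(selected))  # indices of the pure columns
```

Once the index set is known, $H$ is typically estimated by solving a nonnegative least squares problem with $W = M(:,\mathcal{K})$ fixed; that step is omitted here.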

arXiv.org e-Print Archive

Crossref

Dictionary-based Tensor Canonical Polyadic Decomposition

Author: Cohen Jérémy E.
Gillis Nicolas
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 08/11/2017

To ensure interpretability of extracted sources in tensor decomposition, we introduce in this paper a dictionary-based tensor canonical polyadic decomposition which forces one factor to belong exactly to a known dictionary. A new formulation of sparse coding is proposed which enables dictionary-based canonical polyadic decomposition of high-dimensional tensors. The benefits of using a dictionary in tensor decomposition models are explored both in terms of parameter identifiability and estimation accuracy. The performance of the proposed algorithms is evaluated on the decomposition of simulated data and the unmixing of hyperspectral images.
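As a rough sketch of the model described in this abstract (the notation is an assumption, not taken from the abstract itself): a rank-$r$ canonical polyadic decomposition approximates a tensor, here assumed to be third-order, by a sum of $r$ rank-one terms built from factor matrices $A$, $B$ and $C$. The dictionary constraint can then be read as requiring the columns of one factor, say $A$, to be picked from a known dictionary $D$ through a hypothetical list $\mathcal{K}$ of $r$ column indices:

$$
\mathcal{T} \approx \sum_{q=1}^{r} A(:,q) \circ B(:,q) \circ C(:,q), \qquad A = D(:,\mathcal{K}),
$$

where $\circ$ denotes the outer product.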

arXiv.org e-Print Archive

Consistent Estimation of Mixed Memberships with Successive Projections

Author: G Palla
N Gillis
T Mizutani
U Luxburg Von
Z Lu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/10/2017

This paper considers the parameter estimation problem in the Mixed Membership Stochastic Block Model (MMSB), which is a quite general instance of a random graph model allowing for overlapping community structure. We present a new algorithm, successive projection overlapping clustering (SPOC), which combines the ideas of spectral clustering and the geometric approach to separable non-negative matrix factorization. The proposed algorithm is provably consistent under MMSB with general conditions on the parameters of the model. SPOC is also shown to perform well experimentally in comparison to other algorithms.

arXiv.org e-Print Archive

Crossref