Metrics matter in community detection
We present a critical evaluation of normalized mutual information (NMI) as an
evaluation metric for community detection. NMI exaggerates the leximin method's
performance on weak communities: does leximin, in finding the trivial
singletons clustering, truly outperform eight other community detection
methods? Three improvements to NMI from the literature are AMI, rrNMI, and
cNMI. We show their equivalences under the relevant random models, and for
evaluating community detection we advise one-sided AMI under the random model
that draws uniformly from all partitions of the nodes. This work seeks (1) to
start a conversation on robust measurements, and (2) to advocate evaluations
that do not give a "free lunch".
An Exact No Free Lunch Theorem for Community Detection
A precondition for a No Free Lunch theorem is evaluation with a loss function
which does not assume a priori superiority of some outputs over others. A
previous result for community detection by Peel et al. (2017) relies on a
mismatch between the loss function and the problem domain. The loss function
computes an expectation over only a subset of the universe of possible outputs;
thus, it is only asymptotically appropriate with respect to the problem size.
By using the correct random model for the problem domain, we provide a
stronger, exact No Free Lunch theorem for community detection. The claim
generalizes to other set-partitioning tasks, including core/periphery
separation, k-clustering, and graph partitioning. Finally, we review the
literature of proposed evaluation functions and identify functions which
(perhaps with slight modifications) are compatible with an exact No Free Lunch
theorem.
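The random model in question ranges over the full universe of outputs, i.e., every partition of the node set. A small sketch (our illustration under that assumption, not the paper's code) enumerates this universe recursively; its size is the Bell number, and a loss averaged uniformly over it gives no method an a priori advantage:

```python
def set_partitions(items):
    """Yield every partition of `items` as a list of blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for part in set_partitions(rest):
        # Place `first` into each existing block in turn...
        for i in range(len(part)):
            yield part[:i] + [[first] + part[i]] + part[i + 1:]
        # ...or open a new singleton block for it.
        yield part + [[first]]

universe = list(set_partitions([1, 2, 3, 4]))
print(len(universe))  # 15, the Bell number B(4)
```

Averaging a loss over only a subset of this universe, as in the earlier result, is what creates the mismatch the abstract criticizes; the exact theorem uses the uniform model over all of it.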