Search CORE

15,275 research outputs found

Unifying Sparsest Cut, Cluster Deletion, and Modularity Clustering Objectives with Correlation Clustering

Author: Gleich David
Veldt Nate
Wirth Anthony
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2018
Field of study

Graph clustering, or community detection, is the task of identifying groups of closely related objects in a large network. In this paper we introduce a new community-detection framework called LambdaCC that is based on a specially weighted version of correlation clustering. A key component in our methodology is a clustering resolution parameter,

\lambda

, which implicitly controls the size and structure of clusters formed by our framework. We show that, by increasing this parameter, our objective effectively interpolates between two different strategies in graph clustering: finding a sparse cut and forming dense subgraphs. Our methodology unifies and generalizes a number of other important clustering quality functions including modularity, sparsest cut, and cluster deletion, and places them all within the context of an optimization problem that has been well studied from the perspective of approximation algorithms. Our approach is particularly relevant in the regime of finding dense clusters, as it leads to a 2-approximation for the cluster deletion problem. We use our approach to cluster several graphs, including large collaboration networks and social networks

arXiv.org e-Print Archive

University of Melbourne Institutional Repository

Community Structure Characterization

Author: A Clauset
A Lancichinetti
A Lancichinetti
C Bothorel
F Radicchi
G Palla
GK Orman
Hongyun Cai
J Creusefond
J Shi
J Yang
L da Fontoura Costa
M Girvan
M Rosvall
M Rosvall
M Tumminello
MEJ Newman
MEJ Newman
MEJ Newman
MEJ Newman
MEJ Newman
N Dugué
N Kashtan
NR Mabroukeh
P Bródka
R Guimera
S Asur
S Fortunato
S Fortunato
T Aynaud
T-C Fu
V Labatut
Vincent Labatut
X Han
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

This entry discusses the problem of describing some communities identified in a complex network of interest, in a way allowing to interpret them. We suppose the community structure has already been detected through one of the many methods proposed in the literature. The question is then to know how to extract valuable information from this first result, in order to allow human interpretation. This requires subsequent processing, which we describe in the rest of this entry

arXiv.org e-Print Archive

Crossref

Semi-supervised Embedding in Attributed Networks with Outliers

Author: Jacobs Peter
Liang Jiongqian
Parthasarathy Srinivasan
Sun Jiankai
Publication venue
Publication date: 26/04/2018
Field of study

In this paper, we propose a novel framework, called Semi-supervised Embedding in Attributed Networks with Outliers (SEANO), to learn a low-dimensional vector representation that systematically captures the topological proximity, attribute affinity and label similarity of vertices in a partially labeled attributed network (PLAN). Our method is designed to work in both transductive and inductive settings while explicitly alleviating noise effects from outliers. Experimental results on various datasets drawn from the web, text and image domains demonstrate the advantages of SEANO over state-of-the-art methods in semi-supervised classification under transductive as well as inductive settings. We also show that a subset of parameters in SEANO is interpretable as outlier score and can significantly outperform baseline methods when applied for detecting network outliers. Finally, we present the use of SEANO in a challenging real-world setting -- flood mapping of satellite images and show that it is able to outperform modern remote sensing algorithms for this task.Comment: in Proceedings of SIAM International Conference on Data Mining (SDM'18

arXiv.org e-Print Archive

Crossref