56,885 research outputs found
On the Permanence of Vertices in Network Communities
Despite the prevalence of community detection algorithms, relatively less
work has been done on understanding whether a network is indeed modular and how
resilient the community structure is under perturbations. To address this
issue, we propose a new vertex-based metric called "permanence", that can
quantitatively give an estimate of the community-like structure of the network.
The central idea of permanence is based on the observation that the strength
of membership of a vertex to a community depends upon the following two
factors: (i) the distribution of external connectivity of the vertex to
individual communities and not the total external connectivity, and (ii) the
strength of its internal connectivity and not just the total internal edges.
In this paper, we demonstrate that compared to other metrics, permanence
provides (i) a more accurate estimate of a derived community structure to the
ground-truth community and (ii) is more sensitive to perturbations in the
network. As a by-product of this study, we have also developed a community
detection algorithm based on maximizing permanence. For a modular network
structure, the results of our algorithm match well with ground-truth
communities.Comment: 10 pages, 5 figures, 8 tables, Accepted in 20th ACM SIGKDD Conference
on Knowledge Discovery and Data Minin
Outlier Edge Detection Using Random Graph Generation Models and Applications
Outliers are samples that are generated by different mechanisms from other
normal data samples. Graphs, in particular social network graphs, may contain
nodes and edges that are made by scammers, malicious programs or mistakenly by
normal users. Detecting outlier nodes and edges is important for data mining
and graph analytics. However, previous research in the field has merely focused
on detecting outlier nodes. In this article, we study the properties of edges
and propose outlier edge detection algorithms using two random graph generation
models. We found that the edge-ego-network, which can be defined as the induced
graph that contains two end nodes of an edge, their neighboring nodes and the
edges that link these nodes, contains critical information to detect outlier
edges. We evaluated the proposed algorithms by injecting outlier edges into
some real-world graph data. Experiment results show that the proposed
algorithms can effectively detect outlier edges. In particular, the algorithm
based on the Preferential Attachment Random Graph Generation model consistently
gives good performance regardless of the test graph data. Further more, the
proposed algorithms are not limited in the area of outlier edge detection. We
demonstrate three different applications that benefit from the proposed
algorithms: 1) a preprocessing tool that improves the performance of graph
clustering algorithms; 2) an outlier node detection algorithm; and 3) a novel
noisy data clustering algorithm. These applications show the great potential of
the proposed outlier edge detection techniques.Comment: 14 pages, 5 figures, journal pape
- …