Search CORE

Detecting highly overlapping community structure by greedy clique expansion

Author: Hurley Neil
Lee Conrad
McDaid Aaron
Reid Fergal
Publication venue
Publication date: 01/01/2010
Field of study

In complex networks it is common for each node to belong to several communities, implying a highly overlapping community structure. Recent advances in benchmarking indicate that existing community assignment algorithms that are capable of detecting overlapping communities perform well only when the extent of community overlap is kept to modest levels. To overcome this limitation, we introduce a new community assignment algorithm called Greedy Clique Expansion (GCE). The algorithm identifies distinct cliques as seeds and expands these seeds by greedily optimizing a local fitness function. We perform extensive benchmarks on synthetic data to demonstrate that GCE's good performance is robust across diverse graph topologies. Significantly, GCE is the only algorithm to perform well on these synthetic graphs, in which every node belongs to multiple communities. Furthermore, when put to the task of identifying functional modules in protein interaction data, and college dorm assignments in Facebook friendship data, we find that GCE performs competitively.Comment: 10 pages, 7 Figures. Implementation source and binaries available at http://sites.google.com/site/greedycliqueexpansion

Research Repository UCD

Irish Universities

Communities in Networks

Author: J. Mucha
Jukka-pekka Onnela
Jukka-pekka Onnela
Mason A. Porter
Mason A. Porter
Peter
Peter J. Mucha
Publication venue
Publication date: 01/01/2009
Field of study

We survey some of the concepts, methods, and applications of community detection, which has become an increasingly important area of network science. To help ease newcomers into the field, we provide a guide to available methodology and open problems, and discuss why scientists from diverse backgrounds are interested in these problems. As a running theme, we emphasize the connections of community detection to problems in statistical physics and computational optimization.Comment: survey/review article on community structure in networks; published version is available at http://people.maths.ox.ac.uk/~porterm/papers/comnotices.pd

Oxford University Research Archive

Recent advances in clustering methods for protein interaction networks

Author: Deng Youping
Li Min
Pan Yi
Wang Jianxin
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

The increasing availability of large-scale protein-protein interaction data has made it possible to understand the basic components and organization of cell machinery from the network level. The arising challenge is how to analyze such complex interacting data to reveal the principles of cellular organization, processes and functions. Many studies have shown that clustering protein interaction network is an effective approach for identifying protein complexes or functional modules, which has become a major research topic in systems biology. In this review, recent advances in clustering methods for protein interaction networks will be presented in detail. The predictions of protein functions and interactions based on modules will be covered. Finally, the performance of different clustering methods will be compared and the directions for future research will be discussed

Crossref

ScholarWorks @ Georgia State University

Springer - Publisher Connector

Community landscapes: an integrative approach to determine overlapping network module hierarchy, identify key nodes and predict network dynamics

Author: A Arenas
A Capocci
A Hinneburg
A Lancichinetti
A Lancichinetti
AK Ramani
C Baerveldt
D Ekman
D Krioukov
DJ Watts
DL Nelson
ER Gansner
F Radicchi
G Palla
G Tibély
H Yu
I Kovacs
I Vragovic
István A. Kovács
J Moody
JB Axelsen
JD Han
JM Kumpula
JM Thevelein
JP Bagrow
JP Eckmann
JW Berry
K Komurov
M Blatt
M Fiedler
M Girvan
M Grendar
M Rosvall
ME Newman
ME Newman
ML Clark
Máté S. Szalay
N Bertin
Olaf Sporns
P Csermely
P Pons
Peter Csermely
PM Kim
Robin Palotai
S Fortunato
S Fortunato
S Fortunato
T Nepusz
TS Evans
V Latora
VD Blondel
WW Zachary
Y-Y Ahn
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2010
Field of study

Background: Network communities help the functional organization and evolution of complex networks. However, the development of a method, which is both fast and accurate, provides modular overlaps and partitions of a heterogeneous network, has proven to be rather difficult. Methodology/Principal Findings: Here we introduce the novel concept of ModuLand, an integrative method family determining overlapping network modules as hills of an influence function-based, centrality-type community landscape, and including several widely used modularization methods as special cases. As various adaptations of the method family, we developed several algorithms, which provide an efficient analysis of weighted and directed networks, and (1) determine pervasively overlapping modules with high resolution; (2) uncover a detailed hierarchical network structure allowing an efficient, zoom-in analysis of large networks; (3) allow the determination of key network nodes and (4) help to predict network dynamics. Conclusions/Significance: The concept opens a wide range of possibilities to develop new approaches and applications including network routing, classification, comparison and prediction.Comment: 25 pages with 6 figures and a Glossary + Supporting Information containing pseudo-codes of all algorithms used, 14 Figures, 5 Tables (with 18 module definitions, 129 different modularization methods, 13 module comparision methods) and 396 references. All algorithms can be downloaded from this web-site: http://www.linkgroup.hu/modules.ph

Directory of Open Access Journals

Crossref

ELTE Digital Institutional Repository (EDIT)

Multiple Alignment of Protein Interaction Networks by Three-Index Assignment Algorithm

Author: Arora Arushi
Publication venue: 'University of Windsor Leddy Library'
Publication date: 01/01/2013
Field of study

Bio-molecular networks have led to many discoveries in molecular biology. The most atypical of them are protein-protein interaction (PPI) networks. In PPI networks the nodes refer to proteins and edges refer to interactions between nodes. The comparison of PPI networks can be demonstrated as a powerful approach for examining interactions in these networks and predicting protein functions. This thesis contributes a new alignment algorithm for aligning three PPI networks. We examine how Three-Index Assignment Problem via Hungarian Pair Matching algorithm is used to maximize the complete match between the three networks to identify protein triplets with higher similarity. We have performed tests on PPI networks extracted from the IntAct database and IsoRank database. We experimentally show that the results obtained by our method have more biological significance in comparison to other methods and can be used in future to predict protein functions and complexes in PPI networks

Scholarship at UWindsor

Clustering and Community Detection in Directed Networks: A Survey

Author: Malliaros Fragkiskos D.
Vazirgiannis Michalis
Publication venue: 'Elsevier BV'
Publication date: 05/08/2013
Field of study

Networks (or graphs) appear as dominant structures in diverse domains, including sociology, biology, neuroscience and computer science. In most of the aforementioned cases graphs are directed - in the sense that there is directionality on the edges, making the semantics of the edges non symmetric. An interesting feature that real networks present is the clustering or community structure property, under which the graph topology is organized into modules commonly called communities or clusters. The essence here is that nodes of the same community are highly similar while on the contrary, nodes across communities present low similarity. Revealing the underlying community structure of directed complex networks has become a crucial and interdisciplinary topic with a plethora of applications. Therefore, naturally there is a recent wealth of research production in the area of mining directed graphs - with clustering being the primary method and tool for community detection and evaluation. The goal of this paper is to offer an in-depth review of the methods presented so far for clustering directed networks along with the relevant necessary methodological background and also related applications. The survey commences by offering a concise review of the fundamental concepts and methodological base on which graph clustering algorithms capitalize on. Then we present the relevant work along two orthogonal classifications. The first one is mostly concerned with the methodological principles of the clustering algorithms, while the second one approaches the methods from the viewpoint regarding the properties of a good cluster in a directed network. Further, we present methods and metrics for evaluating graph clustering results, demonstrate interesting application domains and provide promising future research directions.Comment: 86 pages, 17 figures. Physics Reports Journal (To Appear

Module detection in complex networks using integer optimisation

Author: Bennett Laura
Papageorgiou Lazaros G
Tsoka Sophia
Xu Gang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background The detection of <it>modules or community structure </it>is widely used to reveal the underlying properties of complex networks in biology, as well as physical and social sciences. Since the adoption of modularity as a measure of network topological properties, several methodologies for the discovery of community structure based on modularity maximisation have been developed. However, satisfactory partitions of large graphs with modest computational resources are particularly challenging due to the NP-hard nature of the related optimisation problem. Furthermore, it has been suggested that optimising the modularity metric can reach a resolution limit whereby the algorithm fails to detect smaller communities than a specific size in large networks. Results We present a novel solution approach to identify community structure in large complex networks and address resolution limitations in module detection. The proposed algorithm employs modularity to express network community structure and it is based on mixed integer optimisation models. The solution procedure is extended through an iterative procedure to diminish effects that tend to agglomerate smaller modules (resolution limitations). Conclusions A comprehensive comparative analysis of methodologies for module detection based on modularity maximisation shows that our approach outperforms previously reported methods. Furthermore, in contrast to previous reports, we propose a strategy to handle resolution limitations in modularity maximisation. Overall, we illustrate ways to improve existing methodologies for community structure identification so as to increase its efficiency and applicability.</p

Springer - Publisher Connector

Crossref

Directory of Open Access Journals

King's Research Portal

A least square method based model for identifying protein complexes in protein-protein interaction network

Author: Maozu Guo
Qiguo Dai
Xiaoyan Liu
Yang Liu
Yingjie Guo
Zhixia Teng
Publication venue
Publication date: 10/04/2020
Field of study

Protein complex formed by a group of physical interacting proteins plays a crucial role in cell activities. Great effort has been made to computationally identify protein complexes from protein-protein interaction (PPI) network. However, the accuracy of the prediction is still far from being satisfactory, because the topological structures of protein complexes in the PPI network are too complicated. This paper proposes a novel optimization framework to detect complexes from PPI network, named PLSMC. The method is on the basis of the fact that if two proteins are in a common complex, they are likely to be interacting. PLSMC employs this relation to determine complexes by a penalized least squares method. PLSMC is applied to several public yeast PPI networks, and compared with several state-of-the-art methods. The results indicate that PLSMC outperforms other methods. In particular, complexes predicted by PLSMC can match known complexes with a higher accuracy than other methods. Furthermore, the predicted complexes have high functional homogeneity