
5,987 research outputs found

Methods for protein complex prediction and their contributions towards understanding the organization, function and dynamics of complexes

Author: Patil Ashwini
Srihari Sriganesh
Wong Limsoon
Yong Chern Han
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015

Complexes of physically interacting proteins constitute fundamental functional units responsible for driving biological processes within cells. A faithful reconstruction of the entire set of complexes is therefore essential to understand the functional organization of cells. In this review, we discuss the key contributions of computational methods developed to date (approximately between 2003 and 2015) for identifying complexes from the network of interacting proteins (PPI network). We evaluate in depth the performance of these methods on PPI datasets from yeast, and highlight challenges faced by these methods, in particular the detection of sparse and small complexes or sub-complexes and the discerning of overlapping complexes. We describe methods for integrating diverse information, including expression profiles and 3D structures of proteins, with PPI networks to understand the dynamics of complex formation, for instance, the time-based assembly of complex subunits and the formation of fuzzy complexes from intrinsically disordered proteins. Finally, we discuss methods for identifying dysfunctional complexes in human diseases, an application that is proving invaluable for understanding disease mechanisms and discovering novel therapeutic targets. We hope this review aptly commemorates a decade of research on computational prediction of complexes and constitutes a valuable reference for further advancements in this exciting area. Comment: 1 Tabl

arXiv.org e-Print Archive

Elsevier - Publisher Connector

University of Queensland eSpace

Recent advances in clustering methods for protein interaction networks

Author: Deng Youping
Li Min
Pan Yi
Wang Jianxin
Publication venue: BioMed Central
Publication date: 01/01/2010

The increasing availability of large-scale protein-protein interaction data has made it possible to understand the basic components and organization of cell machinery at the network level. The arising challenge is how to analyze such complex interaction data to reveal the principles of cellular organization, processes and functions. Many studies have shown that clustering protein interaction networks is an effective approach for identifying protein complexes or functional modules, which has become a major research topic in systems biology. In this review, recent advances in clustering methods for protein interaction networks will be presented in detail. The predictions of protein functions and interactions based on modules will be covered. Finally, the performance of different clustering methods will be compared and the directions for future research will be discussed.

Crossref

ScholarWorks @ Georgia State University

Springer - Publisher Connector

PubMed Central

Efficient and accurate greedy search methods for mining functional modules in protein interaction networks

Author: A Gavin
B Adamcsek
Baoliu Ye
BS Everitt
C Brun
Chaojun Li
DJ Watts
F Luo
F Radicchi
G Palla
GD Bader
H Jeong
H Leung
HW Mewes
I Xenarios
J Wang
J Wang
J Wang
Jieyue He
L Gao
LF Wu
M Altaf-Ul-Amin
M Girvan
M Li
M Li
M Wu
MEJ Newman
SH Jung
SS Dwight
V Spirin
Wei Zhong
X Li
YR Cho
Z Dezso
Publication venue: BioMed Central
Publication date: 01/06/2012

Background: Most computational algorithms mainly focus on detecting highly connected subgraphs in PPI networks as protein complexes but ignore their inherent organization. Furthermore, many of these algorithms are computationally expensive. However, recent analysis indicates that experimentally detected protein complexes generally contain core/attachment structures. Methods: In this paper, a Greedy Search Method based on Core-Attachment structure (GSM-CA) is proposed. The GSM-CA method detects densely connected regions in large protein-protein interaction networks based on the edge weight and two criteria for determining core nodes and attachment nodes. The GSM-CA method improves the prediction accuracy compared to other similar module detection approaches; however, it is computationally expensive. Many module detection approaches are based on traditional hierarchical methods, which are also computationally inefficient because the hierarchical tree structure produced by these approaches cannot provide adequate information to identify whether a network belongs to a module structure or not. In order to speed up the computational process, the Greedy Search Method based on Fast Clustering (GSM-FC) is proposed in this work. The edge-weight-based GSM-FC method uses a greedy procedure to traverse all edges just once to separate the network into a suitable set of modules. Results: The proposed methods are applied to the protein interaction network of S. cerevisiae. Experimental results indicate that many significant functional modules are detected, most of which match the known complexes. Results also demonstrate that the GSM-FC algorithm is faster and more accurate than other competing algorithms. Conclusions: Based on the new edge weight definition, the proposed algorithm takes advantage of the greedy search procedure to separate the network into a suitable set of modules. Experimental analysis shows that the identified modules are statistically significant. The algorithm can reduce the computational time significantly while keeping high prediction accuracy.
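
The abstract above does not give the edge weight definition or the merging criteria, so the following is only a minimal sketch of the general idea of a single greedy pass over weighted edges, not the GSM-FC algorithm itself. The weight used here (Jaccard similarity of closed neighbourhoods), the `threshold` parameter and the function names are all assumptions made for illustration.

```python
from collections import defaultdict


def edge_weight(adj, u, v):
    """Jaccard similarity of closed neighbourhoods (an assumed weight, not the paper's)."""
    nu = adj[u] | {u}
    nv = adj[v] | {v}
    return len(nu & nv) / len(nu | nv)


def greedy_modules(edges, threshold=0.5):
    """Visit each edge once, heaviest first, merging the modules of its endpoints.

    `threshold` is a hypothetical cut-off: edges lighter than it never merge modules.
    """
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)

    # Union-find structure: every node starts in its own module.
    parent = {n: n for n in adj}

    def find(n):
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path halving
            n = parent[n]
        return n

    weighted = sorted(
        ((edge_weight(adj, u, v), u, v) for u, v in edges),
        key=lambda t: t[0],
        reverse=True,
    )
    for w, u, v in weighted:
        if w < threshold:
            break  # all remaining edges are lighter still
        ru, rv = find(u), find(v)
        if ru != rv:
            parent[rv] = ru

    modules = defaultdict(set)
    for n in adj:
        modules[find(n)].add(n)
    return sorted((sorted(m) for m in modules.values()), key=lambda m: m[0])


# Toy network: two triangles joined by the single bridge edge C-D.
toy_edges = [("A", "B"), ("B", "C"), ("A", "C"),
             ("D", "E"), ("E", "F"), ("D", "F"),
             ("C", "D")]
toy_modules = greedy_modules(toy_edges, threshold=0.5)
```

On this toy network the bridge edge has the lowest weight, so with the assumed threshold the two triangles stay separate modules. Sorting dominates the cost; the edge pass itself touches every edge at most once.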

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Searching for network modules

Author: A Lancichinetti
A Lancichinetti
AE Brower
B Adamcsek
B Bollobás
E Boros
GC Rota
Hao Wu
I Gilboa
I Gilboa
J Reichardt
J Vlasblom
J Wang
Jierui Xie
Jose B. Pereira-Leal
M Aigner
M Szalay-Bekő
MC Schmidt
MEJ Newman
MEJ Newman
MEJ Newman
MEJ Newman
MEJ Newman
R Diestel
R Sharan
R Stanley
Randolf Rotta
S Asur
S Fortunato
S Miyamoto
S Zhang
SE Schaeffer
T Nepusz
T Yu
Tom C. Freeman
U Brandes
X Lei
Y Li
YY Ahn
Publication venue
Publication date: 07/09/2018

When analyzing complex networks, a key target is to uncover their modular structure, which means searching for a family of modules, namely node subsets each spanning a subnetwork more densely connected than the average. This work proposes a novel type of objective function for graph clustering, in the form of a multilinear polynomial whose coefficients are determined by network topology. It may be thought of as a potential function, to be maximized, taking its values on fuzzy clusterings or families of fuzzy subsets of nodes over which every node distributes a unit membership. When suitably parametrized, this potential is shown to attain its maximum when every node concentrates all of its unit membership on some module. The output thus is a partition, while the original discrete optimization problem is turned into a continuous version that allows alternative search strategies to be conceived. The instance of the problem being a pseudo-Boolean function assigning real-valued cluster scores to node subsets, modularity maximization is employed to exemplify a so-called quadratic form, in that the scores of singletons and pairs also fully determine the scores of larger clusters, while the resulting multilinear polynomial potential function has degree 2. After considering further quadratic instances, different from modularity and obtained by interpreting network topology in alternative manners, a greedy local-search strategy for the continuous framework is analytically compared with an existing greedy agglomerative procedure for the discrete case. Overlapping is finally discussed in terms of multiple runs, i.e. several local searches with different initializations. Comment: 10 page
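
As a rough illustration of the quadratic case mentioned in the abstract above, the sketch below evaluates a degree-2 multilinear potential built from the standard Newman modularity matrix B_ij = A_ij - k_i k_j / (2m). The exact parametrization used in the paper is not given in the abstract; the form chosen here (singleton scores B_ii, pair scores B_ij + B_ji, memberships `x[i][c]` summing to 1 per node) is an assumption that simply coincides with ordinary modularity on hard partitions. Function names are hypothetical.

```python
def modularity_matrix(n, edges):
    """Return (B, 2m) for an undirected, unweighted graph on nodes 0..n-1."""
    degree = [0] * n
    adjacency = [[0.0] * n for _ in range(n)]
    for u, v in edges:
        adjacency[u][v] += 1.0
        adjacency[v][u] += 1.0
        degree[u] += 1
        degree[v] += 1
    two_m = 2 * len(edges)
    B = [[adjacency[i][j] - degree[i] * degree[j] / two_m for j in range(n)]
         for i in range(n)]
    return B, two_m


def potential(B, two_m, x):
    """Degree-2 multilinear potential of a fuzzy clustering.

    x[i][c] is the membership of node i in cluster c (each row sums to 1).
    Diagonal terms enter linearly, so no variable is ever squared.
    """
    n = len(B)
    n_clusters = len(x[0])
    total = 0.0
    for c in range(n_clusters):
        for i in range(n):
            total += B[i][i] * x[i][c]                 # singleton score
            for j in range(n):
                if j != i:
                    total += B[i][j] * x[i][c] * x[j][c]  # pair score
    return total / two_m


# Toy graph: two triangles (0,1,2) and (3,4,5) joined by the edge 2-3.
edges = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5), (2, 3)]
B, two_m = modularity_matrix(6, edges)

hard = [[1.0, 0.0]] * 3 + [[0.0, 1.0]] * 3   # one triangle per module
uniform = [[0.5, 0.5]] * 6                   # every node split evenly

q_hard = potential(B, two_m, hard)
q_uniform = potential(B, two_m, uniform)
```

On this toy graph the hard partition scores higher than the evenly split fuzzy clustering, which is in line with the abstract's statement that the maximum is attained when each node concentrates its membership on one module; the sketch checks only these two points and proves nothing in general.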

arXiv.org e-Print Archive

Crossref

Community landscapes: an integrative approach to determine overlapping network module hierarchy, identify key nodes and predict network dynamics

Author: A Arenas
A Capocci
A Hinneburg
A Lancichinetti
A Lancichinetti
AK Ramani
C Baerveldt
D Ekman
D Krioukov
DJ Watts
DL Nelson
ER Gansner
F Radicchi
G Palla
G Tibély
H Yu
I Kovacs
I Vragovic
István A. Kovács
J Moody
JB Axelsen
JD Han
JM Kumpula
JM Thevelein
JP Bagrow
JP Eckmann
JW Berry
K Komurov
M Blatt
M Fiedler
M Girvan
M Grendar
M Rosvall
ME Newman
ME Newman
ML Clark
Máté S. Szalay
N Bertin
Olaf Sporns
P Csermely
P Pons
Peter Csermely
PM Kim
Robin Palotai
S Fortunato
S Fortunato
S Fortunato
T Nepusz
TS Evans
V Latora
VD Blondel
WW Zachary
Y-Y Ahn
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2010

Background: Network communities help the functional organization and evolution of complex networks. However, the development of a method that is both fast and accurate and that provides modular overlaps and partitions of a heterogeneous network has proven to be rather difficult. Methodology/Principal Findings: Here we introduce the novel concept of ModuLand, an integrative method family determining overlapping network modules as hills of an influence function-based, centrality-type community landscape, and including several widely used modularization methods as special cases. As various adaptations of the method family, we developed several algorithms, which provide an efficient analysis of weighted and directed networks, and (1) determine pervasively overlapping modules with high resolution; (2) uncover a detailed hierarchical network structure allowing an efficient, zoom-in analysis of large networks; (3) allow the determination of key network nodes; and (4) help to predict network dynamics. Conclusions/Significance: The concept opens a wide range of possibilities to develop new approaches and applications including network routing, classification, comparison and prediction. Comment: 25 pages with 6 figures and a Glossary + Supporting Information containing pseudo-codes of all algorithms used, 14 Figures, 5 Tables (with 18 module definitions, 129 different modularization methods, 13 module comparison methods) and 396 references. All algorithms can be downloaded from this web-site: http://www.linkgroup.hu/modules.ph

arXiv.org e-Print Archive

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

ELTE Digital Institutional Repository (EDIT)

ModuLand plug-in for Cytoscape: determination of hierarchical layers of overlapping network modules and community centrality

Author: Adamcsek
Ahn
Bader
Balázs Papp
Balázs Szappanos
Fortunato
Ghosh
Ghosh
Guimera
István A. Kovács
Kovács
Kreimer
Mihalik
Morris
Máté Szalay-Bekő
Palla
Parter
Pál
Péter Csermely
Rhrissorrakrai
Rivera
Robin Palotai
Samal
Sethi
Shannon
Su
Tamames
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2012

Summary: The ModuLand plug-in provides Cytoscape users with an algorithm for determining extensively overlapping network modules. Moreover, it identifies several hierarchical layers of modules, where meta-nodes of the higher hierarchical layer represent modules of the lower layer. The tool assigns module cores, which predict the function of the whole module, and determines key nodes bridging two or multiple modules. The plug-in has a detailed JAVA-based graphical interface with various colouring options. The ModuLand tool can run on Windows, Linux, or Mac OS. We demonstrate its use on protein structure and metabolic networks. Availability: The plug-in and its user guide can be downloaded freely from: http://www.linkgroup.hu/modules.php. Contact: [email protected] Supplementary information: Supplementary information is available at Bioinformatics online. Comment: 39 pages, 1 figure and a Supplement with 9 figures and 10 table

arXiv.org e-Print Archive

CiteSeerX

Crossref

ELTE Digital Institutional Repository (EDIT)

Towards the Identification of Protein Complexes and Functional Modules by Integrating PPI Network and Gene Expression Data

Author: Li Min
Pan Yi
Wang Jianxin
Wu Xuehong
Publication venue: ScholarWorks @ Georgia State University
Publication date: 01/01/2012

Background: Identification of protein complexes and functional modules from protein-protein interaction (PPI) networks is crucial to understanding the principles of cellular organization and predicting protein functions. In the past few years, many computational methods have been proposed. However, most of them considered the PPI networks as static graphs and overlooked the dynamics inherent within these networks. Moreover, few of them can distinguish between protein complexes and functional modules. Results: In this paper, a new framework is proposed to distinguish between protein complexes and functional modules by integrating gene expression data into protein-protein interaction (PPI) data. A series of time-sequenced subnetworks (TSNs) is constructed according to the time that the interactions were activated. The algorithm TSN-PCD was then developed to identify protein complexes from these TSNs. As protein complexes are significantly related to functional modules, a new algorithm DFM-CIN is proposed to discover functional modules based on the identified complexes. The experimental results show that the combination of temporal gene expression data with PPI data contributes to identifying protein complexes more precisely. A quantitative comparison based on f-measure reveals that our algorithm TSN-PCD outperforms the other previous protein complex discovery algorithms. Furthermore, we evaluate the identified functional modules by using “Biological Process” annotated in GO (Gene Ontology). The validation shows that the identified functional modules are statistically significant in terms of “Biological Process”. More importantly, the relationship between protein complexes and functional modules is studied. Conclusions: The proposed framework based on the integration of PPI data and gene expression data makes it possible to identify protein complexes and functional modules more effectively. Moreover, the proposed new framework and algorithms can distinguish between protein complexes and functional modules. Our findings suggest that functional modules are closely related to protein complexes and a functional module may consist of one or multiple protein complexes. The program is available at http://netlab.csu.edu.cn/bioinfomatics/limin/DFM-CIN/index
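
The abstract above says only that the time-sequenced subnetworks are built according to when interactions are activated; it does not state the activation rule. The sketch below therefore assumes one simple rule for illustration: an interaction is taken to be active at a time point when both of its proteins have an expression level at or above a fixed `threshold`. The function name, the rule and the toy values are hypothetical and are not taken from TSN-PCD.

```python
def time_sequenced_subnetworks(edges, expression, threshold):
    """Split a static PPI edge list into one subnetwork per time point.

    edges:      list of (protein_a, protein_b) pairs
    expression: dict mapping protein -> list of expression levels over time
    threshold:  assumed activity cut-off (not from the paper)
    """
    n_times = max((len(levels) for levels in expression.values()), default=0)

    def is_active(protein, t):
        # Proteins with no measurement at time t are treated as inactive.
        levels = expression.get(protein, [])
        return t < len(levels) and levels[t] >= threshold

    return [
        [(a, b) for a, b in edges if is_active(a, t) and is_active(b, t)]
        for t in range(n_times)
    ]


# Toy data: three proteins measured at three time points.
ppi_edges = [("P1", "P2"), ("P2", "P3")]
expr = {
    "P1": [0.9, 0.1, 0.8],
    "P2": [0.8, 0.2, 0.9],
    "P3": [0.1, 0.9, 0.7],
}
tsns = time_sequenced_subnetworks(ppi_edges, expr, threshold=0.5)
```

Each element of `tsns` is the edge list of one time point; a complex detection method could then be run on each subnetwork separately. A single global threshold is the crudest possible choice and is used here only to keep the sketch short.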

ScholarWorks @ Georgia State University

Springer - Publisher Connector

PubMed Central