Search CORE

27 research outputs found

Semantic integration to identify overlapping functional modules in protein interaction networks

Author: A Barrat
A Tanay
A-C Gavin
A-L Barabási
AD King
Aidong Zhang
AW Rives
C von Mering
CA Ball
CM Deane
D Bu
E Ravasz
G Palla
H Jeong
HW Mewes
L Salwinski
LH Hartwell
M Girvan
MP Samanta
Murali Ramanathan
P Pei
P Resnik
P Uetz
R Dunn
S Tornow
T Ideker
T Ito
The Gene Ontology Consortium
TR Hvidsten
V Arnau
V Spirin
Woochang Hwang
Y Ho
Y-R Cho
Young-Rae Cho
Z Fang
Z Lubovac
Publication venue: BioMed Central
Publication date: 01/07/2007
Field of study

Abstract Background The systematic analysis of protein-protein interactions can enable a better understanding of cellular organization, processes and functions. Functional modules can be identified from the protein interaction networks derived from experimental data sets. However, these analyses are challenging because of the presence of unreliable interactions and the complex connectivity of the network. The integration of protein-protein interactions with the data from other sources can be leveraged for improving the effectiveness of functional module detection algorithms. Results We have developed novel metrics, called semantic similarity and semantic interactivity, which use Gene Ontology (GO) annotations to measure the reliability of protein-protein interactions. The protein interaction networks can be converted into a weighted graph representation by assigning the reliability values to each interaction as a weight. We presented a flow-based modularization algorithm to efficiently identify overlapping modules in the weighted interaction networks. The experimental results show that the semantic similarity and semantic interactivity of interacting pairs were positively correlated with functional co-occurrence. The effectiveness of the algorithm for identifying modules was evaluated using functional categories from the MIPS database. We demonstrated that our algorithm had higher accuracy compared to other competing approaches. Conclusion The integration of protein interaction networks with GO annotation data and the capability of detecting overlapping modules substantially improve the accuracy of module identification.</p

Crossref

Directory of Open Access Journals

PubMed Central

Ontology integration to identify protein complex in protein interaction networks

Author: Lin Hongfei
Xu Bo
Yang Zhihao
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Springer - Publisher Connector

PubMed Central

Noise reduction in protein-protein interaction graphs by the implementation of a novel weighting scheme

Author: Kossida Sophia
Kritikos George D
Moschopoulos Charalampos
Vazirgiannis Michalis
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Recent technological advances applied to biology such as yeast-two-hybrid, phage display and mass spectrometry have enabled us to create a detailed map of protein interaction networks. These interaction networks represent a rich, yet noisy, source of data that could be used to extract meaningful information, such as protein complexes. Several interaction network weighting schemes have been proposed so far in the literature in order to eliminate the noise inherent in interactome data. In this paper, we propose a novel weighting scheme and apply it to the <it>S. cerevisiae </it>interactome. Complex prediction rates are improved by up to 39%, depending on the clustering algorithm applied. Results We adopt a two step procedure. During the first step, by applying both novel and well established protein-protein interaction (PPI) weighting methods, weights are introduced to the original interactome graph based on the confidence level that a given interaction is a true-positive one. The second step applies clustering using established algorithms in the field of graph theory, as well as two variations of Spectral clustering. The clustered interactome networks are also cross-validated against the confirmed protein complexes present in the MIPS database. Conclusions The results of our experimental work demonstrate that interactome graph weighting methods clearly improve the clustering results of several clustering algorithms. Moreover, our proposed weighting scheme outperforms other approaches of PPI graph weighting.</p

Crossref

Directory of Open Access Journals

PubMed Central

Which clustering algorithm is better for predicting protein complexes?

Abstract Background Protein-Protein interactions (PPI) play a key role in determining the outcome of most cellular processes. The correct identification and characterization of protein interactions and the networks, which they comprise, is critical for understanding the molecular mechanisms within the cell. Large-scale techniques such as pull down assays and tandem affinity purification are used in order to detect protein interactions in an organism. Today, relatively new high-throughput methods like yeast two hybrid, mass spectrometry, microarrays, and phage display are also used to reveal protein interaction networks. Results In this paper we evaluated four different clustering algorithms using six different interaction datasets. We parameterized the MCL, Spectral, RNSC and Affinity Propagation algorithms and applied them to six PPI datasets produced experimentally by Yeast 2 Hybrid (Y2H) and Tandem Affinity Purification (TAP) methods. The predicted clusters, so called protein complexes, were then compared and benchmarked with already known complexes stored in published databases. Conclusions While results may differ upon parameterization, the MCL and RNSC algorithms seem to be more promising and more accurate at predicting PPI complexes. Moreover, they predict more complexes than other reviewed algorithms in absolute numbers. On the other hand the spectral clustering algorithm achieves the highest valid prediction rate in our experiments. However, it is nearly always outperformed by both RNSC and MCL in terms of the geometrical accuracy while it generates the fewest valid clusters than any other reviewed algorithm. This article demonstrates various metrics to evaluate the accuracy of such predictions as they are presented in the text below. Supplementary material can be found at: <url>http://www.bioacademy.gr/bioinformatics/projects/ppireview.htm</url></p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

EUR Research Repository

Open Repository and Bibliography - Luxembourg

University of Thessaly Institutional Repository

Identification of functional hubs and modules by converting interactome networks into hierarchical ordering of proteins

Author: A Chatr-aryamontri
A-L Barabasi
Aidong Zhang
AW Rives
B-J Breitkreutz
C Brun
DJ Watts
E Banks
F Luo
G Palla
GD Bader
H Jeong
HB Fraser
HW Mewes
J Demeter
J-DJ Han
JR Parrish
L Salwinski
M Altaf-Ul-Amin
NN Batada
R Aebersold
R Dunn
R Saeed
R Sharan
S Kerrien
The Gene Ontology Consortium
V Spirin
W Li
X He
Y Chen
Y-R Cho
Y-R Cho
Young-Rae Cho
Z Wang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Determining modular organization of protein interaction networks by maximizing modularity density

Author: A Barabasi
A Ng
AD King
AW Rives
C Brun
C Caretta-Cartozo
CC Friedel
Chris Ding
D Bu
E Ravasz
E Segal
EA Stone
F Bach
G Palla
GD Bader
H Lu
HW Mewes
J Zhao
ME Newman
MEJ Newman
R Guimer
S Broheé
S Fortunato
S van Dongen
S White
S Zhang
S Zhang
S Zhang
Shihua Zhang
V Spirin
Xiang-Sun Zhang
Xue-Mei Ning
Y Cho
Z Li
Z Wang
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps

Author: Chiam Tak Chien
Cho Young-Rae
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background Recent computational techniques have facilitated analyzing genome-wide protein-protein interaction data for several model organisms. Various graph-clustering algorithms have been applied to protein interaction networks on the genomic scale for predicting the entire set of potential protein complexes. In particular, the density-based clustering algorithms which are able to generate overlapping clusters, i.e. the clusters sharing a set of nodes, are well-suited to protein complex detection because each protein could be a member of multiple complexes. However, their accuracy is still limited because of complex overlap patterns of their output clusters. Results We present a systematic approach of refining the overlapping clusters identified from protein interaction networks. We have designed novel metrics to assess cluster overlaps: overlap coverage and overlapping consistency. We then propose an overlap refinement algorithm. It takes as input the clusters produced by existing density-based graph-clustering methods and generates a set of refined clusters by parameterizing the metrics. To evaluate protein complex prediction accuracy, we used the <it>f</it>-measure by comparing each refined cluster to known protein complexes. The experimental results with the yeast protein-protein interaction data sets from BioGRID and DIP demonstrate that accuracy on protein complex prediction has increased significantly after refining cluster overlaps. Conclusions The effectiveness of the proposed cluster overlap refinement approach for protein complex detection has been validated in this study. Analyzing overlaps of the clusters from protein interaction networks is a crucial task for understanding of functional roles of proteins and topological characteristics of the functional systems.</p

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Efficient and accurate greedy search methods for mining functional modules in protein interaction networks

Author: A Gavin
B Adamcsek
Baoliu Ye
BS Everitt
C Brun
Chaojun Li
DJ Watts
F Luo
F Radicchi
G Palla
GD Bader
H Jeong
H Leung
HW Mewes
I Xenarios
J Wang
J Wang
J Wang
Jieyue He
L Gao
LF Wu
M Altaf-Ul-Amin
M Girvan
M Li
M Li
M Wu
MEJ Newman
SH Jung
SS Dwight
V Spirin
Wei Zhong
X Li
YR Cho
Z Dezso
Publication venue: BioMed Central
Publication date: 01/06/2012
Field of study

Abstract Background Most computational algorithms mainly focus on detecting highly connected subgraphs in PPI networks as protein complexes but ignore their inherent organization. Furthermore, many of these algorithms are computationally expensive. However, recent analysis indicates that experimentally detected protein complexes generally contain Core/attachment structures. Methods In this paper, a Greedy Search Method based on Core-Attachment structure (GSM-CA) is proposed. The GSM-CA method detects densely connected regions in large protein-protein interaction networks based on the edge weight and two criteria for determining core nodes and attachment nodes. The GSM-CA method improves the prediction accuracy compared to other similar module detection approaches, however it is computationally expensive. Many module detection approaches are based on the traditional hierarchical methods, which is also computationally inefficient because the hierarchical tree structure produced by these approaches cannot provide adequate information to identify whether a network belongs to a module structure or not. In order to speed up the computational process, the Greedy Search Method based on Fast Clustering (GSM-FC) is proposed in this work. The edge weight based GSM-FC method uses a greedy procedure to traverse all edges just once to separate the network into the suitable set of modules. Results The proposed methods are applied to the protein interaction network of S. cerevisiae. Experimental results indicate that many significant functional modules are detected, most of which match the known complexes. Results also demonstrate that the GSM-FC algorithm is faster and more accurate as compared to other competing algorithms. Conclusions Based on the new edge weight definition, the proposed algorithm takes advantages of the greedy search procedure to separate the network into the suitable set of modules. Experimental analysis shows that the identified modules are statistically significant. The algorithm can reduce the computational time significantly while keeping high prediction accuracy.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Identifying protein complexes from interaction networks based on clique percolation and distance restriction

Author: Li Min
Liu Binbin
Pan Yi
Wang Jianxin
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Background: Identification of protein complexes in large interaction networks is crucial to understand principles of cellular organization and predict protein functions, which is one of the most important issues in the post-genomic era. Each protein might be subordinate multiple protein complexes in the real protein-protein interaction networks.Identifying overlapping protein complexes from protein-protein interaction networks is a considerable research topic. Result: As an effective algorithm in identifying overlapping module structures, clique percolation method (CPM) has a wide range of application in social networks and biological networks. However, the recognition accuracy of algorithm CPM is lowly. Furthermore, algorithm CPM is unfit to identifying protein complexes with meso-scale when it applied in protein-protein interaction networks. In this paper, we propose a new topological model by extending the definition of k-clique community of algorithm CPM and introduced distance restriction, and develop a novel algorithm called CP-DR based on the new topological model for identifying protein complexes. In this new algorithm, the protein complex size is restricted by distance constraint to conquer the shortcomings of algorithm CPM. The algorithm CP-DR is applied to the protein interaction network of Sacchromyces cerevisiae and identifies many well known complexes. Conclusion: The proposed algorithm CP-DR based on clique percolation and distance restriction makes it possible to identify dense subgraphs in protein interaction networks, a large number of which correspond to known protein complexes. Compared to algorithm CPM, algorithm CP-DR has more outstanding performance

ScholarWorks @ Georgia State University

Springer - Publisher Connector

PubMed Central