17 research outputs found

Gene network interconnectedness and the generalized topological overlap measure

Author: A Barabási
A Ghazalpour
A Li
Andy M Yip
B Zhang
C Stark
DS Goldberg
E Ravasz
E Segal
FJ Isaacs
H Jeong
L Hartwell
L Kaufman
M Eisen
M Newman
MC Oldham
MR Carlson
MW Hahn
O Thimm
P Spellman
P Tamayo
R Albert
S Horvath
S Prinz
S Tornow
S Wasserman
Steve Horvath
T Cox
T Toyoda
X Xu
X Zhou
Y Ye
Z BarJoseph
Z Lubovac
Publication venue: BioMed Central
Publication date: 01/01/2007

BACKGROUND: Network methods are increasingly used to represent the interactions of genes and/or proteins. Genes or proteins that are directly linked may have a similar biological function or may be part of the same biological pathway. Since the information on the connection (adjacency) between two nodes may be noisy or incomplete, it can be desirable to consider alternative measures of pairwise interconnectedness. Here we study a class of measures that are proportional to the number of neighbors that a pair of nodes share. For example, the topological overlap measure by Ravasz et al. [1] can be interpreted as a measure of agreement between the m = 1 step neighborhoods of two nodes. Several studies have shown that two proteins having a higher topological overlap are more likely to belong to the same functional class than proteins having a lower topological overlap. Here we address the question of whether a measure of topological overlap based on higher-order neighborhoods could give rise to a more robust and sensitive measure of interconnectedness.
RESULTS: We generalize the topological overlap measure from m = 1 step neighborhoods to m ≥ 2 step neighborhoods. This allows us to define the m-th order generalized topological overlap measure (GTOM) by (i) counting the number of m-step neighbors that a pair of nodes share and (ii) normalizing it to take a value between 0 and 1. Using theoretical arguments, a yeast co-expression network application, and a fly protein network application, we illustrate the usefulness of the proposed measure for module detection and gene neighborhood analysis.
CONCLUSION: Topological overlap can serve as an important filter to counter the effects of spurious or missing connections between network nodes. The m-th order topological overlap measure allows one to trade off sensitivity against specificity when it comes to defining pairwise interconnectedness and network modules.
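The two steps named in the abstract (count shared m-step neighbors, then normalize to the range 0 to 1) can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation. The abstract does not give the normalization, so the sketch assumes the form commonly used for topological overlap: shared neighbors plus the direct adjacency, divided by the smaller neighborhood size plus one minus the adjacency. The function names `m_step_neighbors` and `gtom` and the dictionary-of-sets graph layout are hypothetical choices made for this example.

```python
def m_step_neighbors(adj, node, m):
    """Return the nodes reachable from `node` in at most m steps, excluding `node`.

    `adj` maps each node to the set of its direct neighbors (undirected graph).
    """
    seen = {node}
    frontier = {node}
    for _ in range(m):
        # Expand one step outward, keeping only nodes not visited before.
        frontier = {v for u in frontier for v in adj[u]} - seen
        seen |= frontier
    return seen - {node}


def gtom(adj, i, j, m=1):
    """Sketch of an m-th order topological overlap between nodes i and j.

    Assumed normalization (not stated in the abstract):
        (|N_m(i) & N_m(j)| + a_ij) / (min(|N_m(i)|, |N_m(j)|) + 1 - a_ij)
    where a_ij is 1 if i and j are directly linked and 0 otherwise.
    """
    if i == j:
        return 1.0
    a_ij = 1 if j in adj[i] else 0
    n_i = m_step_neighbors(adj, i, m)
    n_j = m_step_neighbors(adj, j, m)
    shared = len(n_i & n_j)
    return (shared + a_ij) / (min(len(n_i), len(n_j)) + 1 - a_ij)


# Toy network: a triangle A-B-C with a pendant node D attached to C.
toy = {
    "A": {"B", "C"},
    "B": {"A", "C"},
    "C": {"A", "B", "D"},
    "D": {"C"},
}
print(gtom(toy, "A", "B", m=1))  # directly linked pair sharing neighbor C
print(gtom(toy, "A", "D", m=2))  # unlinked pair compared on 2-step neighborhoods
```

With m = 1 the function reduces to the ordinary one-step topological overlap; raising m widens the neighborhoods being compared, which is the sensitivity versus specificity trade-off the conclusion refers to.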

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ScholarBank@NUS

Gene network modular-based classification of microarray samples

Author: A Presson
A Spira
B Zhang
D Dettling
H Li
H Pang
HN Chua
Hui Jiang
IT Jolliffe
IW Taylor
L Elo
MD Radmacher
MY Park
Pingzhao Hu
R Shen
R Tibshirani
RA Irizarry
RO Stuart
S Dudoit
S Horvath
Shelley B Bull
T Hastie
TR Golub
U Alon
V Tusher
X Yu
Y Guo
Z Lubovac
Publication venue: BioMed Central
Publication date: 01/01/2012

Crossref

Springer - Publisher Connector

PubMed Central

Semantic integration to identify overlapping functional modules in protein interaction networks

Author: A Barrat
A Tanay
A-C Gavin
A-L Barabási
AD King
Aidong Zhang
AW Rives
C von Mering
CA Ball
CM Deane
D Bu
E Ravasz
G Palla
H Jeong
HW Mewes
L Salwinski
LH Hartwell
M Girvan
MP Samanta
Murali Ramanathan
P Pei
P Resnik
P Uetz
R Dunn
S Tornow
T Ideker
T Ito
The Gene Ontology Consortium
TR Hvidsten
V Arnau
V Spirin
Woochang Hwang
Y Ho
Y-R Cho
Young-Rae Cho
Z Fang
Z Lubovac
Publication venue: BioMed Central
Publication date: 01/07/2007

Background: The systematic analysis of protein-protein interactions can enable a better understanding of cellular organization, processes and functions. Functional modules can be identified from the protein interaction networks derived from experimental data sets. However, these analyses are challenging because of the presence of unreliable interactions and the complex connectivity of the network. The integration of protein-protein interactions with the data from other sources can be leveraged for improving the effectiveness of functional module detection algorithms.
Results: We have developed novel metrics, called semantic similarity and semantic interactivity, which use Gene Ontology (GO) annotations to measure the reliability of protein-protein interactions. The protein interaction networks can be converted into a weighted graph representation by assigning the reliability values to each interaction as a weight. We presented a flow-based modularization algorithm to efficiently identify overlapping modules in the weighted interaction networks. The experimental results show that the semantic similarity and semantic interactivity of interacting pairs were positively correlated with functional co-occurrence. The effectiveness of the algorithm for identifying modules was evaluated using functional categories from the MIPS database. We demonstrated that our algorithm had higher accuracy compared to other competing approaches.
Conclusion: The integration of protein interaction networks with GO annotation data and the capability of detecting overlapping modules substantially improve the accuracy of module identification.

Crossref

Directory of Open Access Journals

PubMed Central

Comparative analysis of clustering methods for gene expression time course data

Author: Brown MP
Cho R
Costa IG
Datta S
Diday E
Dubes R
Efron B
Eisen MB
Francisco de A. T. de Carvalho
Gordon AD
Heyer LJ
Ivan G. Costa
Jain AK
Jonsson P
Kohonen T
Lubovac Z
Mangiameli P
Marcílio C. P. de Souto
Milligan GW
Mitchell T
Quackenbush J
Sharan R
Slonim D
Tamayo P
Tavazoie S
Vesanto J
Yeung KY
Zhu J
Publication venue: 'FapUNIFESP (SciELO)'
Publication date: 01/01/2004

Crossref

Biological Process Linkage Networks

Author: A Battle
A Schlicker
A Vazquez
AC Gavin
AH Tong
AJ Butte
Avraham A. Melkman
B Schwikowski
C Stark
D Finley
D Lin
D Segre
DA Stavreva
Dikla Dotan-Cohen
E Formstecher
E Segal
E Unal
EM Marcotte
F Luo
H Hishigaki
H Jeong
I Xenarios
JL Lu
JM Stuart
KR Brown
L Giot
LA Amaral
LF Wu
M Larochelle
MA Harris
MA Huynen
P Bork
PT Spellman
PW Lord
R Kelley
R Sharan
Rodolfo Aramayo
Simon Kasif
SL Wong
Stan Letovsky
TR Hughes
U de Lichtenberg
U Karaoz
X Guo
Z Lubovac
Publication venue: Public Library of Science
Publication date: 23/04/2009

BACKGROUND. The traditional approach to studying complex biological networks is based on the identification of interactions between internal components of signaling or metabolic pathways. By comparison, little is known about interactions between higher order biological systems, such as biological pathways and processes. We propose a methodology for gleaning patterns of interactions between biological processes by analyzing protein-protein interactions, transcriptional co-expression and genetic interactions. At the heart of the methodology are the concept of Linked Processes and the resultant network of biological processes, the Process Linkage Network (PLN).
RESULTS. We construct, catalogue, and analyze different types of PLNs derived from different data sources and different species. When applied to the Gene Ontology, many of the resulting links connect processes that are distant from each other in the hierarchy, even though the connection makes eminent sense biologically. Some others, however, carry an element of surprise and may reflect mechanisms that are unique to the organism under investigation. In this aspect our method complements the link structure between processes inherent in the Gene Ontology, which by its very nature is species-independent. As a practical application of the linkage of processes we demonstrate that it can be effectively used in protein function prediction, having the power to increase both the coverage and the accuracy of predictions, when carefully integrated into prediction methods.
CONCLUSIONS. Our approach constitutes a promising new direction towards understanding the higher levels of organization of the cell as a system, which should help current efforts to re-engineer ontologies and improve our ability to predict which proteins are involved in specific biological processes.
Lynn and William Frankel Center for Computer Science; the Paul Ivanier center for robotics research and production; National Science Foundation (ITR-048715); National Human Genome Research Institute (1R33HG002850-01A1, R01 HG003367-01A1); National Institute of Health (U54 LM008748

Public Library of Science (PLOS)

Crossref

Boston University Institutional Repository (OpenBU)

Directory of Open Access Journals

PubMed Central

Which clustering algorithm is better for predicting protein complexes?

Background: Protein-protein interactions (PPI) play a key role in determining the outcome of most cellular processes. The correct identification and characterization of protein interactions, and of the networks they comprise, is critical for understanding the molecular mechanisms within the cell. Large-scale techniques such as pull-down assays and tandem affinity purification are used to detect protein interactions in an organism. Today, relatively new high-throughput methods like yeast two-hybrid, mass spectrometry, microarrays, and phage display are also used to reveal protein interaction networks.
Results: In this paper we evaluated four different clustering algorithms using six different interaction datasets. We parameterized the MCL, Spectral, RNSC and Affinity Propagation algorithms and applied them to six PPI datasets produced experimentally by yeast two-hybrid (Y2H) and tandem affinity purification (TAP) methods. The predicted clusters, so-called protein complexes, were then compared and benchmarked against known complexes stored in published databases.
Conclusions: While results may differ with parameterization, the MCL and RNSC algorithms seem to be more promising and more accurate at predicting PPI complexes. Moreover, they predict more complexes than the other reviewed algorithms in absolute numbers. On the other hand, the spectral clustering algorithm achieves the highest valid prediction rate in our experiments. However, it is nearly always outperformed by both RNSC and MCL in terms of geometrical accuracy, and it generates fewer valid clusters than any other reviewed algorithm. The article presents various metrics for evaluating the accuracy of such predictions. Supplementary material can be found at: http://www.bioacademy.gr/bioinformatics/projects/ppireview.htm
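The abstract mentions geometrical accuracy without defining it. As a rough illustration only, the sketch below assumes the definition often used when benchmarking predicted clusters against known complexes: the geometric mean of a complex-wise sensitivity and a cluster-wise positive predictive value, both computed from the table of proteins shared between each known complex and each predicted cluster. The function name `geometric_accuracy` and the set-based inputs are hypothetical, and the article may compute the metric differently.

```python
from math import sqrt


def geometric_accuracy(complexes, clusters):
    """Geometric mean of sensitivity and positive predictive value (assumed definition).

    `complexes` and `clusters` are lists of sets of protein identifiers.
    """
    if not complexes or not clusters:
        return 0.0
    # table[i][j] = number of proteins shared by known complex i and predicted cluster j
    table = [[len(c & p) for p in clusters] for c in complexes]
    columns = list(zip(*table))

    total_complex_size = sum(len(c) for c in complexes)
    total_overlap = sum(sum(col) for col in columns)
    if total_complex_size == 0 or total_overlap == 0:
        return 0.0

    # Sensitivity: best coverage of each known complex by a single cluster.
    sensitivity = sum(max(row) for row in table) / total_complex_size
    # PPV: how strongly each cluster concentrates on a single known complex.
    ppv = sum(max(col) for col in columns) / total_overlap
    return sqrt(sensitivity * ppv)


known = [{"a", "b", "c"}, {"d", "e"}]
predicted = [{"a", "b"}, {"c", "d", "e"}]
print(geometric_accuracy(known, predicted))
```

A prediction that reproduces the known complexes exactly scores 1.0 under this definition, and a prediction that shares no proteins with any known complex scores 0.0.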

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

EUR Research Repository

Open Repository and Bibliography - Luxembourg

University of Thessaly Institutional Repository