Search CORE

84 research outputs found

NeAT: a toolbox for the analysis of biological networks, clusters, classes and pathways

Author: Babu
Bebek
Brohee
Croes
Croes
Deville
Enright
Fukuda
G. Lima-Mendez
G. Vanderstocken
Gagneur
Han
J. van Helden
Janky
Jeong
Jeong
K. Faust
Krogan
Letovsky
Luscombe
Mewes
Milenkovic
O. Sand
Pereira-Leal
R. Janky
S. Brohee
Samuel Lattimore
Scott
Shannon
Sharan
Sprinzak
Uetz
von Mering
Y. Deville
Yook
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

The network analysis tools (NeAT) (http://rsat.ulb.ac.be/neat/) provide a user-friendly web access to a collection of modular tools for the analysis of networks (graphs) and clusters (e.g. microarray clusters, functional classes, etc.). A first set of tools supports basic operations on graphs (comparison between two graphs, neighborhood of a set of input nodes, path finding and graph randomization). Another set of programs makes the connection between networks and clusters (graph-based clustering, cliques discovery and mapping of clusters onto a network). The toolbox also includes programs for detecting significant intersections between clusters/classes (e.g. clusters of co-expression versus functional classes of genes). NeAT are designed to cope with large datasets and provide a flexible toolbox for analyzing biological networks stored in various databases (protein interactions, regulation and metabolism) or obtained from high-throughput experiments (two-hybrid, mass-spectrometry and microarrays). The web interface interconnects the programs in predefined analysis flows, enabling to address a series of questions about networks of interest. Each tool can also be used separately by entering custom data for a specific analysis. NeAT can also be used as web services (SOAP/WSDL interface), in order to design programmatic workflows and integrate them with other available resources

NeAT: a toolbox for the analysis of biological networks, clusters, classes and pathways

Author: Babu
Bebek
Brohee
Croes
Croes
Deville
Enright
Fukuda
G. Lima-Mendez
G. Vanderstocken
Gagneur
Han
J. van Helden
Janky
Jeong
Jeong
K. Faust
Krogan
Letovsky
Luscombe
Mewes
Milenkovic
O. Sand
Pereira-Leal
R. Janky
S. Brohee
Samuel Lattimore
Scott
Shannon
Sharan
Sprinzak
Uetz
von Mering
Y. Deville
Yook
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

MCL-CAw: A refinement of MCL for detecting yeast complexes from weighted PPI networks by incorporating core-attachment structure

Author: A Eberharter
A Mitrofanova
AC Gavin
AC Gavin
AD King
AJ Enright
B Breitkreutz
B Zhang
C Fridel
C Friedel
C von Mering
DF Seals
E Zotenko
EA Winzeler
G Giaever
G Hart
G Liu
G Liu
G Rigaut
GD Bader
H Cheng
H Chua
H Jeong
H Leung
H Wang
Hon Wai Leong
HW Mewes
J Hurwitz
J Zhao
JC Mellor
JD Han
JM Cherry
JS Luz
K Voevodski
Kang Ning
M Ashburner
M Wu
N Batada
NJ Krogan
P Aloy
P Carvalho
P Shannon
P Uetz
PA Grant
PA Grant
S Brohee
S Dongen
S Pu
S Pu
S Srihari
S Srihari
SR Collins
Sriganesh Srihari
T Ito
T Miller
X Zhou
Y Araki
Y Ho
Y Ozawa
Y Tao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Abstract Background The reconstruction of protein complexes from the physical interactome of organisms serves as a building block towards understanding the higher level organization of the cell. Over the past few years, several independent high-throughput experiments have helped to catalogue enormous amount of physical protein interaction data from organisms such as yeast. However, these individual datasets show lack of correlation with each other and also contain substantial number of false positives (noise). Over these years, several affinity scoring schemes have also been devised to improve the qualities of these datasets. Therefore, the challenge now is to detect meaningful as well as novel complexes from protein interaction (PPI) networks derived by combining datasets from multiple sources and by making use of these affinity scoring schemes. In the attempt towards tackling this challenge, the Markov Clustering algorithm (MCL) has proved to be a popular and reasonably successful method, mainly due to its scalability, robustness, and ability to work on scored (weighted) networks. However, MCL produces many noisy clusters, which either do not match known complexes or have additional proteins that reduce the accuracies of correctly predicted complexes. Results Inspired by recent experimental observations by Gavin and colleagues on the modularity structure in yeast complexes and the distinctive properties of "core" and "attachment" proteins, we develop a core-attachment based refinement method coupled to MCL for reconstruction of yeast complexes from scored (weighted) PPI networks. We combine physical interactions from two recent "pull-down" experiments to generate an unscored PPI network. We then score this network using available affinity scoring schemes to generate multiple scored PPI networks. The evaluation of our method (called MCL-CAw) on these networks shows that: (i) MCL-CAw derives larger number of yeast complexes and with better accuracies than MCL, particularly in the presence of natural noise; (ii) Affinity scoring can effectively reduce the impact of noise on MCL-CAw and thereby improve the quality (precision and recall) of its predicted complexes; (iii) MCL-CAw responds well to most available scoring schemes. We discuss several instances where MCL-CAw was successful in deriving meaningful complexes, and where it missed a few proteins or whole complexes due to affinity scoring of the networks. We compare MCL-CAw with several recent complex detection algorithms on unscored and scored networks, and assess the relative performance of the algorithms on these networks. Further, we study the impact of augmenting physical datasets with computationally inferred interactions for complex detection. Finally, we analyse the essentiality of proteins within predicted complexes to understand a possible correlation between protein essentiality and their ability to form complexes. Conclusions We demonstrate that core-attachment based refinement in MCL-CAw improves the predictions of MCL on yeast PPI networks. We show that affinity scoring improves the performance of MCL-CAw.http://deepblue.lib.umich.edu/bitstream/2027.42/78256/1/1471-2105-11-504.xmlhttp://deepblue.lib.umich.edu/bitstream/2027.42/78256/2/1471-2105-11-504-S1.PDFhttp://deepblue.lib.umich.edu/bitstream/2027.42/78256/3/1471-2105-11-504-S2.ZIPhttp://deepblue.lib.umich.edu/bitstream/2027.42/78256/4/1471-2105-11-504.pdfPeer Reviewe

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Deep Blue Documents at the University of Michigan

ScholarBank@NUS

Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences

Discovery and Expansion of Gene Modules by Seeking Isolated Groups in a Random Graph Process

Author: A Beyer
C Schluter
E Hartuv
EA Winzeler
Elizabeth Conibear
Eshel Ben-Jacob
G Giaever
G Milligan
GD Bader
Jennifer Bryan
Jochen Brumm
L Kiemer
MA Wong
O Rinner
P Shannon
R Tibshirani
RF Ling
RO Duda
S Brohee
S van Dongen
SR Collins
TI Lee
W Huber
W Lee
W Stuetzle
W Stuetzle
Wyeth W. Wasserman
Publication venue: Public Library of Science
Publication date: 09/10/2008
Field of study

BACKGROUND: A central problem in systems biology research is the identification and extension of biological modules-groups of genes or proteins participating in a common cellular process or physical complex. As a result, there is a persistent need for practical, principled methods to infer the modular organization of genes from genome-scale data. RESULTS: We introduce a novel approach for the identification of modules based on the persistence of isolated gene groups within an evolving graph process. First, the underlying genomic data is summarized in the form of ranked gene-gene relationships, thereby accommodating studies that quantify the relevant biological relationship directly or indirectly. Then, the observed gene-gene relationship ranks are viewed as the outcome of a random graph process and candidate modules are given by the identifiable subgraphs that arise during this process. An isolation index is computed for each module, which quantifies the statistical significance of its survival time. CONCLUSIONS: The Miso (module isolation) method predicts gene modules from genomic data and the associated isolation index provides a module-specific measure of confidence. Improving on existing alternative, such as graph clustering and the global pruning of dendrograms, this index offers two intuitively appealing features: (1) the score is module-specific; and (2) different choices of threshold correlate logically with the resulting performance, i.e. a stringent cutoff yields high quality predictions, but low sensitivity. Through the analysis of yeast phenotype data, the Miso method is shown to outperform existing alternatives, in terms of the specificity and sensitivity of its predictions

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

An overlapping module identification method in protein-protein interaction networks

Author: AJ Enright
B Adamcsek
B Schwikowski
B Titz
C Liu
DL Nelson
G Cui
G Palla
GD Bader
IK Jordan
J Kim
JDJ Han
JF Xia
K Rhrissorrakrai
Lijing Li
MEJ Newman
MEJ Newman
MG Shi
O Kuchaiev
P Shafer
S Asur
S Brohee
S Van Dongen
U Güldener
V Spirin
X Yan
Xuesong Wang
Yuhu Cheng
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Linking Proteins to Signaling Pathways for Experiment Design and Evaluation

Author: A Li
A Pires-daSilva
Attila Gursoy
AV Antonov
BH Junker
C Stark
CT Lopes
D Binns
D Croft
DS Wishart
E Boutet
G Dennis Jr
Illés J. Farkas
J Li
J Yu
JJ Hornberg
LJ Jensen
M Kanehisa
MH Cheok
MP Cary
N Simonis
NR Ashcroft
P Flicek
R Drysdale
RP Owen
S Brohee
T Korcsmaros
Tamás Korcsmáros
TS Keshava Prasad
TW Harris
Ádám Szántó-Várnagy
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Biomedical experimental work often focuses on altering the functions of selected proteins. These changes can hit signaling pathways, and can therefore unexpectedly and non-specifically affect cellular processes. We propose PathwayLinker, an online tool that can provide a first estimate of the possible signaling effects of such changes, e.g., drug or microRNA treatments. PathwayLinker minimizes the users' efforts by integrating protein-protein interaction and signaling pathway data from several sources with statistical significance tests and clear visualization. We demonstrate through three case studies that the developed tool can point out unexpected signaling bias in normal laboratory experiments and identify likely novel signaling proteins among the interactors of known drug targets. In our first case study we show that knockdown of the Caenorhabditis elegans gene cdc-25.1 (meant to avoid progeny) may globally affect the signaling system and unexpectedly bias experiments. In the second case study we evaluate the loss-of-function phenotypes of a less known C. elegans gene to predict its function. In the third case study we analyze GJA1, an anti-cancer drug target protein in human, and predict for this protein novel signaling pathway memberships, which may be sources of side effects. Compared to similar services, a major advantage of PathwayLinker is that it drastically reduces the necessary amount of manual literature searches and can be used without a computational background. PathwayLinker is available at http://PathwayLinker.org. Detailed documentation and source code are available at the website

Public Library of Science (PLOS)

CiteSeerX

Crossref

PubMed Central

ELTE Digital Institutional Repository (EDIT)

Assessing the functional coherence of modules found in multiple-evidence networks from Arabidopsis

Abstract Background Combining multiple evidence-types from different information sources has the potential to reveal new relationships in biological systems. The integrated information can be represented as a relationship network, and clustering the network can suggest possible functional modules. The value of such modules for gaining insight into the underlying biological processes depends on their functional coherence. The challenges that we wish to address are to define and quantify the functional coherence of modules in relationship networks, so that they can be used to infer function of as yet unannotated proteins, to discover previously unknown roles of proteins in diseases as well as for better understanding of the regulation and interrelationship between different elements of complex biological systems. Results We have defined the functional coherence of modules with respect to the Gene Ontology (GO) by considering two complementary aspects: (i) the fragmentation of the GO functional categories into the different modules and (ii) the most representative functions of the modules. We have proposed a set of metrics to evaluate these two aspects and demonstrated their utility in <it>Arabidopsis thaliana</it>. We selected 2355 proteins for which experimentally established protein-protein interaction (PPI) data were available. From these we have constructed five relationship networks, four based on single types of data: PPI, co-expression, co-occurrence of protein names in scientific literature abstracts and sequence similarity and a fifth one combining these four evidence types. The ability of these networks to suggest biologically meaningful grouping of proteins was explored by applying Markov clustering and then by measuring the functional coherence of the clusters. Conclusions Relationship networks integrating multiple evidence-types are biologically informative and allow more proteins to be assigned to a putative functional module. Using additional evidence types concentrates the functional annotations in a smaller number of modules without unduly compromising their consistency. These results indicate that integration of more data sources improves the ability to uncover functional association between proteins, both by allowing more proteins to be linked and producing a network where modular structure more closely reflects the hierarchy in the gene ontology.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Rothamsted Repository

Discover Protein Complexes in Protein-Protein Interaction Networks Using Parametric Local Modularity

Author: A Beyer
A Clauset
AC Gavin
AL Barabasi
B Adamcsek
C Stark
D Agarwal GaK
E Georgii
E Ravasz
G Palla
GD Bader
JB Pereira-Leal
Jongkwang Kim
Kai Tan
L Everett
L Salwinski
M Girvan
M Sales-Pardo
M Wu
MEJ Newman
MEJ Newman
NJ Krogan
NN Batada
P Schuetz
R Guimera
R Noack AaR
S Brohee
S Fortunato
S Muff
S Pu
S van Dongen
U Brandes
U Brandes
V Spirin
W-K Huh
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

A methodology for detecting the orthology signal in a PPI network at a functional complex level

Author: A Kuzniar
A Vespignani
AC Berglund
AC Gavin
AJ Enright
AK Gillingham
E Georgii
Eleftheria Mavridou
Elena Marchiori
Enrique Carrillo-de Santa Pau
F Chen
F Reggiori
G Manning
I Xenarios
JB Bock
K Brown
K Dolinski
KP O'Brien
L Fokkens
L Li
M Ashburner
M Campillos
M Remm
M Woźniak
MJ Kuehn
ML Bochman
N Bhardwaj
N Yosef
NJ Krogan
P Jancura
P Jancura
Pavol Jancura
R Behnia
R Benne
R Jansen
R Sharan
R Sharan
S Bauer
S Brohee
S Conchon
S Erten
S van Dongen
S Wuchty
S Wuchty
S Yon Rhee
S Özcan
SJL van Wijk
T Brandt
U Guldener
W Seufert
YL Chang
Z Liang
Z Lu
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Contains fulltext : 93730.pdf (preprint version ) (Open Access)13 p

Crossref

Springer

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Radboud Repository

Which clustering algorithm is better for predicting protein complexes?

Abstract Background Protein-Protein interactions (PPI) play a key role in determining the outcome of most cellular processes. The correct identification and characterization of protein interactions and the networks, which they comprise, is critical for understanding the molecular mechanisms within the cell. Large-scale techniques such as pull down assays and tandem affinity purification are used in order to detect protein interactions in an organism. Today, relatively new high-throughput methods like yeast two hybrid, mass spectrometry, microarrays, and phage display are also used to reveal protein interaction networks. Results In this paper we evaluated four different clustering algorithms using six different interaction datasets. We parameterized the MCL, Spectral, RNSC and Affinity Propagation algorithms and applied them to six PPI datasets produced experimentally by Yeast 2 Hybrid (Y2H) and Tandem Affinity Purification (TAP) methods. The predicted clusters, so called protein complexes, were then compared and benchmarked with already known complexes stored in published databases. Conclusions While results may differ upon parameterization, the MCL and RNSC algorithms seem to be more promising and more accurate at predicting PPI complexes. Moreover, they predict more complexes than other reviewed algorithms in absolute numbers. On the other hand the spectral clustering algorithm achieves the highest valid prediction rate in our experiments. However, it is nearly always outperformed by both RNSC and MCL in terms of the geometrical accuracy while it generates the fewest valid clusters than any other reviewed algorithm. This article demonstrates various metrics to evaluate the accuracy of such predictions as they are presented in the text below. Supplementary material can be found at: <url>http://www.bioacademy.gr/bioinformatics/projects/ppireview.htm</url></p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

EUR Research Repository

Open Repository and Bibliography - Luxembourg

University of Thessaly Institutional Repository