MINE: Module Identification in Networks

Author: A Ceol
AJ Enright
B Adamcsek
B Aranda
C Stark
DJ Watts
GD Bader
H Hu
HW Mewes
IX Leung
JD Han
Kahn Rhrissorrakrai
Kristin C Gunsalus
M Ashburner
M Boxem
M Remm
ME Newman
N Simonis
X Yan
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: Graphical models of network associations are useful for both visualizing and integrating multiple types of association data. Identifying modules, or groups of functionally related gene products, is an important challenge in analyzing biological networks. However, existing tools to identify modules are insufficient when applied to dense networks of experimentally derived interaction data. To address this problem, we have developed an agglomerative clustering method that is able to identify highly modular sets of gene products within highly interconnected molecular interaction networks. Results: MINE outperforms MCODE, CFinder, NEMO, SPICi, and MCL in identifying non-exclusive, high modularity clusters when applied to the C. elegans protein-protein interaction network. The algorithm generally achieves superior geometric accuracy and modularity for annotated functional categories. In comparison with the most closely related algorithm, MCODE, the top clusters identified by MINE are consistently of higher density and MINE is less likely to designate overlapping modules as a single unit. MINE offers a high level of granularity with a small number of adjustable parameters, enabling users to fine-tune cluster results for input networks with differing topological properties. Conclusions: MINE was created in response to the challenge of discovering high quality modules of gene products within highly interconnected biological networks. The algorithm allows a high degree of flexibility and user-customisation of results with few adjustable parameters. MINE outperforms several popular clustering algorithms in identifying modules with high modularity and obtains good overall recall and precision of functional annotations in protein-protein interaction networks from both S. cerevisiae and C. elegans.

Comparative analysis of thermophilic and mesophilic proteins using Protein Energy Networks

Author: A Ghosh
A Razvi
A Szilagyi
AL Barabasi
B Adamcsek
C Vieille
DVan Der Spoel
IN Berezovsky
J Hollien
KV Brinda
LH Greene
M Robinson-Rechavi
M Sadeghi
MS Vijayabaskar
N Kannan
S Chakravarty
S Kumar
Saraswathi Vishveshwara
Publication venue: BioMed Central
Publication date: 01/01/2010

Abstract. Background: Thermophilic proteins sustain themselves and function at higher temperatures. Despite their structural and functional similarities with their mesophilic homologues, they show enhanced stability. Various comparative studies at genomic, protein sequence and structure levels, and experimental works highlight the different factors and dominant interacting forces contributing to this increased stability. Methods: In this comparative structure-based study, we used interaction energies between amino acids to generate structure networks called Protein Energy Networks (PENs). These PENs are used to compute network, sub-graph, and node-specific parameters. These parameters are then compared between the thermophile-mesophile homologues. Results: The results show an increased number of clusters and low energy cliques in thermophiles as the main contributing factors for their enhanced stability. Furthermore, we see an increase in the number of hubs in thermophiles. We also observe no community of electrostatic cliques forming in PENs. Conclusion: In this study we took an energy-based network approach to identify, by comparative analysis, the factors responsible for the enhanced stability of thermophiles. We were able to point out that the sub-graph parameters are the prominent contributing factors. The thermophiles have a better-packed hydrophobic core. We also discuss, from a cliques and communities perspective, how thermophiles retain conformational flexibility while increasing stability through higher connectivity.

ePrints@IISc

An overlapping module identification method in protein-protein interaction networks

Author: AJ Enright
B Adamcsek
B Schwikowski
B Titz
C Liu
DL Nelson
G Cui
G Palla
GD Bader
IK Jordan
J Kim
JDJ Han
JF Xia
K Rhrissorrakrai
Lijing Li
MEJ Newman
MG Shi
O Kuchaiev
P Shafer
S Asur
S Brohee
S Van Dongen
U Güldener
V Spirin
X Yan
Xuesong Wang
Yuhu Cheng
Publication venue: BioMed Central
Publication date: 01/01/2012

Aberdeen University Research

Interactive analysis of systems biology molecular expression data

Author: Alan Stephenson
B Adamcsek
B Lahner
CB Clish
Charles Buck
David E Salt
DJ Sheskin
EC Butcher
G Valet
IG Tollis
JM Asara
John Burgner
L Hood
M Chen
Michael D Kane
Mingwu Zhang
P Shannon
Qi Ouyang
R Goodacre
S Wang
Sunil Prabhakar
TMJ Fruchterman
W Weckwerth
WH Press
X Zhang
Xiang Zhang
Publication venue: BioMed Central
Publication date: 01/01/2008

Peer reviewed. Publisher PD

Public Library of Science (PLOS)

Cohesive versus Flexible Evolution of Functional Modules in Eukaryotes

Author: A Hirsh
AC Gavin
AH Singh
B Adamcsek
B Snel
Berend Snel
BJ Monahan
Christian von Mering
E Neher
ESGA Snitkin
F Pazos
GV Glazko
HM Bourbon
HW Mewes
L Li
Like Fokkens
M Ashburner
M Campillos
M Kroiss
M Pellegrini
M Remm
MA Huynen
NJ Krogan
OX Cordero
P Aloy
P Shannon
P Smits
R Jothi
R Tatusov
S Collins
T Gabaldon
T Rognes
T Tanaka
Publication venue: Public Library of Science
Publication date: 01/01/2009

Although functionally related proteins can be reliably predicted from phylogenetic profiles, many functional modules do not seem to evolve cohesively according to case studies and systematic analyses in prokaryotes. In this study we quantify the extent of evolutionary cohesiveness of functional modules in eukaryotes and probe the biological and methodological factors influencing our estimates. We have collected various datasets of protein complexes and pathways in Saccharomyces cerevisiae. We define orthologous groups on 34 eukaryotic genomes and measure the extent of cohesive evolution of sets of orthologous groups whose members constitute a known complex or pathway. Within this framework it appears that most functional modules evolve flexibly rather than cohesively. Even after correcting for uncertain module definitions and potentially problematic orthologous groups, only 46% of pathways and complexes evolve more cohesively than random modules. This flexibility seems partly coupled to the nature of the functional module, because biochemical pathways generally evolve more cohesively than complexes.

Efficient and accurate greedy search methods for mining functional modules in protein interaction networks

Author: A Gavin
B Adamcsek
Baoliu Ye
BS Everitt
C Brun
Chaojun Li
DJ Watts
F Luo
F Radicchi
G Palla
GD Bader
H Jeong
H Leung
HW Mewes
I Xenarios
J Wang
Jieyue He
L Gao
LF Wu
M Altaf-Ul-Amin
M Girvan
M Li
M Wu
MEJ Newman
SH Jung
SS Dwight
V Spirin
Wei Zhong
X Li
YR Cho
Z Dezso
Publication venue: BioMed Central
Publication date: 01/06/2012

Abstract. Background: Most computational algorithms mainly focus on detecting highly connected subgraphs in PPI networks as protein complexes but ignore their inherent organization. Furthermore, many of these algorithms are computationally expensive. However, recent analysis indicates that experimentally detected protein complexes generally contain core-attachment structures. Methods: In this paper, a Greedy Search Method based on Core-Attachment structure (GSM-CA) is proposed. The GSM-CA method detects densely connected regions in large protein-protein interaction networks based on the edge weight and two criteria for determining core nodes and attachment nodes. The GSM-CA method improves the prediction accuracy compared to other similar module detection approaches; however, it is computationally expensive. Many module detection approaches are based on the traditional hierarchical methods, which are also computationally inefficient because the hierarchical tree structure produced by these approaches cannot provide adequate information to identify whether a network belongs to a module structure or not. In order to speed up the computational process, the Greedy Search Method based on Fast Clustering (GSM-FC) is proposed in this work. The edge-weight-based GSM-FC method uses a greedy procedure to traverse all edges just once to separate the network into a suitable set of modules. Results: The proposed methods are applied to the protein interaction network of S. cerevisiae. Experimental results indicate that many significant functional modules are detected, most of which match the known complexes. Results also demonstrate that the GSM-FC algorithm is faster and more accurate than other competing algorithms. Conclusions: Based on the new edge weight definition, the proposed algorithm takes advantage of the greedy search procedure to separate the network into a suitable set of modules. Experimental analysis shows that the identified modules are statistically significant. The algorithm can reduce the computational time significantly while keeping high prediction accuracy.

arXiv.org e-Print Archive

Overlapping Community Discovery Methods: A Survey

Author: A Lancichinetti
AJ Enright
B Adamcsek
BS Rees
C Lee
DE Goldberg
F Wei
G Palla
H Shen
J Baumes
J Chen
J Xie
JB Pereira
M Coscia
M Girvan
MEJ Newman
R Cazabet
S Fortunato
S Gregory
S Zhang
TS Evans
UN Raghavan
Y-Y Ahn
Z-H Wu
Publication venue: Springer Science and Business Media LLC
Publication date: 14/11/2014

The detection of overlapping communities is a challenging problem that has gained increasing interest in recent years because of the natural tendency of individuals, observed in real-world networks, to participate in multiple groups at the same time. This review gives a description of the main proposals in the field. Besides the methods designed for static networks, some new approaches that deal with the detection of overlapping communities in networks that change over time are described. Methods are classified with respect to the underlying principles guiding them to obtain a division of the network into groups sharing part of their nodes. For each of them we also report, when available, the computational complexity and the web site address from which it is possible to download the software implementing the method.
Comment: 20 pages, Book Chapter, appears as Social networks: Analysis and Case Studies, A. Gunduz-Oguducu and A. S. Etaner-Uyar eds, Lecture Notes in Social Networks, pp. 105-125, Springer,201

CiteSeerX

Clique-based data mining for related genes in a biomedical database

Author: A Bairoch
A Hamosh
AL Barabási
B Adamcsek
B Baudin
Chikara Yonemori
DJ Cook
DJ Watts
DM Wilkinson
E Tomita
EE Snyder
Etsuji Tomita
H Hu
H Müller
J Chen
J Hauer
JP Benzecri
K Almind
K Oda
KI Goh
LJ Jensen
M Haraguchi
M Kanehisa
Masaaki Muramatsu
MEJ Newman
MK Halushka
MY Galperin
NCEP
O Seda
PC White
PM Roberts
R Dunn
R Sharan
RA De Fronzo
RH Eckel
T Aittokallio
T Matsunaga
T Uno
Tsutomu Matsunaga
X Yan
Y Wang
Y Zhang
Publication venue: BioMed Central
Publication date: 01/07/2009

Abstract. Background: Progress in the life sciences cannot be made without integrating biomedical knowledge on numerous genes in order to help formulate hypotheses on the genetic mechanisms behind various biological phenomena, including diseases. There is thus a strong need for a way to automatically and comprehensively search biomedical databases for related genes, such as genes in the same families and genes encoding components of the same pathways. Here we address the extraction of related genes by searching for densely connected subgraphs, which are modeled as cliques, in a biomedical relational graph. Results: We constructed a graph whose nodes were gene or disease pages, and whose edges were the hyperlink connections between those pages in the Online Mendelian Inheritance in Man (OMIM) database. We obtained over 20,000 sets of related genes (called 'gene modules') by enumerating cliques computationally. The modules included genes in the same family, genes for proteins that form a complex, and genes for components of the same signaling pathway. The results of experiments using 'metabolic syndrome'-related gene modules show that the gene modules can be used to get a coherent holistic picture helpful for interpreting relations among genes. Conclusion: We presented a data mining approach that extracts related genes by enumerating cliques. The extracted gene sets provide a holistic picture useful for comprehending complex disease mechanisms.
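
The clique-enumeration idea in the abstract above can be illustrated with a minimal sketch. It assumes a Bron-Kerbosch search with pivoting, which is one standard way to enumerate maximal cliques; the abstract does not say which enumeration algorithm the authors used. The page names and links in the toy graph are hypothetical and are not taken from OMIM.

```python
from typing import Dict, List, Set


def maximal_cliques(adj: Dict[str, Set[str]]) -> List[Set[str]]:
    """Enumerate all maximal cliques with Bron-Kerbosch (pivoting variant)."""
    cliques: List[Set[str]] = []

    def expand(r: Set[str], p: Set[str], x: Set[str]) -> None:
        # r: current clique, p: candidates to extend it, x: already explored.
        if not p and not x:
            cliques.append(set(r))
            return
        # Pivot on the vertex with the most neighbours in p to prune branches.
        pivot = max(p | x, key=lambda v: len(adj[v] & p))
        for v in list(p - adj[pivot]):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adj), set())
    return cliques


# Hypothetical toy graph: nodes stand in for gene or disease pages,
# edges for hyperlinks between pages (all names invented for illustration).
edges = [
    ("GENE_A", "GENE_B"), ("GENE_A", "GENE_C"), ("GENE_B", "GENE_C"),
    ("GENE_C", "DISEASE_X"), ("GENE_D", "DISEASE_X"), ("GENE_C", "GENE_D"),
]
adj: Dict[str, Set[str]] = {}
for u, v in edges:
    adj.setdefault(u, set()).add(v)
    adj.setdefault(v, set()).add(u)

# Keep cliques of at least three nodes as candidate "modules"
# (the size threshold is an assumption, not a value from the paper).
modules = [c for c in maximal_cliques(adj) if len(c) >= 3]
for module in modules:
    print(sorted(module))
```

On this toy graph the two triangles sharing GENE_C are reported as separate modules, which shows how clique-based modules can overlap in a node.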

Prior knowledge based mining functional modules from Yeast PPI networks with gene ontology

Author: A Capocci
A Hahn
A Hotho
A Jain
A Schlicker
A Subramanian
B Adamcsek
D King
D Scholtens
D Watts
D Zhou
E Rual
E Yeger-Lotem
F Radicchi
F Sohler
GO-Consortium
H Ge
H Jeong
H Zheng
I Davidson
I Ulitsky
J Enright
J Pereira
J Vlasblom
L Hartwell
Liping Jing
M Aldenderfer
M Dittrich
M Holme
M Li
M Newman
Michael K Ng
O Chapelle
O Mason
P Lord
S Asur
S Brohee
S Hoi
S Kamvar
S van Dongen
T Aittokallio
T Beissbarth
T Hertz
T Ito
X Guo
X Hu
Y Qi
Z Lu
Publication venue: BioMed Central
Publication date: 01/12/2010

Abstract. Background: In the literature, there are fruitful algorithmic approaches for identifying functional modules in protein-protein interaction (PPI) networks. Because of the accumulation of large-scale interaction data on multiple organisms and the interaction data not yet recorded in existing PPI databases, there is still an urgent need to design novel computational techniques that can analyze interaction data sets correctly and scalably. Indeed, there are a number of large-scale biological data sets providing indirect evidence for protein-protein interaction relationships. Results: The main aim of this paper is to present a prior-knowledge-based mining strategy to identify functional modules from PPI networks with the aid of Gene Ontology. A higher similarity value in Gene Ontology means that two gene products are more functionally related to each other, so it is better to group such gene products into one functional module. We study (i) how to encode the functional pairs into the existing PPI networks; and (ii) how to use these functional pairs as pairwise constraints to supervise the existing functional module identification algorithms. A topology-based modularity metric and complex annotation in MIPS are used to evaluate the functional modules identified by these two approaches. Conclusions: The experimental results on Yeast PPI networks and GO show that the prior-knowledge-based learning methods perform better than the existing algorithms.