Search CORE

20 research outputs found

MINE: Module Identification in Networks

Author: A Ceol
AJ Enright
B Adamcsek
B Aranda
C Stark
DJ Watts
GD Bader
H Hu
HW Mewes
IX Leung
JD Han
Kahn Rhrissorrakrai
Kristin C Gunsalus
M Ashburner
M Boxem
M Remm
ME Newman
N Simonis
X Yan
X Yan
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Graphical models of network associations are useful for both visualizing and integrating multiple types of association data. Identifying modules, or groups of functionally related gene products, is an important challenge in analyzing biological networks. However, existing tools to identify modules are insufficient when applied to dense networks of experimentally derived interaction data. To address this problem, we have developed an agglomerative clustering method that is able to identify highly modular sets of gene products within highly interconnected molecular interaction networks. Results MINE outperforms MCODE, CFinder, NEMO, SPICi, and MCL in identifying non-exclusive, high modularity clusters when applied to the <it>C. elegans </it>protein-protein interaction network. The algorithm generally achieves superior geometric accuracy and modularity for annotated functional categories. In comparison with the most closely related algorithm, MCODE, the top clusters identified by MINE are consistently of higher density and MINE is less likely to designate overlapping modules as a single unit. MINE offers a high level of granularity with a small number of adjustable parameters, enabling users to fine-tune cluster results for input networks with differing topological properties. Conclusions MINE was created in response to the challenge of discovering high quality modules of gene products within highly interconnected biological networks. The algorithm allows a high degree of flexibility and user-customisation of results with few adjustable parameters. MINE outperforms several popular clustering algorithms in identifying modules with high modularity and obtains good overall recall and precision of functional annotations in protein-protein interaction networks from both <it>S. cerevisiae </it>and <it>C. elegans</it>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Integrative Analysis of Many Weighted Co-Expression Networks Using Tensor Computation

Author: A Ruepp
A Smilde
AA Tsay
AJ Butte
AL Barabasi
AL Yuille
AY Ng
B Breitkreutz
BP Kelley
C Faloutsos
CHQ Ding
Chun-Chi Liu
D Achlioptas
D Tao
DJ Thomas
E Acar
F Pan
FRK Chung
H Chen
H Hu
Haifeng Li
I Bernales
J Flannick
J Sun
J Sun
J Sun
JA Papin
JJ Hopfield
Jörg Stelling
K Kuwahara
K Takahashi
K Toeda
KA Allen
L Mao
L Omberg
LR Tucker
M Ashburner
M Kalaev
M Kanehisa
M Koyuturk
M Koyuturk
M Nicolás
M Xu
M Xu
MA Serrano
MEJ Newman
Michael S. Waterman
MR Mehan
MW Mahoney
N Genkai
O Alter
O Alter
O Alter
RB Cattell
S Arora
S Miard
T Zhang
T Zhang
TG Kolda
Tong Zhang
TS Motzkin
TW Anderson
U Luxburg
V Spirin
W Li
Wenyuan Li
X Yan
X Yan
X Zhou
Xianghong Jasmine Zhou
Y Huang
Y Yu
YP Deniélou
Publication venue: Public Library of Science
Publication date: 01/06/2011
Field of study

The rapid accumulation of biological networks poses new challenges and calls for powerful integrative analysis tools. Most existing methods capable of simultaneously analyzing a large number of networks were primarily designed for unweighted networks, and cannot easily be extended to weighted networks. However, it is known that transforming weighted into unweighted networks by dichotomizing the edges of weighted networks with a threshold generally leads to information loss. We have developed a novel, tensor-based computational framework for mining recurrent heavy subgraphs in a large set of massive weighted networks. Specifically, we formulate the recurrent heavy subgraph identification problem as a heavy 3D subtensor discovery problem with sparse constraints. We describe an effective approach to solving this problem by designing a multi-stage, convex relaxation protocol, and a non-uniform edge sampling technique. We applied our method to 130 co-expression networks, and identified 11,394 recurrent heavy subgraphs, grouped into 2,810 families. We demonstrated that the identified subgraphs represent meaningful biological modules by validating against a large set of compiled biological knowledge bases. We also showed that the likelihood for a heavy subgraph to be meaningful increases significantly with its recurrence in multiple networks, highlighting the importance of the integrative approach to biological network analysis. Moreover, our approach based on weighted graphs detects many patterns that would be overlooked using unweighted graphs. In addition, we identified a large number of modules that occur predominately under specific phenotypes. This analysis resulted in a genome-wide mapping of gene network modules onto the phenome. Finally, by comparing module activities across many datasets, we discovered high-order dynamic cooperativeness in protein complex networks and transcriptional regulatory networks

Crossref

Directory of Open Access Journals

PubMed Central

gPrune: A Constraint Pushing Framework for Graph Pattern Mining

Author: A. Butte
A. Inokuchi
C. Bucila
C. Wang
F. Bonchi
F. Bonchi
F. Bonchi
G. Dong
M. Deshpande
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Abstract. In graph mining applications, there has been an increasingly strong urge for imposing user-specified constraints on the mining results. However, unlike most traditional itemset constraints, structural constraints, such as density and diameter of a graph, are very hard to be pushed deep into the mining process. In this paper, we give the first comprehensive study on the pruning properties of both traditional and structural constraints aiming to reduce not only the pattern search space but the data search space as well. A new general framework, called gPrune, is proposed to incorporate all the constraints in such a way that they recursively reinforce each other through the entire mining process. A new concept, Pattern-inseparable Data-antimonotonicity, is proposed to handle the structural constraints unique in the context of graph, which, combined with known pruning properties, provides a comprehensive and unified classification framework for structural constraints. The exploration of these antimonotonicities in the context of graph pattern mining is a significant extension to the known classification of constraints, and deepens our understanding of the pruning properties of structural graph constraints.

CiteSeerX

Crossref

Institutional Knowledge at Singapore Management University

Core Decomposition in Multilayer Networks: Theory, Algorithms, and Applications

Author: Bonchi Francesco
Galimberti Edoardo
Gullo Francesco
Lanciano Tommaso
Publication venue
Publication date: 15/11/2019
Field of study

Multilayer networks are a powerful paradigm to model complex systems, where multiple relations occur between the same entities. Despite the keen interest in a variety of tasks, algorithms, and analyses in this type of network, the problem of extracting dense subgraphs has remained largely unexplored so far. In this work we study the problem of core decomposition of a multilayer network. The multilayer context is much challenging as no total order exists among multilayer cores; rather, they form a lattice whose size is exponential in the number of layers. In this setting we devise three algorithms which differ in the way they visit the core lattice and in their pruning techniques. We then move a step forward and study the problem of extracting the inner-most (also known as maximal) cores, i.e., the cores that are not dominated by any other core in terms of their core index in all the layers. Inner-most cores are typically orders of magnitude less than all the cores. Motivated by this, we devise an algorithm that effectively exploits the maximality property and extracts inner-most cores directly, without first computing a complete decomposition. Finally, we showcase the multilayer core-decomposition tool in a variety of scenarios and problems. We start by considering the problem of densest-subgraph extraction in multilayer networks. We introduce a definition of multilayer densest subgraph that trades-off between high density and number of layers in which the high density holds, and exploit multilayer core decomposition to approximate this problem with quality guarantees. As further applications, we show how to utilize multilayer core decomposition to speed-up the extraction of frequent cross-graph quasi-cliques and to generalize the community-search problem to the multilayer setting

arXiv.org e-Print Archive

Archivio della ricerca- Università di Roma La Sapienza

Mining frequent closed rooted trees

Author: Balcázar Navarro José Luis
Bifet Figuerol Albert Carles
Lozano Bojados Antoni
Publication venue
Publication date: 01/01/2010
Field of study

Many knowledge representation mechanisms are based on tree-like structures, thus symbolizing the fact that certain pieces of information are related in one sense or another. There exists a well-studied process of closure-based data mining in the itemset framework: we consider the extension of this process into trees. We focus mostly on the case where labels on the nodes are nonexistent or unreliable, and discuss algorithms for closurebased mining that only rely on the root of the tree and the link structure. We provide a notion of intersection that leads to a deeper understanding of the notion of support-based closure, in terms of an actual closure operator. We describe combinatorial characterizations and some properties of ordered trees, discuss their applicability to unordered trees, and rely on them to design efficient algorithms for mining frequent closed subtrees both in the ordered and the unordered settings. Empirical validations and comparisons with alternative algorithms are provided.Postprint (author’s final draft

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Recommended from our members

Identify Dynamic Network Modules with Temporal and Spatial Constraints

Author: Almaas E.
Jin R.
Liu C.
McCallen S.
Zhou X. J.
Publication venue: Lawrence Livermore National Laboratory
Publication date: 24/09/2007
Field of study

Despite the rapid accumulation of systems-level biological data, understanding the dynamic nature of cellular activity remains a difficult task. The reason is that most biological data are static, or only correspond to snapshots of cellular activity. In this study, we explicitly attempt to detangle the temporal complexity of biological networks by using compilations of time-series gene expression profiling data.We define a dynamic network module to be a set of proteins satisfying two conditions: (1) they form a connected component in the protein-protein interaction (PPI) network; and (2) their expression profiles form certain structures in the temporal domain. We develop the first efficient mining algorithm to discover dynamic modules in a temporal network, as well as frequently occurring dynamic modules across many temporal networks. Using yeast as a model system, we demonstrate that the majority of the identified dynamic modules are functionally homogeneous. Additionally, many of them provide insight into the sequential ordering of molecular events in cellular systems. We further demonstrate that identifying frequent dynamic network modules can significantly increase the signal to noise separation, despite the fact that most dynamic network modules are highly condition-specific. Finally, we note that the applicability of our algorithm is not limited to the study of PPI systems, instead it is generally applicable to the combination of any type of network and time-series data

UNT Digital Library

Identification of large disjoint motifs in biological networks

Author: A Chatr-Aryamontri
A Masoudi-Nejad
AL Barabási
C Yanover
D Gale
DA Charlebois
F Ay
FL Homa
H Jeong
K Baskerville
M Ashburner
M Deshpande
M Kuramochi
M Kuramochi
N Kashtan
P Wang
R Milo
Rasha Elhesha
RD Leclerc
S Omidi
S Redner
S Wernicke
S Wuchty
S Wuchty
SN Dorogovtsev
SS Shen-Orr
T Milenković
Tamer Kahveci
X Zhu
ZR Kashani
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref