Search CORE

1,341 research outputs found

Microbial community pattern detection in human body habitats via ensemble clustering framework

Author: Chua Hon-Nian
Li Xiao-Li
Ning Kang
Ou-Yang Le
Su Xiaoquan
Yang Peng
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

The human habitat is a host where microbial species evolve, function, and continue to evolve. Elucidating how microbial communities respond to human habitats is a fundamental and critical task, as establishing baselines of human microbiome is essential in understanding its role in human disease and health. However, current studies usually overlook a complex and interconnected landscape of human microbiome and limit the ability in particular body habitats with learning models of specific criterion. Therefore, these methods could not capture the real-world underlying microbial patterns effectively. To obtain a comprehensive view, we propose a novel ensemble clustering framework to mine the structure of microbial community pattern on large-scale metagenomic data. Particularly, we first build a microbial similarity network via integrating 1920 metagenomic samples from three body habitats of healthy adults. Then a novel symmetric Nonnegative Matrix Factorization (NMF) based ensemble model is proposed and applied onto the network to detect clustering pattern. Extensive experiments are conducted to evaluate the effectiveness of our model on deriving microbial community with respect to body habitat and host gender. From clustering results, we observed that body habitat exhibits a strong bound but non-unique microbial structural patterns. Meanwhile, human microbiome reveals different degree of structural variations over body habitat and host gender. In summary, our ensemble clustering framework could efficiently explore integrated clustering results to accurately identify microbial communities, and provide a comprehensive view for a set of microbial communities. Such trends depict an integrated biography of microbial communities, which offer a new insight towards uncovering pathogenic model of human microbiome.Comment: BMC Systems Biology 201

arXiv.org e-Print Archive

Crossref

PubMed Central

Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences

ScholarBank@NUS

Modularity functions maximization with nonnegative relaxation facilitates community detection in networks

Author: Barabási
Fortunato
Fortunato
Gfeller
Girvan
Jiang
Jonathan Q. Jiang
Lancichinetti
Lancichinetti
Li
Lisa J. McQuay
Lovasz
Newman
Newman
Palla
Porter
Reichardt
Rosvall
Zachary
Publication venue: 'Elsevier BV'
Publication date: 30/08/2011
Field of study

We show here that the problem of maximizing a family of quantitative functions, encompassing both the modularity (Q-measure) and modularity density (D-measure), for community detection can be uniformly understood as a combinatoric optimization involving the trace of a matrix called modularity Laplacian. Instead of using traditional spectral relaxation, we apply additional nonnegative constraint into this graph clustering problem and design efficient algorithms to optimize the new objective. With the explicit nonnegative constraint, our solutions are very close to the ideal community indicator matrix and can directly assign nodes into communities. The near-orthogonal columns of the solution can be reformulated as the posterior probability of corresponding node belonging to each community. Therefore, the proposed method can be exploited to identify the fuzzy or overlapping communities and thus facilitates the understanding of the intrinsic structure of networks. Experimental results show that our new algorithm consistently, sometimes significantly, outperforms the traditional spectral relaxation approaches

arXiv.org e-Print Archive

Crossref

Autonomous Overlapping Community Detection in Temporal Networks: A Dynamic Bayesian Nonnegative Matrix Factorization Approach.

Author: Ahmed
Ball
Bogdan Gabrys
Chakrabarti
Chi
Danon
Derényi
Di Jin
Dongxiao He
Du
Folino
Gehrke
Girvan
Holme
Jin
Jin
Karrer
Kim
Kumar
Lee
Leskovec
Lin
Lin
Lin Pan
Lusseau
Nepusz
Newman
Newman
Nguyen
P. Wipf
Palla
Palla
Pengfei Jiao
Psorakis
Reichardt
Reichardt
Rezvanian
Schuetz
Spirin
Sun
Tan
Tang
Wenjun Wang
Xu
Xu
Yang
Yuan
Publication venue: 'Elsevier BV'
Publication date: 01/10/2016
Field of study

A wide variety of natural or artificial systems can be modeled as time-varying or temporal networks. To understand the structural and functional properties of these time-varying networked systems, it is desirable to detect and analyze the evolving community structure. In temporal networks, the identified communities should reflect the current snapshot network, and at the same time be similar to the communities identified in history or say the previous snapshot networks. Most of the existing approaches assume that the number of communities is known or can be obtained by some heuristic methods. This is unsuitable and complicated for most real world networks, especially temporal networks. In this paper, we propose a Bayesian probabilistic model, named Dynamic Bayesian Nonnegative Matrix Factorization (DBNMF), for automatic detection of overlapping communities in temporal networks. Our model can not only give the overlapping community structure based on the probabilistic memberships of nodes in each snapshot network but also automatically determines the number of communities in each snapshot network based on automatic relevance determination. Thereafter, a gradient descent algorithm is proposed to optimize the objective function of our DBNMF model. The experimental results using both synthetic datasets and real-world temporal networks demonstrate that the DBNMF model has superior performance compared with two widely used methods, especially when the number of communities is unknown and when the network is highly sparse

Crossref

Bournemouth University Research Online

An efficient and principled method for detecting communities in networks

Author: A. Gyenge
B. W. Kernighan
Brian Ball
Brian Karrer
C. Ding
C. Ding
D. E. Knuth
D. M. Blei
E. M. Airoldi
H. Zhang
J. Parkinnen
K. Henderson
L. A. Adamic
L. Backstrom
M. E. J. Newman
M. Girolami
T. Hofmann
W. W. Zachary
Publication venue: 'American Physical Society (APS)'
Publication date: 18/04/2011
Field of study

A fundamental problem in the analysis of network data is the detection of network communities, groups of densely interconnected nodes, which may be overlapping or disjoint. Here we describe a method for finding overlapping communities based on a principled statistical approach using generative network models. We show how the method can be implemented using a fast, closed-form expectation-maximization algorithm that allows us to analyze networks of millions of nodes in reasonable running times. We test the method both on real-world networks and on synthetic benchmarks and find that it gives results competitive with previous methods. We also show that the same approach can be used to extract nonoverlapping community divisions via a relaxation method, and demonstrate that the algorithm is competitively fast and accurate for the nonoverlapping problem.Comment: 14 pages, 5 figures, 1 tabl

arXiv.org e-Print Archive

Crossref

Neighborhood Overlapped Propagation Algorithm For Community Detection Based On Label Time-Sequence

Author: Hong Yu-ling
Zhang Qishan
Publication venue: AIS Electronic Library (AISeL)
Publication date: 06/12/2015
Field of study

The community detection algorithms based on label propagation (LPA) receive broad attention for the advantages of near-linear complexity and no prerequisite for any object function or cluster number. However, the propagation of labels contains uncertainty and randomness, which affects the accuracy and stability of the LPA algorithm. In this study, we propose an efficient detection method based on COPRA with Time-sequence (COPRA_TS). Firstly, the labels are sorted according to a new label importance measure. Then, the label of each vertex is updated according to time-sequence topology measure. The experiments on both the artificial datasets and the real-world datasets demonstrate that the quality of communities discovered by COPRA_TS algorithm is improved with a better stability. At last some future research topics are given

AIS Electronic Library (AISeL)