Search CORE

2,623 research outputs found

The impact of sequence database choice on metaproteomic results in gut microbiota studies

Author: Addis Maria Filippa
Deligios Massimo
Fraumene Cristina
Manghina Valeria
Martens Lennart
Muth Thilo
Pagnozzi Daniela
Palomba Antonio
Rapp Erdmann
Tanca Alessandro
Uzzau Sergio
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Background: Elucidating the role of gut microbiota in physiological and pathological processes has recently emerged as a key research aim in life sciences. In this respect, metaproteomics, the study of the whole protein complement of a microbial community, can provide a unique contribution by revealing which functions are actually being expressed by specific microbial taxa. However, its wide application to gut microbiota research has been hindered by challenges in data analysis, especially related to the choice of the proper sequence databases for protein identification. Results: Here, we present a systematic investigation of variables concerning database construction and annotation and evaluate their impact on human and mouse gut metaproteomic results. We found that both publicly available and experimental metagenomic databases lead to the identification of unique peptide assortments, suggesting parallel database searches as a mean to gain more complete information. In particular, the contribution of experimental metagenomic databases was revealed to be mandatory when dealing with mouse samples. Moreover, the use of a "merged" database, containing all metagenomic sequences from the population under study, was found to be generally preferable over the use of sample-matched databases. We also observed that taxonomic and functional results are strongly database-dependent, in particular when analyzing the mouse gut microbiota. As a striking example, the Firmicutes/Bacteroidetes ratio varied up to tenfold depending on the database used. Finally, assembling reads into longer contigs provided significant advantages in terms of functional annotation yields. Conclusions: This study contributes to identify host- and database-specific biases which need to be taken into account in a metaproteomic experiment, providing meaningful insights on how to design gut microbiota studies and to perform metaproteomic data analysis. In particular, the use of multiple databases and annotation tools has to be encouraged, even though this requires appropriate bioinformatic resources

AIR Universita degli studi di Milano

Ghent University Academic Bibliography

PubMed Central

MPG.PuRe

Analysis of a data matrix and a graph: Metagenomic data and the phylogenetic tree

Author: Purdom Elizabeth
Publication venue: 'Institute of Mathematical Statistics'
Publication date: 01/01/2011
Field of study

In biological experiments researchers often have information in the form of a graph that supplements observed numerical data. Incorporating the knowledge contained in these graphs into an analysis of the numerical data is an important and nontrivial task. We look at the example of metagenomic data---data from a genomic survey of the abundance of different species of bacteria in a sample. Here, the graph of interest is a phylogenetic tree depicting the interspecies relationships among the bacteria species. We illustrate that analysis of the data in a nonstandard inner-product space effectively uses this additional graphical information and produces more meaningful results.Comment: Published in at http://dx.doi.org/10.1214/10-AOAS402 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

arXiv.org e-Print Archive

CiteSeerX

Multiple Comparative Metagenomics using Multiset k-mer Counting

Author: Benoit Gaëtan
Drezen Erwan
Lavenier Dominique
Lemaitre Claire
Mariadassou Mahendra
Peterlongo Pierre
Schbath Sophie
Publication venue
Publication date: 28/04/2016
Field of study

Background. Large scale metagenomic projects aim to extract biodiversity knowledge between different environmental conditions. Current methods for comparing microbial communities face important limitations. Those based on taxonomical or functional assignation rely on a small subset of the sequences that can be associated to known organisms. On the other hand, de novo methods, that compare the whole sets of sequences, either do not scale up on ambitious metagenomic projects or do not provide precise and exhaustive results. Methods. These limitations motivated the development of a new de novo metagenomic comparative method, called Simka. This method computes a large collection of standard ecological distances by replacing species counts by k-mer counts. Simka scales-up today's metagenomic projects thanks to a new parallel k-mer counting strategy on multiple datasets. Results. Experiments on public Human Microbiome Project datasets demonstrate that Simka captures the essential underlying biological structure. Simka was able to compute in a few hours both qualitative and quantitative ecological distances on hundreds of metagenomic samples (690 samples, 32 billions of reads). We also demonstrate that analyzing metagenomes at the k-mer level is highly correlated with extremely precise de novo comparison techniques which rely on all-versus-all sequences alignment strategy or which are based on taxonomic profiling

arXiv.org e-Print Archive

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

Directory of Open Access Journals

Recommended from our members

The Computational Diet: A Review of Computational Methods Across Diet, Microbiome, and Health.

Author: Eetemadi Ameen
Kim Minseung
Pereira Beatriz Merchel Piovesan
Rai Navneet
Schmitz Harold
Tagkopoulos Ilias
Publication venue: eScholarship, University of California
Publication date: 01/01/2020
Field of study

Food and human health are inextricably linked. As such, revolutionary impacts on health have been derived from advances in the production and distribution of food relating to food safety and fortification with micronutrients. During the past two decades, it has become apparent that the human microbiome has the potential to modulate health, including in ways that may be related to diet and the composition of specific foods. Despite the excitement and potential surrounding this area, the complexity of the gut microbiome, the chemical composition of food, and their interplay in situ remains a daunting task to fully understand. However, recent advances in high-throughput sequencing, metabolomics profiling, compositional analysis of food, and the emergence of electronic health records provide new sources of data that can contribute to addressing this challenge. Computational science will play an essential role in this effort as it will provide the foundation to integrate these data layers and derive insights capable of revealing and understanding the complex interactions between diet, gut microbiome, and health. Here, we review the current knowledge on diet-health-gut microbiota, relevant data sources, bioinformatics tools, machine learning capabilities, as well as the intellectual property and legislative regulatory landscape. We provide guidance on employing machine learning and data analytics, identify gaps in current methods, and describe new scenarios to be unlocked in the next few years in the context of current knowledge

eScholarship - University of California

Entropy-scaling search of massive biological data

Author: Berger Bonnie
Daniels Noah M.
Danko David Christian
Yu Y. William
Publication venue: 'Elsevier BV'
Publication date: 01/06/2015
Field of study

Many datasets exhibit a well-defined structure that can be exploited to design faster search tools, but it is not always clear when such acceleration is possible. Here, we introduce a framework for similarity search based on characterizing a dataset's entropy and fractal dimension. We prove that searching scales in time with metric entropy (number of covering hyperspheres), if the fractal dimension of the dataset is low, and scales in space with the sum of metric entropy and information-theoretic entropy (randomness of the data). Using these ideas, we present accelerated versions of standard tools, with no loss in specificity and little loss in sensitivity, for use in three domains---high-throughput drug screening (Ammolite, 150x speedup), metagenomics (MICA, 3.5x speedup of DIAMOND [3,700x BLASTX]), and protein structure search (esFragBag, 10x speedup of FragBag). Our framework can be used to achieve "compressive omics," and the general theory can be readily applied to data science problems outside of biology.Comment: Including supplement: 41 pages, 6 figures, 4 tables, 1 bo

arXiv.org e-Print Archive

Elsevier - Publisher Connector

DSpace@MIT

PubMed Central

Doctor of Philosophy

Author: Flygare Steven
Publication venue: University of Utah
Publication date: 01/01/2015
Field of study

dissertationAdvances in technology have produced efficient and powerful scientific instruments for measuring biological phenomena. In particular, modern microscopes and nextgeneration sequencing machines produce data at such a rate that manual analysis is no longer practical or feasible for meaningful scientific inquiries. Thus, there is a great need for computational strategies to organize and analyze huge amounts of data produced by biological experiments. My work presents computational strategies and software solutions for application in image analysis, human variant prioritization, and metagenomics. The information content of images can be leveraged to answer an extremely broad spectrum of questions ranging from inquiries about basic biological processes to highly specific, application-driven inquiries like the efficacy of a pharmaceutical drug. Modern microscopes can produce images at a rate at which rigorous manual analysis is impossible. I have created software pipelines that automate image analysis in two specific applications domains. In addition, I discuss general image analysis strategies that can be applied to a wide variety of problems. There are tens of millions of known human genetic variants. Prioritizing human variants based on how likely they are to cause disease is of huge importance because of the potential impact on human health. Current variant prioritization methods are limited by their scope, efficiency, and accuracy. I present a variant prioritization method, the VAAST variant prioritizer, which is superior in its scope, efficiency, and accuracy to existing variant prioritization methods. The rise of next-generation sequencing enables huge quantities of sequence to be generated in a short period of time. No field of study has been affected by rapid sequencing more than metagenomics. Metagenomics, the genomic analysis of a population v of microorganisms, has important implications for pathogen detection because metagenomics enables the culture-free detection of microorganisms. I have created Taxonomer, a comprehensive metagenomics pipeline that enables the real-time analysis of read datasets derived from environmental samples

The University of Utah: J. Willard Marriott Digital Library

Size Doesn't Matter: Towards a More Inclusive Philosophy of Biology

Author: A. Brune
A.E. Douglas
A.G. O’Donnell
A.G.B. Simpson
A.J. Underwood
A.S. Griffin
A.T. Bull
A.T. Bull
B. Costerton
B. Dixon
B. Magasanik
B.B. Ward
B.F. Brehm-Stecher
B.J. Crespi
B.J. Finlay
B.J. Finlay
B.L. Bassler
B.R. Levin
C. Schmeisser
C.A. Suttle
C.J. Bult
C.J. Goodnight
C.M. Fraser
C.M. Thomas
C.N. Keim
C.R. Woese
C.R. Woese
C.R. Woese
C.R. Woese
C.S. Lewis
C.S. Riesenfeld
D. Bryant
D. Gevers
D. Kaiser
D. Lloyd
D. Medini
D. Nanney
D. Raoult
D.A. Relman
D.A. Stahl
D.A. Stahl
D.A. Walsh
D.C. Queller
D.C. Reanney
D.C. Savage
D.E. Caldwell
D.E. Caldwell
D.E. Dykhuizen
D.E. Koshland Jr.
D.G. Davies
D.H. Huson
D.J. Griffiths
D.J. Webre
D.K. Newman
D.L. Hull
D.L. Hull
D.M. Faguy
D.M. Ward
D.P. Genereux
D.S. Wilson
D.W. Cutler
D.W. McShea
E. Ben-Jacob
E. Mayr
E. Sober
E. Stackebrandt
E. Zuckerkandl
E.A. Lloyd
E.A. Lloyd
E.F. DeLong
E.F. DeLong
E.F. DeLong
E.F. DeLong
E.G. Nisbet
E.J. Feil
E.K. Shiner
E.V. Koonin
F. Bushman
F. Bäckhed
F. Rodríguez-Valera
F. Rodríguez-Valera
F. Rohwer
F.M. Cohan
G. Beadle
G. Drews
G. Myers
G. O’Toole
G.A. Biagini
G.E. Fox
G.H. Wadhams
G.J. Olsen
G.J. Olsen
G.J. Velicer
G.K. Schoolnik
G.M. Dunny
G.W. Tyson
H. Daims
H. Engelberg
H. Ochman
H.N. Schulz
H.W. Jannasch
Hugenholtz
J. Adler
J. Casadesús
J. Dupré
J. Gans
J. Handelsman
J. Handelsman
J. Lederberg
J. Maynard Smith
J. Maynard Smith
J. Maynard Smith
J. Sapp
J. Sapp
J. Sapp
J. Wimpenny
J. Xu
J.-C. Cho
J.-U. Kreft
J.A. Shapiro
J.A. Shapiro
J.B.H. Martiny
J.C. Ameison
J.C. Venter
J.E. Wertz
J.F. Kasting
J.G. Lawrence
J.G. Lawrence
J.G. Lawrence
J.H. Andrews
J.H. Brown
J.H. Slater
J.J. Falke
J.M. Henke
J.M. Solomon
J.M. Young
J.O. Andersson
J.O. Corliss
J.P. Amend
J.P. Collins
J.P. Gogarten
J.P. Gogarten
J.R. Brown
J.R. Leadbetter
J.R. Postgate
J.S. Robert
J.S. Webb
J.S. Wilkins
J.T. Bonner
J.T. Staley
J.T. Staley
J.T. Staley
J.W. Costerton
John Dupré
K. Lee
K. Lewis
K. Sterelny
K. Sterelny
K. Sterelny
K.C. Rice
K.K. Jefferson
K.L. Manchester
K.L. Visick
K.M. Gray
K.T. Konstantinidis
L. Aravind
L. Bromham
L. Dijkshoorn
L. Kroos
L. Margulis
L. Pauling
L.J. Ehlers
L.J. Shimkets
L.J. Shimkets
L.M. Iyer
L.P. Villarreal
L.P. Villarreal
L.R. Croal
L.V. Hooper
L.V. Hooper
L.V. Hooper
L.W. Buss
M. Breitbart
M. Breitbart
M. Dworkin
M. Dworkin
M. Dworkin
M. Hausner
M. Loreau
M. Oksanen
M. Penn
M. Travisano
M. Wainwright
M.A. O’Malley
M.B. Miller
M.D. Baker
M.E. Davey
M.G. Weinbauer
M.J. Carlile
M.J. Federle
M.J. McFall-Ngai
M.J. McFall-Ngai
M.L. Diaz-Torres
M.R. Buckley
M.R. Parsek
M.S. Lee
Maureen A. O’Malley
N. Ward
N.R. Pace
O. Béjà
O.T. Avery
P. Kämpfer
P. Stoodley
P. Vandamme
P. Watnick
P.B. Price
P.D. Schloss
P.E. Kolenbrander
P.G. Falkowski
P.J.M. Haastert van
P.R. Ehrlich
P.S. Stewart
R. Daniel
R. Lan
R. Lan
R. Roselló-Mora
R.A. Kerr
R.C. Looijen
R.D. Berg
R.D. Fleischmann
R.E. Kohler Jr.
R.E. Michod
R.E. Michod
R.G. Beiko
R.I. Amann
R.J. Redfield
R.J. Whitaker
R.L. Charlebois
R.M. Atlas
R.M. Figge
R.M. Maier
R.N. Brandon
R.R. Colwell
R.T. Papke
R.Y. Stanier
R.Y. Stanier
R.Y. Stanier
S. Conway Morris
S. Conway-Morris
S. Molin
S. Nee
S. Nee
S. Okasha
S. Park
S. Pääbo
S. Sarkar
S. Sonea
S.B. Carroll
S.C. Doney
S.D. Bell
S.E. Luria
S.E. Luria
S.J. Gould
S.J. Joseph
S.M. Adl
S.N. Peterson
S.P. Brown
T. Allers
T. Coenye
T. Fenchel
T. Kaeberlein
T. Palys
T.D. Brock
T.D. Brock
T.D. Brock
T.G. Whitham
T.R. Gregory
T.W. Grebe
V.T. Parker
W. Martin
W.B. Whitman
W.C. Summers
W.F. Doolittle
W.F. Doolittle
W.F. Doolittle
W.F. Doolittle
Y. Boucher
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 18/10/2013
Field of study

notes: As the primary author, O’Malley drafted the paper, and gathered and analysed data (scientific papers and talks). Conceptual analysis was conducted by both authors.publication-status: Publishedtypes: ArticlePhilosophers of biology, along with everyone else, generally perceive life to fall into two broad categories, the microbes and macrobes, and then pay most of their attention to the latter. ‘Macrobe’ is the word we propose for larger life forms, and we use it as part of an argument for microbial equality. We suggest that taking more notice of microbes – the dominant life form on the planet, both now and throughout evolutionary history – will transform some of the philosophy of biology’s standard ideas on ontology, evolution, taxonomy and biodiversity. We set out a number of recent developments in microbiology – including biofilm formation, chemotaxis, quorum sensing and gene transfer – that highlight microbial capacities for cooperation and communication and break down conventional thinking that microbes are solely or primarily single-celled organisms. These insights also bring new perspectives to the levels of selection debate, as well as to discussions of the evolution and nature of multicellularity, and to neo-Darwinian understandings of evolutionary mechanisms. We show how these revisions lead to further complications for microbial classification and the philosophies of systematics and biodiversity. Incorporating microbial insights into the philosophy of biology will challenge many of its assumptions, but also give greater scope and depth to its investigations

Crossref

Open Research Exeter