Search CORE

22 research outputs found

Efficient computation of Faith's phylogenetic diversity with applications in characterizing microbiomes

Author: Armstrong George
Beck Kristen L.
Cantrell Kalen
Carrieri Anna Paola
Gonzalez Antonio
Haiminen Niina
Hakim Daniel
Havulinna Aki S.
Huang Shi
Inouye Michael
Jain Mohit
Kim Ho-Cheol
Knight Rob
Lahti Leo
McDonald Daniel
McGrath Imran
Meric Guillaume
Niiranen Teemu
Parida Laxmi
Salomaa Veikko
Swafford Austin D.
Vazquez-Baeza Yoshiki
Zhu Qiyun
Publication venue
Publication date: 01/11/2021
Field of study

The number of publicly available microbiome samples is continually growing. As data set size increases, bottlenecks arise in standard analytical pipelines. Faith's phylogenetic diversity (Faith's PD) is a highly utilized phylogenetic alpha diversity metric that has thus far failed to effectively scale to trees with millions of vertices. Stacked Faith's phylogenetic diversity (SFPhD) enables calculation of this widely adopted diversity metric at a much larger scale by implementing a computationally efficient algorithm. The algorithm reduces the amount of computational resources required, resulting in more accessible software with a reduced carbon footprint, as compared to previous approaches. The new algorithm produces identical results to the previous method. We further demonstrate that the phylogenetic aspect of Faith's PD provides increased power in detecting diversity differences between younger and older populations in the FINRISK study's metagenomic data.Peer reviewe

PubMed Central

eScholarship - University of California

Helsingin yliopiston digitaalinen arkisto

Multi-level analysis of the gut-brain axis shows autism spectrum disorder-associated molecular and microbial profiles

Autism spectrum disorder (ASD) is a neurodevelopmental disorder characterized by heterogeneous cognitive, behavioral and communication impairments. Disruption of the gut-brain axis (GBA) has been implicated in ASD although with limited reproducibility across studies. In this study, we developed a Bayesian differential ranking algorithm to identify ASD-associated molecular and taxa profiles across 10 cross-sectional microbiome datasets and 15 other datasets, including dietary patterns, metabolomics, cytokine profiles and human brain gene expression profiles. We found a functional architecture along the GBA that correlates with heterogeneity of ASD phenotypes, and it is characterized by ASD-associated amino acid, carbohydrate and lipid profiles predominantly encoded by microbial species in the genera Prevotella, Bifidobacterium, Desulfovibrio and Bacteroides and correlates with brain gene expression changes, restrictive dietary patterns and pro-inflammatory cytokine profiles. The functional architecture revealed in age-matched and sex-matched cohorts is not present in sibling-matched cohorts. We also show a strong association between temporal changes in microbiome composition and ASD phenotypes. In summary, we propose a framework to leverage multi-omic datasets from well-defined cohorts and investigate how the GBA influences ASD

IUPUIScholarWorks

eScholarship - University of California

Archivio istituzionale della ricerca - Università di Cagliari

Hochschulbibliothekszentrum des Landes Nordrhein-Westfalen (hbz)

Recommended from our members

Effects of Variation in Urine Sample Storage Conditions on 16S Urogenital Microbiome Analyses

Author: Brubaker Linda
Bryant MacKenzie
Cantrell Kalen
Farmer Sawyer
Knight Rob
Kumar Tanya
Lukacz Emily S
McDonald Daniel
Song Jin
Tubb Helena M
Publication venue: eScholarship, University of California
Publication date: 23/02/2023
Field of study

Replicability is a well-established challenge in microbiome research with a variety of contributing factors at all stages, from sample collection to code execution. Here, we focus on voided urine sample storage conditions for urogenital microbiome analysis. Using urine samples collected from 10 adult females, we investigated the microbiome preservation efficacy of AssayAssure Genelock (Genelock), compared with no preservative, under different temperature conditions. We varied temperature over 48 h in order to examine the impact of conditions samples may experience with home voided urine collection and shipping to a central biorepository. The following common lab and shipping conditions were investigated: -20°C, ambient temperature, 4°C, freeze-thaw cycle, and heat cycle. At 48 h, all samples were stored at -80°C until processing. After generating 16S rRNA gene amplicon sequencing data using the highly sensitive KatharoSeq protocol, we observed individual variation in both alpha and beta diversity metrics below interhuman differences, corroborating reports of individual microbiome variability in other specimen types. While there was no significant difference in beta diversity when comparing Genelock versus no preservative, we did observe a higher concordance with Genelock samples shipped at colder temperatures (-20°C and 4°C) when compared with the samples shipped at -20°C without preservative. Our results indicate that Genelock does not introduce a significant amount of microbial bias when used on a range of temperatures and is most effective at colder temperatures. IMPORTANCE The urogenital microbiome is an understudied yet important human microbiome niche. Research has been stimulated by the relatively recent discovery that urine is not sterile; urinary tract microbes have been linked to health problems, including urinary infections, incontinence, and cancer. The quality of life and economic impact of UTIs and urgency incontinence alone are enormous, with

3.5 billion and

82.6 billion, respectively, spent in the United States. annually. Given the low biomass of urine, novelty of the field, and limited reproducibility evidence, it is critical to study urine sample storage conditions to optimize scientific rigor. Efficient and reliable preservation methods inform methods for home self-sample collection and shipping, increasing the potential use in larger-scale studies. Here, we examined both buffer and temperature variation effects on 16S rRNA gene amplicon sequencing results from urogenital samples, providing data on the consequences of common storage methods on urogenital microbiome results

eScholarship - University of California

Recommended from our members

Effects of variation in sample storage conditions and swab order on 16S vaginal microbiome analyses.

Author: Brubaker Linda
Bryant MacKenzie
Cantrell Kalen
Farmer Sawyer
Knight Robin
Kumar Tanya
Lewis Amanda
Lukacz Emily
McDonald Daniel
Song Se
Tubb Helena
Publication venue: eScholarship, University of California
Publication date: 11/01/2024
Field of study

The composition of the human vaginal microbiome has been linked to a variety of medical conditions including yeast infection, bacterial vaginosis, and sexually transmitted infection. The vaginal microbiome is becoming increasingly acknowledged as a key factor in personal health, and it is essential to establish methods to collect and process accurate samples with self-collection techniques to allow large, population-based studies. In this study, we investigate if using AssayAssure Genelock, a nucleic acid preservative, introduces microbial biases in self-collected vaginal samples. To our knowledge, we also contribute some of the first evidence regarding the impacts of multiple swabs taken at one time point. Vaginal samples have relatively low biomass, so the ability to collect multiple swabs from a unique participant at a single time would greatly improve the replicability and data available for future studies. This will hopefully lay the groundwork to gain a more complete and accurate understanding of the vaginal microbiome

eScholarship - University of California

Representing Diet in a Tree-Based Format for Interactive and Exploratory Assessment of Dietary Intake Data

Author: Cantrell Kalen
Cotillard Aurelie
Derrien Muriel
Johnson Abigail
Knight Rob
Lejzerowicz Franck
Litwin Nicole
Mcdonald Daniel
Nowinski Brent
Song Se Jin
Tap Julien
Veiga Patrick
Publication venue: American Society for Nutrition
Publication date: 14/06/2022
Field of study

International audienceAbstract Objectives We assessed the utility of representing dietary intake data in hierarchical tree structures that consider relationships among foods. Methods Dietary intake was collected from 1909 adults (≥18 years) using a food frequency questionnaire (FFQ; VioScreen) from the American Gut Project. FFQ food items were formatted into hierarchical tree structures based on 1) USDA's Food Nutrient and Database for Dietary Studies (FNDDS) classifications, 2) nutrient content, and 3) molecular compound information detected via mass spectrometry to capture the non-nutrient composition of foods. Next, we compared how well representing dissimilarities (or distances) between individuals based on their diet corresponded with indices such as the Healthy Eating Index (HEI-2015), when those distances are calculated using tree-based versus non-tree-based metrics. We performed an Adonis test (PERMANOVA) to measure the amount of variation explained (R2) in these diet-based distances by HEI-2015. Results We observed that dietary ordinations generated using tree-based relationships between foods have better agreement with HEI than ordinations generated without considering relatedness between foods. The variation explained by HEI-2015 increased by 35% when using the FNDDS tree compared to using a non-tree based quantitative metric (Bray-Curtis (not tree-based) R2 = 0.02931 vs. Weighted UniFrac (tree-based) R2 = 0.03969), by >20% when using the nutrient tree (vs. Weighted UniFrac R2 = 0.03627), and only marginally (6%) when using the molecular compound tree (vs. Weighted UniFrac R2 = 0.03116). Conclusions We show that tree-based measurements of dietary similarity lead to better agreement with diet indices (e.g., HEI) than when relationships among foods are not considered. We also show that representing dietary intake in a tree-like structure can offer interactive visualizations of data that can be used to inform hypotheses regarding dietary characteristics. Funding Sources Danone Nutricia Research

PubMed Central

HAL Descartes

Hal-Diderot

Compositionally Aware Phylogenetic Beta-Diversity Measures Better Resolve Microbiomes Associated with Phenotype.

Author: Allaband Celeste
Armstrong George
Cantrell Kalen
Dilmore Amanda Hazel
Knight Rob
Martino Cameron
McDonald Daniel
Rahman Gibraan
Shaffer Justin P
Shenhav Liat
Song Se Jin
Vázquez-Baeza Yoshiki
Publication venue: eScholarship, University of California
Publication date: 28/04/2022
Field of study

Microbiome data have several specific characteristics (sparsity and compositionality) that introduce challenges in data analysis. The integration of prior information regarding the data structure, such as phylogenetic structure and repeated-measure study designs, into analysis, is an effective approach for revealing robust patterns in microbiome data. Past methods have addressed some but not all of these challenges and features: for example, robust principal-component analysis (RPCA) addresses sparsity and compositionality; compositional tensor factorization (CTF) addresses sparsity, compositionality, and repeated measure study designs; and UniFrac incorporates phylogenetic information. Here we introduce a strategy of incorporating phylogenetic information into RPCA and CTF. The resulting methods, phylo-RPCA, and phylo-CTF, provide substantial improvements over state-of-the-art methods in terms of discriminatory power of underlying clustering ranging from the mode of delivery to adult human lifestyle. We demonstrate quantitatively that the addition of phylogenetic information improves effect size and classification accuracy in both data-driven simulated data and real microbiome data. IMPORTANCE Microbiome data analysis can be difficult because of particular data features, some unavoidable and some due to technical limitations of DNA sequencing instruments. The first step in many analyses that ultimately reveals patterns of similarities and differences among sets of samples (e.g., separating samples from sick and healthy people or samples from seawater versus soil) is calculating the difference between each pair of samples. We introduce two new methods to calculate these differences that combine features of past methods, specifically being able to take into account the principles that most types of microbes are not in most samples (sparsity), that abundances are relative rather than absolute (compositionality), and that all microbes have a shared evolutionary history (phylogeny). We show using simulated and real data that our new methods provide improved classification accuracy of ordinal sample clusters and increased effect size between sample groups on beta-diversity distances

PubMed Central

eScholarship - University of California

Recommended from our members

Greengenes2 unifies microbial data in a single reference tree

Author: Albertsen Mads
Balaban Metin
Bartko Andrew
Cantrell Kalen
Cheng Susan
DeSantis Todd
Gonzalez Antonio
Havulinna Aki S
Hugenholtz Philip
Inouye Michael
Jain Mohit
Jiang Yueyu
Jousilahti Pekka
Karst Søren M
Knight Rob
Lahti Leo
McDonald Daniel
Mirarab Siavash
Morton James T
Nicolaou Giorgia
Niiranen Teemu
Parks Donovan H
Salomaa Veikko
Song Se Jin
Zhu Qiyun
Publication venue: eScholarship, University of California
Publication date: 27/07/2023
Field of study

Studies using 16S rRNA and shotgun metagenomics typically yield different results, usually attributed to PCR amplification biases. We introduce Greengenes2, a reference tree that unifies genomic and 16S rRNA databases in a consistent, integrated resource. By inserting sequences into a whole-genome phylogeny, we show that 16S rRNA and shotgun metagenomic data generated from the same samples agree in principal coordinates space, taxonomy and phenotype effect size when analyzed with the same tree

eScholarship - University of California

VBN

Recommended from our members

Efficient computation of Faith's phylogenetic diversity with applications in characterizing microbiomes.

Author: Armstrong George
Beck Kristen L
Cantrell Kalen
Carrieri Anna Paola
Gonzalez Antonio
Haiminen Niina
Hakim Daniel
Havulinna Aki S
Huang Shi
Inouye Michael
Jain Mohit
Kim Ho-Cheol
Knight Rob
Lahti Leo
McDonald Daniel
McGrath Imran
Méric Guillaume
Niiranen Teemu
Parida Laxmi
Salomaa Veikko
Swafford Austin D
Vázquez-Baeza Yoshiki
Zhu Qiyun
Publication venue: eScholarship, University of California
Publication date: 01/11/2021
Field of study

eScholarship - University of California

Recommended from our members

Greengenes2 unifies microbial data in a single reference tree.

Author: Albertsen Mads
Balaban Metin
Bartko Andrew
Cantrell Kalen
Cheng Susan
DeSantis Todd
Gonzalez Antonio
Havulinna Aki S
Hugenholtz Philip
Inouye Michael
Jain Mohit
Jiang Yueyu
Jousilahti Pekka
Karst Søren M
Knight Rob
Lahti Leo
McDonald Daniel
Mirarab Siavash
Morton James T
Nicolaou Giorgia
Niiranen Teemu
Parks Donovan H
Salomaa Veikko
Song Se Jin
Zhu Qiyun
Publication venue: Nat Biotechnol
Publication date: 17/05/2024
Field of study

Funder: Emerald Foundation 3022Funder: Intramural research program of the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD)Studies using 16S rRNA and shotgun metagenomics typically yield different results, usually attributed to PCR amplification biases. We introduce Greengenes2, a reference tree that unifies genomic and 16S rRNA databases in a consistent, integrated resource. By inserting sequences into a whole-genome phylogeny, we show that 16S rRNA and shotgun metagenomic data generated from the same samples agree in principal coordinates space, taxonomy and phenotype effect size when analyzed with the same tree

Apollo (Cambridge)

Recommended from our members

Author Correction: Greengenes2 unifies microbial data in a single reference tree.

Author: Albertsen Mads
Balaban Metin
Bartko Andrew
Cantrell Kalen
Cheng Susan
DeSantis Todd
Gonzalez Antonio
Havulinna Aki S
Hugenholtz Philip
Inouye Michael
Jain Mohit
Jiang Yueyu
Jousilahti Pekka
Karst Søren M
Knight Rob
Lahti Leo
McDonald Daniel
Mirarab Siavash
Morton James T
Nicolaou Giorgia
Niiranen Teemu
Parks Donovan H
Salomaa Veikko
Song Se Jin
Zhu Qiyun
Publication venue: Nat Biotechnol
Publication date: 17/05/2024
Field of study

Funder: Emerald Foundation 3022Funder: Intramural research program of the Eunice Kennedy Shriver National Institute of Child Health and Human Development (NICHD

Apollo (Cambridge)