31 research outputs found
Finishing the euchromatic sequence of the human genome
The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead
Retrospective evaluation of whole exome and genome mutation calls in 746 cancer samples
Funder: NCI U24CA211006Abstract: The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC) curated consensus somatic mutation calls using whole exome sequencing (WES) and whole genome sequencing (WGS), respectively. Here, as part of the ICGC/TCGA Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium, which aggregated whole genome sequencing data from 2,658 cancers across 38 tumour types, we compare WES and WGS side-by-side from 746 TCGA samples, finding that ~80% of mutations overlap in covered exonic regions. We estimate that low variant allele fraction (VAF < 15%) and clonal heterogeneity contribute up to 68% of private WGS mutations and 71% of private WES mutations. We observe that ~30% of private WGS mutations trace to mutations identified by a single variant caller in WES consensus efforts. WGS captures both ~50% more variation in exonic regions and un-observed mutations in loci with variable GC-content. Together, our analysis highlights technological divergences between two reproducible somatic variant detection efforts
Infant gut strain persistence is associated with maternal origin, phylogeny, and traits including surface adhesion and iron acquisition.
dRepGenomes_Lou2021.tar.gz
Dereplicated genomes for "Infant gut strain persistence is associated with maternal origin, phylogeny, and functional potential including surface adhesion and iron acquisition
Recommended from our members
Using strain-resolved analysis to identify contamination in metagenomics data
BackgroundMetagenomics analyses can be negatively impacted by DNA contamination. While external sources of contamination such as DNA extraction kits have been widely reported and investigated, contamination originating within the study itself remains underreported.ResultsHere, we applied high-resolution strain-resolved analyses to identify contamination in two large-scale clinical metagenomics datasets. By mapping strain sharing to DNA extraction plates, we identified well-to-well contamination in both negative controls and biological samples in one dataset. Such contamination is more likely to occur among samples that are on the same or adjacent columns or rows of the extraction plate than samples that are far apart. Our strain-resolved workflow also reveals the presence of externally derived contamination, primarily in the other dataset. Overall, in both datasets, contamination is more significant in samples with lower biomass.ConclusionOur work demonstrates that genome-resolved strain tracking, with its essentially genome-wide nucleotide-level resolution, can be used to detect contamination in sequencing-based microbiome studies. Our results underscore the value of strain-specific methods to detect contamination and the critical importance of looking for contamination beyond negative and positive controls. Video Abstract
Infant gut strain persistence is associated with maternal origin, phylogeny, and traits including surface adhesion and iron acquisition.
Gut microbiome succession affects infant development. However, it remains unclear what factors promote persistence of initial bacterial colonizers in the developing gut. Here, we perform strain-resolved analyses to compare gut colonization of preterm and full-term infants throughout the first year of life and evaluate associations between strain persistence and strain origin as well as genetic potential. Analysis of fecal metagenomes collected from 13 full-term and 9 preterm infants reveals that infants' initially distinct microbiomes converge by age 1 year. Approximately 11% of early colonizers, primarily Bacteroides and Bifidobacterium, persist during the first year of life, and those are more prevalent in full-term, compared with preterm infants. Examination of 17 mother-infant pairs reveals maternal gut strains are significantly more likely to persist in the infant gut than other strains. Enrichment in genes for surface adhesion, iron acquisition, and carbohydrate degradation may explain persistence of some strains through the first year of life
Infant microbiome cultivation and metagenomic analysis reveal Bifidobacterium 2’-fucosyllactose utilization can be facilitated by coexisting species
The early-life gut microbiome development has long-term health impacts and can be influenced by factors such as infant diet. Human milk oligosaccharides (HMOs), an essential component of breast milk that can only be metabolized by some beneficial gut microorganisms, ensure proper gut microbiome establishment and infant development. However, how HMOs are metabolized by gut microbiomes is not fully elucidated. Isolate studies have revealed the genetic basis for HMO metabolism, but they exclude the possibility of HMO assimilation via synergistic interactions involving multiple organisms. Here, we investigate microbiome responses to 2’-fucosyllactose (2’FL), a prevalent HMO and a common infant formula additive, by establishing individualized microbiomes using fecal samples from three infants as the inocula. Bifidobacterium breve, a prominent member of infant microbiomes, typically cannot metabolize 2’FL. Using metagenomic data, we predict that extracellular fucosidases encoded by co-existing members such as Ruminococcus gnavus initiate 2’FL breakdown, thus critical for B. breve’s growth. Using both targeted co-cultures and by supplementation of R. gnavus into one microbiome, we show that R. gnavus can promote extensive growth of B. breve through the release of lactose from 2’FL. Overall, microbiome cultivation combined with genome-resolved metagenomics demonstrates that HMO utilization can vary with an individual’s microbiome.ISSN:2041-172
Recommended from our members
Widespread stop-codon recoding in bacteriophages may regulate translation of lytic genes
Bacteriophages (phages) are obligate parasites that use host bacterial translation machinery to produce viral proteins. However, some phages have alternative genetic codes with reassigned stop codons that are predicted to be incompatible with bacterial translation systems. We analysed 9,422 phage genomes and found that stop-codon recoding has evolved in diverse clades of phages that infect bacteria present in both human and animal gut microbiota. Recoded stop codons are particularly over-represented in phage structural and lysis genes. We propose that recoded stop codons might function to prevent premature production of late-stage proteins. Stop-codon recoding has evolved several times in closely related lineages, which suggests that adaptive recoding can occur over very short evolutionary timescales
Recommended from our members
Genetic and behavioral adaptation of Candida parapsilosis to the microbiome of hospitalized infants revealed by in situ genomics, transcriptomics, and proteomics.
BackgroundCandida parapsilosis is a common cause of invasive candidiasis, especially in newborn infants, and infections have been increasing over the past two decades. C. parapsilosis has been primarily studied in pure culture, leaving gaps in understanding of its function in a microbiome context.ResultsHere, we compare five unique C. parapsilosis genomes assembled from premature infant fecal samples, three of which are newly reconstructed, and analyze their genome structure, population diversity, and in situ activity relative to reference strains in pure culture. All five genomes contain hotspots of single nucleotide variants, some of which are shared by strains from multiple hospitals. A subset of environmental and hospital-derived genomes share variants within these hotspots suggesting derivation of that region from a common ancestor. Four of the newly reconstructed C. parapsilosis genomes have 4 to 16 copies of the gene RTA3, which encodes a lipid translocase and is implicated in antifungal resistance, potentially indicating adaptation to hospital antifungal use. Time course metatranscriptomics and metaproteomics on fecal samples from a premature infant with a C. parapsilosis blood infection revealed highly variable in situ expression patterns that are distinct from those of similar strains in pure cultures. For example, biofilm formation genes were relatively less expressed in situ, whereas genes linked to oxygen utilization were more highly expressed, indicative of growth in a relatively aerobic environment. In gut microbiome samples, C. parapsilosis co-existed with Enterococcus faecalis that shifted in relative abundance over time, accompanied by changes in bacterial and fungal gene expression and proteome composition.ConclusionsThe results reveal potentially medically relevant differences in Candida function in gut vs. laboratory environments, and constrain evolutionary processes that could contribute to hospital strain persistence and transfer into premature infant microbiomes. Video abstract