199 research outputs found

    Robust estimation of microbial diversity in theory and in practice

    Get PDF
    Quantifying diversity is of central importance for the study of structure, function and evolution of microbial communities. The estimation of microbial diversity has received renewed attention with the advent of large-scale metagenomic studies. Here, we consider what the diversity observed in a sample tells us about the diversity of the community being sampled. First, we argue that one cannot reliably estimate the absolute and relative number of microbial species present in a community without making unsupported assumptions about species abundance distributions. The reason for this is that sample data do not contain information about the number of rare species in the tail of species abundance distributions. We illustrate the difficulty in comparing species richness estimates by applying Chao's estimator of species richness to a set of in silico communities: they are ranked incorrectly in the presence of large numbers of rare species. Next, we extend our analysis to a general family of diversity metrics ("Hill diversities"), and construct lower and upper estimates of diversity values consistent with the sample data. The theory generalizes Chao's estimator, which we retrieve as the lower estimate of species richness. We show that Shannon and Simpson diversity can be robustly estimated for the in silico communities. We analyze nine metagenomic data sets from a wide range of environments, and show that our findings are relevant for empirically-sampled communities. Hence, we recommend the use of Shannon and Simpson diversity rather than species richness in efforts to quantify and compare microbial diversity.Comment: To be published in The ISME Journal. Main text: 16 pages, 5 figures. Supplement: 16 pages, 4 figure

    Evidence from GC-TRFLP that Bacterial Communities in Soil Are Lognormally Distributed

    Get PDF
    The Species Abundance Distribution (SAD) is a fundamental property of ecological communities and the form and formation of SADs have been examined for a wide range of communities including those of microorganisms. Progress in understanding microbial SADs, however, has been limited by the remarkable diversity and vast size of microbial communities. As a result, few microbial systems have been sampled with sufficient depth to generate reliable estimates of the community SAD. We have used a novel approach to characterize the SAD of bacterial communities by coupling genomic DNA fractionation with analysis of terminal restriction fragment length polymorphisms (GC-TRFLP). Examination of a soil microbial community through GC-TRFLP revealed 731 bacterial operational taxonomic units (OTUs) that followed a lognormal distribution. To recover the same 731 OTUs through analysis of DNA sequence data is estimated to require analysis of 86,264 16S rRNA sequences. The approach is examined and validated through construction and analysis of simulated microbial communities in silico. Additional simulations performed to assess the potential effects of PCR bias show that biased amplification can cause a community whose distribution follows a power-law function to appear lognormally distributed. We also show that TRFLP analysis, in contrast to GC-TRFLP, is not able to effectively distinguish between competing SAD models. Our analysis supports use of the lognormal as the null distribution for studying the SAD of bacterial communities as for plant and animal communities

    Recurrent Die-Offs of Adult Coho Salmon Returning to Spawn in Puget Sound Lowland Urban Streams

    Get PDF
    Several Seattle-area streams in Puget Sound were the focus of habitat restoration projects in the 1990s. Post-project effectiveness monitoring surveys revealed anomalous behaviors among adult coho salmon returning to spawn in restored reaches. These included erratic surface swimming, gaping, fin splaying, and loss of orientation and equilibrium. Affected fish died within hours, and female carcasses generally showed high rates (>90%) of egg retention. Beginning in the fall of 2002, systematic spawner surveys were conducted to 1) assess the severity of the adult die-offs, 2) compare spawner mortality in urban vs. non-urban streams, and 3) identify water quality and spawner condition factors that might be associated with the recurrent fish kills. The forensic investigation focused on conventional water quality parameters (e.g., dissolved oxygen, temperature, ammonia), fish condition, pathogen exposure and disease status, and exposures to metals, polycyclic aromatic hydrocarbons, and current use pesticides. Daily surveys of a representative urban stream (Longfellow Creek) from 2002–2009 revealed premature spawner mortality rates that ranged from 60–100% of each fall run. The comparable rate in a non-urban stream was <1% (Fortson Creek, surveyed in 2002). Conventional water quality, pesticide exposure, disease, and spawner condition showed no relationship to the syndrome. Coho salmon did show evidence of exposure to metals and petroleum hydrocarbons, both of which commonly originate from motor vehicles in urban landscapes. The weight of evidence suggests that freshwater-transitional coho are particularly vulnerable to an as-yet unidentified toxic contaminant (or contaminant mixture) in urban runoff. Stormwater may therefore place important constraints on efforts to conserve and recover coho populations in urban and urbanizing watersheds throughout the western United States

    Measurement of the inclusive and dijet cross-sections of b-jets in pp collisions at sqrt(s) = 7 TeV with the ATLAS detector

    Get PDF
    The inclusive and dijet production cross-sections have been measured for jets containing b-hadrons (b-jets) in proton-proton collisions at a centre-of-mass energy of sqrt(s) = 7 TeV, using the ATLAS detector at the LHC. The measurements use data corresponding to an integrated luminosity of 34 pb^-1. The b-jets are identified using either a lifetime-based method, where secondary decay vertices of b-hadrons in jets are reconstructed using information from the tracking detectors, or a muon-based method where the presence of a muon is used to identify semileptonic decays of b-hadrons inside jets. The inclusive b-jet cross-section is measured as a function of transverse momentum in the range 20 < pT < 400 GeV and rapidity in the range |y| < 2.1. The bbbar-dijet cross-section is measured as a function of the dijet invariant mass in the range 110 < m_jj < 760 GeV, the azimuthal angle difference between the two jets and the angular variable chi in two dijet mass regions. The results are compared with next-to-leading-order QCD predictions. Good agreement is observed between the measured cross-sections and the predictions obtained using POWHEG + Pythia. MC@NLO + Herwig shows good agreement with the measured bbbar-dijet cross-section. However, it does not reproduce the measured inclusive cross-section well, particularly for central b-jets with large transverse momenta.Comment: 10 pages plus author list (21 pages total), 8 figures, 1 table, final version published in European Physical Journal

    Measurement of the cross-section for b-jets produced in association with a Z boson at root s=7 TeV with the ATLAS detector ATLAS Collaboration

    Get PDF
    A measurement is presented of the inclusive cross-section for b-jet production in association with a Z boson in pp collisions at a centre-of-mass energy of root s = 7 TeV. The analysis uses the data sample collected by the ATLAS experiment in 2010, corresponding to an integrated luminosity of approximately 36 pb(-1). The event selection requires a Z boson decaying into high P-T electrons or muons, and at least one b-jet, identified by its displaced vertex, with transverse momentum p(T) > 25 GeV and rapidity vertical bar y vertical bar < 2.1. After subtraction of background processes, the yield is extracted from the vertex mass distribution of the candidate b-jets. The ratio of this cross-section to the inclusive Z cross-section (the average number of b-jets per Z event) is also measured. Both results are found to be in good agreement with perturbative QCD predictions at next-to-leading order

    ICAR: endoscopic skull‐base surgery

    Get PDF
    n/

    Pan-cancer analysis of whole genomes

    Get PDF
    Cancer is driven by genetic change, and the advent of massively parallel sequencing has enabled systematic documentation of this variation at the whole-genome scale(1-3). Here we report the integrative analysis of 2,658 whole-cancer genomes and their matching normal tissues across 38 tumour types from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). We describe the generation of the PCAWG resource, facilitated by international data sharing using compute clouds. On average, cancer genomes contained 4-5 driver mutations when combining coding and non-coding genomic elements; however, in around 5% of cases no drivers were identified, suggesting that cancer driver discovery is not yet complete. Chromothripsis, in which many clustered structural variants arise in a single catastrophic event, is frequently an early event in tumour evolution; in acral melanoma, for example, these events precede most somatic point mutations and affect several cancer-associated genes simultaneously. Cancers with abnormal telomere maintenance often originate from tissues with low replicative activity and show several mechanisms of preventing telomere attrition to critical levels. Common and rare germline variants affect patterns of somatic mutation, including point mutations, structural variants and somatic retrotransposition. A collection of papers from the PCAWG Consortium describes non-coding mutations that drive cancer beyond those in the TERT promoter(4); identifies new signatures of mutational processes that cause base substitutions, small insertions and deletions and structural variation(5,6); analyses timings and patterns of tumour evolution(7); describes the diverse transcriptional consequences of somatic mutation on splicing, expression levels, fusion genes and promoter activity(8,9); and evaluates a range of more-specialized features of cancer genomes(8,10-18).Peer reviewe
    corecore