82 research outputs found

    Median population estimates (SD) for the double sampling method.

    No full text
    <p>Values are for low (LC) and high clustered (HC) populations with different proportions intensively surveyed and detection probabilities. Proportion surveyed with the rapid method was 50%. Values are based on 1,000 simulated surveys.</p

    Population estimates for varying levels of population clustering.

    No full text
    <p>Population size was estimated using the double sampling and double observer methods for 1,000 simulations each for the low, moderate, and high clustered populations (Panels A, B, and C, respectively) with the survey proportion at 50%. White indicates the double observer method; gray indicates the double sampling method. Horizontal lines show true population sizes. The double observer method yielded more variable population estimates than the double sampling method for detection probabilities below 50% regardless of population clustering.</p

    Changes in coverage probabilities of 95% confidence intervals for population estimates.

    No full text
    <p>The figures show the changes in coverage probability relative to detection probability for the low (LC) and high clustered (HC) populations at survey proportions of 25% and 75% for the double observer and double sampling methods (Panels A and B, respectively). The dashed line indicates 95% level. Coverage probability for the double sampling method remains close to the nominal level for all detection probabilities and populations but declines to less than 50% for the double observer methods at high detection probabilities.</p

    Characteristics of simulated populations.

    No full text
    <p>500 survey units were populated by drawing random variables from 1 of 3 gamma distributions with different shape and scale parameters selected to yield 3 levels of clustering and a mean of 1.5 birds per survey unit.</p

    Population estimates for varying survey proportions.

    No full text
    <p>Population estimates were obtained based on the double observer and double sampling methods for 1,000 simulations each for the low clustered population with survey proportions of 25% and 75% (Panels A and B, respectively) and for the high clustered population with survey proportions of 25% and 75% (Panels C and D, respectively). White indicates the double observer method; gray indicates the double sampling method. Horizontal lines show true population sizes. The double observer method generally produced biased estimates when the detection probability was below 30%.</p

    Median population estimates (SD) for the double observer method.

    No full text
    a<p>p<sub>1</sub> and p<sub>2</sub> are the detection probabilities for observers 1 and 2, respectively.</p><p>The overall detection probability was near 64 or 84% with equal and or unequal detection probabilities for the two observers. Results are provided for the low (LC) and high (HC) clustered populations. Proportion surveyed was 50%. Values are based on 1,000 simulated surveys.</p

    Failing to account for tree structure results in an elevated false positive rate.

    No full text
    <p>Continuous phenotypes and binary genotypes were simulated across the trees for the four phyla under consideration. A-D show results for the null of no true phenotype-genotype correlation. A-B) Histogram of <i>p</i>-values for simulated phenotypes and genotypes on the Bacteroidetes tree, using (A) phylogenetic or (B) standard linear models. The phylogenetic model distribution was similar to a uniform distribution, while the standard model was very anticonservative, having an excess of small <i>p</i>-values. C-D) False positive rates (Type I error rates) at <i>p</i> = 0.05 for the C) phylogenetic and D) standard models, across varying levels of true phylogenetic signal (Ives-Garland <i>α</i>). E) Traits with varying levels of “true” association spanning values we observed in real data were simulated, and power (y-axis) was computed using phylogenetic linear models.</p

    Comparison of results from the overall prevalence and body-site specific models for Firmicutes.

    No full text
    <p>FDR-corrected significance (as −log<sub>10</sub>(<i>q</i>)) of the overall model is plotted on the horizontal axis, whereas the same quantity for the body-site-specific model is plotted on the vertical axis. All FIGfams significant (<i>q</i> ≤ 0.05) in at least one of the two models are plotted as contour lines: FIGfams significant in the overall prevalence model (and possibly also the gut specific model) are plotted in orange, while FIGfams significant in the gut specific model (and possibly also the overall prevalence model) are plotted in blue. Selected SEED subsystems are displayed as colored points (legend), and selected individual genes are plotted as black points.</p

    Phylogeny-corrected identification of microbial gene families relevant to human gut colonization

    No full text
    <div><p>The mechanisms by which different microbes colonize the healthy human gut versus other body sites, the gut in disease states, or other environments remain largely unknown. Identifying microbial genes influencing fitness in the gut could lead to new ways to engineer probiotics or disrupt pathogenesis. We approach this problem by measuring the statistical association between a species having a gene and the probability that the species is present in the gut microbiome. The challenge is that closely related species tend to be jointly present or absent in the microbiome and also share many genes, only a subset of which are involved in gut adaptation. We show that this phylogenetic correlation indeed leads to many false discoveries and propose phylogenetic linear regression as a powerful solution. To apply this method across the bacterial tree of life, where most species have not been experimentally phenotyped, we use metagenomes from hundreds of people to quantify each species’ prevalence in and specificity for the gut microbiome. This analysis reveals thousands of genes potentially involved in adaptation to the gut across species, including many novel candidates as well as processes known to contribute to fitness of gut bacteria, such as acid tolerance in Bacteroidetes and sporulation in Firmicutes. We also find microbial genes associated with a preference for the gut over other body sites, which are significantly enriched for genes linked to fitness in an <i>in vivo</i> competition experiment. Finally, we identify gene families associated with higher prevalence in patients with Crohn’s disease, including Proteobacterial genes involved in conjugation and fimbria regulation, processes previously linked to inflammation. These gene targets may represent new avenues for modulating host colonization and disease. Our strategy of combining metagenomics with phylogenetic modeling is general and can be used to identify genes associated with adaptation to any environment.</p></div

    Examples of hits from standard linear (blue highlights) and phylogenetic (orange highlights) models.

    No full text
    <p>In each panel, the tree on the left is colored by species prevalence (black to orange), while the tree on the right is colored by gene presence-absence (blue to black). Selected species are displayed in the middle; lines link species with the leaves to which they refer. The color of the line matches the color of the leaf. A-B) The standard model recovered hits that matched large clades but without recapitulating fine structure. C-D) The phylogenetic model recovered associations for which more of the fine structure was mirrored between the left-hand and right-hand trees, as exemplified by the species labeled in the middle. E) Violin plots of Ives-Garland <i>α</i>, a summary of the rate of gain and loss of a binary trait across a tree, for genes significantly associated with prevalence in the standard (left, blue) and phylogenetic (right, orange) linear models. Horizontal lines mark the median of the distributions. The phylogenetic (orange) and standard linear (blue) models were significantly different for each phylum (Wilcox test for Bacteroidetes: 8.2 × 10<sup>−41</sup>; Firmicutes: 7.6 × 10<sup>−279</sup>; Proteobacteria: 1.8 × 10<sup>−235</sup>; Actinobacteria: 9.0 × 10<sup>−133</sup>).</p
    corecore