76 research outputs found

    Quantifying uncertainty of taxonomic placement in DNA barcoding and metabarcoding

    Get PDF
    A crucial step in the use of DNA markers for biodiversity surveys is the assignment of Linnaean taxonomies (species, genus, etc.) to sequence reads. This allows the use of all the information known based on the taxonomic names. Taxonomic placement of DNA barcoding sequences is inherently probabilistic because DNA sequences contain errors, because there is natural variation among sequences within a species, and because reference data bases are incomplete and can have false annotations. However, most existing bioinformatics methods for taxonomic placement either exclude uncertainty, or quantify it using metrics other than probability. In this paper we evaluate the performance of the recently proposed probabilistic taxonomic placement method PROTAX by applying it to both annotated reference sequence data as well as to unknown environmental data. Our four case studies include contrasting taxonomic groups (fungi, bacteria, mammals and insects), variation in the length and quality of the barcoding sequences (from individually Sanger-sequenced sequences to short Illumina reads), variation in the structures and sizes of the taxonomies (800–130 000 species) and variation in the completeness of the reference data bases (representing 15–100% of known species). Our results demonstrate that PROTAX yields essentially unbiased probabilities of taxonomic placement, which means its quantification of species identification uncertainty is reliable. As expected, the accuracy of taxonomic placement increases with increasing coverage of taxonomic and reference sequence data bases, and with increasing ratio of genetic variation among taxonomic levels over within taxonomic levels. We conclude that reliable species-level identification from environmental samples is still challenging and that neglecting identification uncertainty can lead to spurious inference. A key aim for future research is the completion of taxonomic and reference sequence data bases and making these two types of data compatible

    Extracting abundance information from DNA‐based data

    Get PDF
    The accurate extraction of species-abundance information from DNA-based data (metabarcoding, metagenomics) could contribute usefully to diet analysis and foodweb reconstruction, the inference of species interactions, the modelling of population dynamics and species distributions, the biomonitoring of environmental state and change, and the inference of false positives and negatives. However, multiple sources of bias and noise in sampling and processing combine to inject error into DNA-based data sets. To understand how to extract abundance information, it is useful to distinguish two concepts. (i) Within-sample across-species quantification describes relative species abundances in one sample. (ii) Across-sample within-species quantification describes how the abundance of each individual species varies from sample to sample, such as over a time series, an environmental gradient or different experimental treatments. First, we review the literature on methods to recover across-species abundance information (by removing what we call “species pipeline biases”) and within-species abundance information (by removing what we call “pipeline noise”). We argue that many ecological questions can be answered with just within-species quantification, and we therefore demonstrate how to use a “DNA spike-in” to correct for pipeline noise and recover within-species abundance information. We also introduce a modelbased estimator that can be used on data sets without a physical spike-in to approximate and correct for pipeline noise

    Accounting for species interactions is necessary for predicting how arctic arthropod communities respond to climate change

    Get PDF
    Species interactions are known to structure ecological communities. Still, the influence of climate change on biodiversity has primarily been evaluated by correlating individual species distributions with local climatic descriptors, then extrapolating into future climate scenarios. We ask whether predictions on arctic arthropod response to climate change can be improved by accounting for species interactions. For this, we use a 14-year-long, weekly time series from Greenland, resolved to the species level by mitogenome mapping. During the study period, temperature increased by 2 degrees C and arthropod species richness halved. We show that with abiotic variables alone, we are essentially unable to predict species responses, but with species interactions included, the predictive power of the models improves considerably. Cascading trophic effects thereby emerge as important in structuring biodiversity response to climate change. Given the need to scale up from species-level to community-level projections of biodiversity change, these results represent a major step forward for predictive ecology.Peer reviewe

    SPIKEPIPE: A metagenomic pipeline for the accurate quantification of eukaryotic species occurrences and intraspecific abundance change using DNA barcodes or mitogenomes

    Get PDF
    The accurate quantification of eukaryotic species abundances from bulk samples remains a key challenge for community ecology and environmental biomonitoring. We resolve this challenge by combining shotgun sequencing, mapping to reference DNA barcodes or to mitogenomes, and three correction factors: (a) a percent-coverage threshold to filter out false positives, (b) an internal-standard DNA spike-in to correct for stochasticity during sequencing, and (c) technical replicates to correct for stochasticity across sequencing runs. The SPIKEPIPE pipeline achieves a strikingly high accuracy of intraspecific abundance estimates (in terms of DNA mass) from samples of known composition (mapping to barcodes R2 = .93, mitogenomes R2 = .95) and a high repeatability across environmental-sample replicates (barcodes R2 = .94, mitogenomes R2 = .93). As proof of concept, we sequence arthropod samples from the High Arctic, systematically collected over 17 years, detecting changes in species richness, species-specific abundances, and phenology. SPIKEPIPE provides cost-efficient and reliable quantification of eukaryotic communities

    Selective-logging and oil palm: multitaxon impacts, biodiversity indicators, and trade-offs for conservation planning

    Get PDF
    Strong global demand for tropical timber and agricultural products has driven large-scale logging and subsequent conversion of tropical forests. Given that the majority of tropical landscapes have been or will likely be logged, the protection of biodiversity within tropical forests thus depends on whether species can persist in these economically exploited lands, and if species cannot persist, whether we can protect enough primary forest from logging and conversion. However, our knowledge of the impact of logging and conversion on biodiversity is limited to a few taxa, often sampled in different locations with complex land-use histories, hampering attempts to plan cost-effective conservation strategies and to draw conclusions across taxa. Spanning a land-use gradient of primary forest, once- and twice-logged forests, and oil palm plantations, we used traditional sampling and DNA metabarcoding to compile an extensive data set in Sabah, Malaysian Borneo for nine vertebrate and invertebrate taxa to quantify the biological impacts of logging and oil palm, develop cost-effective methods of protecting biodiversity, and examine whether there is congruence in response among taxa. Logged forests retained high species richness, including, on average, 70% of species found in primary forest. In contrast, conversion to oil palm dramatically reduces species richness, with significantly fewer primary-forest species than found on logged forest transects for seven taxa. Using a systematic conservation planning analysis, we show that efficient protection of primary-forest species is achieved with land portfolios that include a large proportion of logged-forest plots. Protecting logged forests is thus a cost-effective method of protecting an ecologically and taxonomically diverse range of species, particularly when conservation budgets are limited. Six indicator groups (birds, leaf-litter ants, beetles, aerial hymenopterans, flies, and true bugs) proved to be consistently good predictors of the response of the other taxa to logging and oil palm. Our results confidently establish the high conservation value of logged forests and the low value of oil palm. Cross-taxon congruence in responses to disturbance also suggests that the practice of focusing on key indicator taxa yields important information of general biodiversity in studies of logging and oil palm

    Measuring protected-area effectiveness using vertebrate distributions from leech iDNA

    Get PDF
    Protected areas are key to meeting biodiversity conservation goals, but direct measures of effectiveness have proven difficult to obtain. We address this challenge by using environmental DNA from leech-ingested bloodmeals to estimate spatially-resolved vertebrate occupancies across the 677 km 2 Ailaoshan reserve in Yunnan, China. From 30,468 leeches collected by 163 park rangers across 172 patrol areas, we identify 86 vertebrate species, including amphibians, mammals, birds and squamates. Multi-species occupancy modelling shows that species richness increases with elevation and distance to reserve edge. Most large mammals (e.g. sambar, black bear, serow, tufted deer) follow this pattern; the exceptions are the three domestic mammal species (cows, sheep, goats) and muntjak deer, which are more common at lower elevations. Vertebrate occupancies are a direct measure of conservation outcomes that can help guide protected-area management and improve the contributions that protected areas make towards global biodiversity goals. Here, we show the feasibility of using invertebrate-derived DNA to estimate spatially-resolved vertebrate occupancies across entire protected areas

    Reliable, verifiable and efficient monitoring of biodiversity via metabarcoding

    Get PDF
    To manage and conserve biodiversity, one must know what is being lost, where, and why, as well as which remedies are likely to be most effective. Metabarcoding technology can characterise the species compositions of mass samples of eukaryotes or of environmental DNA. Here, we validate metabarcoding by testing it against three high‐quality standard data sets that were collected in Malaysia (tropical), China (subtropical) and the United Kingdom (temperate) and that comprised 55,813 arthropod and bird specimens identified to species level with the expenditure of 2,505 person‐hours of taxonomic expertise. The metabarcode and standard data sets exhibit statistically correlated alpha‐ and beta‐diversities, and the two data sets produce similar policy conclusions for two conservation applications: restoration ecology and systematic conservation planning. Compared with standard biodiversity data sets, metabarcoded samples are taxonomically more comprehensive, many times quicker to produce, less reliant on taxonomic expertise and auditable by third parties, which is essential for dispute resolution.We thank Yang Yahan, Alice Wang, Vincent Moulton, David Warton and Wadud Miah for support and advice and to Ding Zhaoli for sequencing. LA, YT, AN and RK were supported by the Queensland‐Chinese Academy of Sciences (QCAS) Biotechnology Fund (GJHZ1130) and Griffith University. DPE was supported by a STEP fellowship at Princeton University. SP was supported by the Natural Environment Research Council, Forestry Commission, Norfolk Biodiversity Information Service and Suffolk Biodiversity Partnership. Additional support for DPE, PW, FAE, THL and WHH was provided by a grant from the High Meadows Foundation to DSW. YQJ, XYW and DWY were supported by Yunnan Province (20080A001), the Chinese Academy of Sciences (0902281081, KSCX2‐YW‐Z‐1027), the National Natural Science Foundation of China (31170498), the Ministry of Science and Technology of China (2012FY110800), the University of East Anglia, and the State Key Laboratory of Genetic Resources and Evolution at the Kunming Institute of Zoology
    • …
    corecore