118 research outputs found

    Mitochondrial metagenomics: letting the genes out of the bottle

    Get PDF
    ‘Mitochondrial metagenomics’ (MMG) is a methodology for shotgun sequencing of total DNA from specimen mixtures and subsequent bioinformatic extraction of mitochondrial sequences. The approach can be applied to phylogenetic analysis of taxonomically selected taxa, as an economical alternative to mitogenome sequencing from individual species, or to environmental samples of mixed specimens, such as from mass trapping of invertebrates. The routine generation of mitochondrial genome sequences has great potential both for systematics and community phylogenetics. Mapping of reads from low-coverage shotgun sequencing of environmental samples also makes it possible to obtain data on spatial and temporal turnover in whole-community phylogenetic and species composition, even in complex ecosystems where species-level taxonomy and biodiversity patterns are poorly known. In addition, read mapping can produce information on species biomass, and potentially allows quantification of within-species genetic variation. The success of MMG relies on the formation of numerous mitochondrial genome contigs, achievable with standard genome assemblers, but various challenges for the efficiency of assembly remain, particularly in the face of variable relative species abundance and intra-specific genetic variation. Nevertheless, several studies have demonstrated the power of mitogenomes from MMG for accurate phylogenetic placement, evolutionary analysis of species traits, biodiversity discovery and the establishment of species distribution patterns; it offers a promising avenue for unifying the ecological and evolutionary understanding of species diversity

    Developing MMG: A method for the study of biodiversity linking taxonomy, phylogeny and ecology

    Get PDF
    High-throughput sequencing technologies are changing the way in which diversity is studied at all scales and has the greatest potential to facilitate studies of taxa that are intractable to other methods. Insect ecology is one such field, with great abundance and diversity combining with incomplete taxonomic knowledge to hamper studies of diversity at large spatial and temporal scales. A new high-throughput method has recently been proposed to address such issues within a self-contained phylogenetic framework that is linkable with existing biological knowledge via Linnaean taxonomy. This method, ‘mitochondrial metagenomics’ (MMG), has already been the subject of a number of proof-of-concept studies, frequently focussed on Coleoptera. These studies are unified here with additional similar datasets for the first time to draw together the lessons to be learnt from the results obtained to date and infer the immediate methodological questions that remain to be answered. Particular attention is paid to the prospect of bulk sequencing of mixed specimens and the associated bioinformatics challenges. Consideration is given to mitochondrial phylogeny reconstruction with the prospect of rapidly increasing taxon sampling and the potential for phylogeny-based taxonomy assignment for otherwise uncharacterised communities. Mitochondrial metagenomics is then applied to a landscape-level assessment of the response of the leaf litter beetle communities to habitat differences, taking a combined compositional and phylogenetic perspective. Finally, the results are synthesised for a perspective on the remaining methodological impediments to the further development of MMG, and the future prospects for synthetic analyses of diversity are considered

    Intraspecific genetic variation in complex assemblages from mitochondrial metagenomics: comparison with DNA barcodes

    Get PDF
    Metagenomic shotgun sequencing, using Illumina technology, and de novo genome assembly of mixed field-collected amples of invertebrates readily produce mitochondrial genome sequences, allowing rapid identification and quantification of species diversity. However, intraspecific genetic variability present in the specimen pools is lost during mitogenome assembly, which limits the utility of ‘mitochondrial metagenomics’ for studies of population diversity. 2. Using 10 natural communities (>2600 individuals) of leaf beetles (Chrysomelidae), DNA variation in the mitochondrial cox1-5’ ‘barcode’ was compared for Sanger sequenced individuals and Illumina shotgun sequenced specimen pools. 3. Generally, only a single mitochondrial contig was assembled per species, even in the presence of intraspecific variation. Ignoring ambiguity from the use of two different assemblers, the cox1 barcode regions from these assemblies were exact nucleotide matches of a Sanger sequenced barcode in 90.7% of cases, which dropped to 76.0% in assemblies from samples with large intra and interspecific variability. Nucleotide differences between barcodes from both data types were almost exclusively in synonymous 3rd codon position, although the number of affected sites was very low, and the greatest discrepancies were correlated with poor quality of Sanger sequences. 4. Unassembled shotgun reads were also used to score single nucleotide polymorphisms and to calculate intraspecific nucleotide diversity (pi) for all available populations at each site. These values correlated with Sanger sequenced cox1 variation but were significantly higher. 5. Overall, the assemblage-focused shotgun sequencing of pooled samples produced nucleotide variation data comparable to the well-established specimen-focused Sanger approach. The findings thus extend the application of mitochondrial metagenomics of complex biodiversity samples to the estimation of diversity below the species level

    Metagenome skimming of insect specimen pools: potential for comparative genomics

    Get PDF
    Metagenomic analyses are challenging in metazoans, but high-copy number and repeat regions can be assembled from lowcoverage sequencing by “genome skimming,” which is applied here as a new way of characterizing metagenomes obtained in an ecological or taxonomic context. Illumina shotgun sequencing on two pools of Coleoptera (beetles) of approximately 200 species each were assembled into tens of thousands of scaffolds. Repeated low-coverage sequencing recovered similar scaffold sets consistently, although approximately 70% of scaffolds could not be identified against existing genome databases. Identifiable scaffolds included mitochondrial DNA, conserved sequences with hits to expressed sequence tag and protein databases, and knownrepeatelementsof high and low complexity, includingnumerous copies ofrRNAandhistone genes.Assemblies of histones captured a diversity of gene order and primary sequence in Coleoptera. Scaffolds with similarity to multiple sites in available coleopteran genome sequences for Dendroctonus and Tribolium revealed high specificity of scaffolds to either of these genomes, in particular for high-copy number repeats. Numerous “clusters” of scaffolds mapped to the same genomic site revealed intraand/or intergenomic variation within a metagenome pool. In addition to effect of taxonomic composition of the metagenomes, the number of mapped scaffolds also revealed structural differences between the two reference genomes, although the significance of this striking finding remains unclear. Finally, apparently exogenous sequences were recovered, including potential food plants, fungal pathogens, and bacterial symbionts. The “metagenome skimming” approach is useful for capturing the genomic diversity of poorly studied, species-rich lineages and opens new prospects in environmental genomic

    The mitogenome of Hydropsyche pellucidula (Hydropsychidae): first gene arrangement in the insect order Trichoptera

    Get PDF
    International audienceWe describe the mitochondrial genome of Hydropsyche pellucidula Curtis 1834, which is first described for the suborder Annulipalpia and the first in the order Trichoptera to show a non-canonical gene order. The mitogenome was obtained by de novo assembly of shotgun sequenced total genomic DNA using Illumina Miseq technology, which produced an average coverage of 115× and a minimum coverage of 48×. The mitochondrial genome includes 13 protein-coding genes, 2 rRNAs and 22 tRNAs. The genome is characterized by a rearrangement in the relative position of protein-coding and ribosomal genes. This mitogenome sequence will be useful for studying the family Hydropsychidae, which is commonly used for freshwater pollution biomonitoring

    Bulk de novo mitogenome assembly from pooled total DNA elucidates the phylogeny of weevils (Coleoptera: Curculionoidea)

    Get PDF
    Complete mitochondrial genomes have been shown to be reliable markers for phylogeny reconstruction among diverse animal groups. However, the relative difficulty and high cost associated with obtaining de novo full mitogenomes have frequently led to conspicuously low taxon sampling in ensuing studies. Here, we report the successful use of an economical and accessible method for assembling complete or near-complete mitogenomes through shot-gun next-generation sequencing of a single library made from pooled total DNA extracts of numerous target species. To avoid the use of separate indexed libraries for each specimen, and an associated increase in cost, we incorporate standard polymerase chain reaction-based “bait” sequences to identify the assembled mitogenomes. The method was applied to study the higher level phylogenetic relationships in the weevils (Coleoptera: Curculionoidea), producing 92 newly assembled mitogenomes obtained in a single Illumina MiSeq run. The analysis supported a separate origin of wood-boring behavior by the subfamilies Scolytinae, Platypodinae, and Cossoninae. This finding contradicts morphological hypotheses proposing a close relationship between the first two of these but is congruent with previous molecular studies, reinforcing the utility of mitogenomes in phylogeny reconstruction. Our methodology provides a technically simple procedure for generating densely sampled trees from whole mitogenomes and is widely applicable to groups of animals for which bait sequences are the only required prior genome knowledge

    Soup to tree: the phylogeny of beetles inferred by mitochondrial metagenomics of a Bornean rainforest sample

    Get PDF
    In spite of the growth of molecular ecology, systematics and next-generation sequencing, the discovery and analysis of diversity is not currently integrated with building the tree-of-life. Tropical arthropod ecologists are well placed to accelerate this process if all specimens obtained via masstrapping, many of which will be new species, could be incorporated routinely in phylogeny reconstruction. Here we test a shotgun sequencing approach, whereby mitochondrial genomes are assembled from complex ecological mixtures via mitochondrial metagenomics, and demonstrate how the approach overcomes many of the taxonomic impediments to the study of biodiversity. DNA from ~500 beetle specimens, originating from a single rainforest canopy fogging sample from Borneo, was pooled and shotgun sequenced, followed by de novo assembly of complete and partial mitogenomes for 175 species. The phylogenetic tree obtained from this local sample was highly similar to that from existing mitogenomes selected for global coverage of major lineages of Coleoptera. When all sequences were combined, only minor topological changes are induced against this reference set, indicating an increasingly stable estimate of coleopteran phylogeny, whilst the ecological sample expands the tip-level representation of several lineages. Robust trees generated from ecological samples now enable an evolutionary framework for ecology. Meanwhile, the inclusion of uncharacterized samples in the tree-of-life rapidly expands taxon and biogeographic representation of lineages without morphological identification. Mitogenomes from shotgun sequencing of unsorted environmental samples and their associated metadata, placed robustly into the phylogenetic tree, constitute novel DNA ‘superbarcodes’ for testing hypotheses regarding global patterns of diversity

    Intraspecific genetic variation in complex assemblages from mitochondrial metagenomics: comparison with DNA barcodes

    Get PDF
    Metagenomic shotgun sequencing, using Illumina technology, and de novo genome assembly of mixed field-collected amples of invertebrates readily produce mitochondrial genome sequences, allowing rapid identification and quantification of species diversity. However, intraspecific genetic variability present in the specimen pools is lost during mitogenome assembly, which limits the utility of ‘mitochondrial metagenomics’ for studies of population diversity. 2. Using 10 natural communities (>2600 individuals) of leaf beetles (Chrysomelidae), DNA variation in the mitochondrial cox1-5’ ‘barcode’ was compared for Sanger sequenced individuals and Illumina shotgun sequenced specimen pools. 3. Generally, only a single mitochondrial contig was assembled per species, even in the presence of intraspecific variation. Ignoring ambiguity from the use of two different assemblers, the cox1 barcode regions from these assemblies were exact nucleotide matches of a Sanger sequenced barcode in 90.7% of cases, which dropped to 76.0% in assemblies from samples with large intra and interspecific variability. Nucleotide differences between barcodes from both data types were almost exclusively in synonymous 3rd codon position, although the number of affected sites was very low, and the greatest discrepancies were correlated with poor quality of Sanger sequences. 4. Unassembled shotgun reads were also used to score single nucleotide polymorphisms and to calculate intraspecific nucleotide diversity (pi) for all available populations at each site. These values correlated with Sanger sequenced cox1 variation but were significantly higher. 5. Overall, the assemblage-focused shotgun sequencing of pooled samples produced nucleotide variation data comparable to the well-established specimen-focused Sanger approach. The findings thus extend the application of mitochondrial metagenomics of complex biodiversity samples to the estimation of diversity below the species level

    An efficient and robust laboratory workflow and tetrapod database for larger scale environmental DNA studies

    Get PDF
    BACKGROUND: The use of environmental DNA for species detection via metabarcoding is growing rapidly. We present a co-designed lab workflow and bioinformatic pipeline to mitigate the 2 most important risks of environmental DNA use: sample contamination and taxonomic misassignment. These risks arise from the need for polymerase chain reaction (PCR) amplification to detect the trace amounts of DNA combined with the necessity of using short target regions due to DNA degradation. FINDINGS: Our high-throughput workflow minimizes these risks via a 4-step strategy: (i) technical replication with 2 PCR replicates and 2 extraction replicates; (ii) using multi-markers (12S,16S,CytB); (iii) a "twin-tagging," 2-step PCR protocol; and (iv) use of the probabilistic taxonomic assignment method PROTAX, which can account for incomplete reference databases. Because annotation errors in the reference sequences can result in taxonomic misassignment, we supply a protocol for curating sequence datasets. For some taxonomic groups and some markers, curation resulted in >50% of sequences being deleted from public reference databases, owing to (i) limited overlap between our target amplicon and reference sequences, (ii) mislabelling of reference sequences, and (iii) redundancy. Finally, we provide a bioinformatic pipeline to process amplicons and conduct PROTAX assignment and tested it on an invertebrate-derived DNA dataset from 1,532 leeches from Sabah, Malaysia. Twin-tagging allowed us to detect and exclude sequences with non-matching tags. The smallest DNA fragment (16S) amplified most frequently for all samples but was less powerful for discriminating at species rank. Using a stringent and lax acceptance criterion we found 162 (stringent) and 190 (lax) vertebrate detections of 95 (stringent) and 109 (lax) leech samples. CONCLUSIONS: Our metabarcoding workflow should help research groups increase the robustness of their results and therefore facilitate wider use of environmental and invertebrate-derived DNA, which is turning into a valuable source of ecological and conservation information on tetrapods
    • 

    corecore