43 research outputs found

    One species in eight : DNA barcodes from type specimens resolve a taxonomic quagmire

    Get PDF
    Each holotype specimen provides the only objective link to a particular Linnean binomen. Sequence information from them is increasingly valuable due to the growing usage of DNA barcodes in taxonomy. As type specimens are often old, it may only be possible to recover fragmentary sequence information from them. We tested the efficacy of short sequences from type specimens in the resolution of a challenging taxonomic puzzle: the Elachista dispunctella complex which includes 64 described species with minuscule morphological differences. We applied a multistep procedure to resolve the taxonomy of this species complex. First, we sequenced a large number of newly collected specimens and as many holotypes as possible. Second, we used all >400 bp examine species boundaries. We employed three unsupervised methods (BIN, ABGD, GMYC) with specified criteria on how to handle discordant results and examined diagnostic bases from each delineated putative species (operational taxonomic units, OTUs). Third, we evaluated the morphological characters of each OTU. Finally, we associated short barcodes from types with the delineated OTUs. In this step, we employed various supervised methods, including distance-based, tree-based and character-based. We recovered 658 bp barcode sequences from 194 of 215 fresh specimens and recovered an average of 141bp from 33 of 42 holotypes. We observed strong congruence among all methods and good correspondence with morphology. We demonstrate potential pitfalls with tree-, distance- and character-based approaches when associating sequences of varied length. Our results suggest that sequences as short as 56bp can often provide valuable taxonomic information. The results support significant taxonomic oversplitting of species in the Elachista dispunctella complex.Peer reviewe

    A reference library for Canadian invertebrates with 1.5 million barcodes, voucher specimens, and DNA samples

    Get PDF
    The synthesis of this dataset was enabled by funding from the Canada Foundation for Innovation, from Genome Canada through Ontario Genomics, from NSERC, and from the Ontario Ministry of Research, Innovation and Science in support of the International Barcode of Life project. It was also enabled by philanthropic support from the Gordon and Betty Moore Foundation and from Ann McCain Evans and Chris Evans. The release of the data on GGBN was supported by a GGBN – Global Genome Initiative Award and we thank G. Droege, L. Loo, K. Barker, and J. Coddington for their support. Our work depended heavily on the analytical capabilities of the Barcode of Life Data Systems (BOLD, www.boldsystems.org). We also thank colleagues at the CBG for their support, including S. Adamowicz, S. Bateson, E. Berzitis, V. Breton, V. Campbell, A. Castillo, C. Christopoulos, J. Cossey, C. Gallant, J. Gleason, R. Gwiazdowski, M. Hajibabaei, R. Hanner, K. Hough, P. Janetta, A. Pawlowski, S. Pedersen, J. Robertson, D. Roes, K. Seidle, M. A. Smith, B. St. Jacques, A. Stoneham, J. Stahlhut, R. Tabone, J.Topan, S. Walker, and C. Wei. For bioblitz-related assistance, we are grateful to D. Ireland, D. Metsger, A. Guidotti, J. Quinn and other members of Bioblitz Canada and Ontario Bioblitz. For our work in Canada’s national parks, we thank S. Woodley and J. Waithaka for their lead role in organizing permits and for the many Parks Canada staff who facilitated specimen collections, including M. Allen, D. Amirault-Langlais, J. Bastick, C. Belanger, C. Bergman, J.-F. Bisaillon, S. Boyle, J. Bridgland, S. Butland, L. Cabrera, R. Chapman, J. Chisholm, B. Chruszcz, D. Crossland, H. Dempsey, N. Denommee, T. Dobbie, C. Drake, J. Feltham, A. Forshner, K. Forster, S. Frey, L. Gardiner, P. Giroux, T. Golumbia, D. Guedo, N. Guujaaw, S. Hairsine, E. Hansen, C. Harpur, S. Hayes, J. Hofman, S. Irwin, B. Johnston, V. Kafa, N. Kang, P. Langan, P. Lawn, M. Mahy, D. Masse, D. Mazerolle, C. McCarthy, I. McDonald, J. McIntosh, C. McKillop, V. Minelga, C. Ouimet, S. Parker, N. Perry, J. Piccin, A. Promaine, P. Roy, M. Savoie, D. Sigouin, P. Sinkins, R. Sissons, C. Smith, R. Smith, H. Stewart, G. Sundbo, D. Tate, R. Tompson, E. Tremblay, Y. Troutet, K. Tulk, J. Van Wieren, C. Vance, G. Walker, D. Whitaker, C. White, R. Wissink, C. Wong, and Y. Zharikov. For our work near Canada’s ports in Vancouver, Toronto, Montreal, and Halifax, we thank R. Worcester, A. Chreston, M. Larrivee, and T. Zemlak, respectively. Many other organizations improved coverage in the reference library by providing access to specimens – they included the Canadian National Collection of Insects, Arachnids and Nematodes, Smithsonian Institution’s National Museum of Natural History, the Canadian Museum of Nature, the University of Guelph Insect Collection, the Royal British Columbia Museum, the Royal Ontario Museum, the Pacifc Forestry Centre, the Northern Forestry Centre, the Lyman Entomological Museum, the Churchill Northern Studies Centre, and rare Charitable Research Reserve. We also thank the many taxonomic specialists who identifed specimens, including A. Borkent, B. Brown, M. Buck, C. Carr, T. Ekrem, J. Fernandez Triana, C. Guppy, K. Heller, J. Huber, L. Jacobus, J. Kjaerandsen, J. Klimaszewski, D. Lafontaine, J-F. Landry, G. Martin, A. Nicolai, D. Porco, H. Proctor, D. Quicke, J. Savage, B. C. Schmidt, M. Sharkey, A. Smith, E. Stur, A. Tomas, J. Webb, N. Woodley, and X. Zhou. We also thank K. Kerr and T. Mason for facilitating collections at Toronto Zoo and D. Iles for servicing the trap at Wapusk National Park. This paper contributes to the University of Guelph’s Food from Thought research program supported by the Canada First Research Excellence Fund. The Barcode of Life Data System (BOLD; www.boldsystems.org)8 was used as the primary workbench for creating, storing, analyzing, and validating the specimen and sequence records and the associated data resources48. The BOLD platform has a private, password-protected workbench for the steps from specimen data entry to data validation (see details in Data Records), and a public data portal for the release of data in various formats. The latter is accessible through an API (http://www.boldsystems.org/index.php/resources/api?type=webservices) that can also be controlled through R75 with the package ‘bold’76.Peer reviewedPublisher PD

    Next generation sequencing-aided comprehensive geographic coverage sheds light on the status of rare and extinct populations of Aporia butterflies (Lepidoptera: Pieridae)

    No full text
    Abstract The Black-veined White Aporia crataegi (Linnaeus, 1758), a common and widespread butterfly ranging from northwestern Africa to Europe and Asia, has been extinct in Britain since the 1920s and is on a steady decline in several other parts of its range. In order to investigate genetic diversity within A. crataegi and its correspondence with current subspecies-level taxonomy, we barcoded 173 specimens from across its range including, for the first time, extinct populations from Britain and Korea. Using next generation sequencing we also obtained a sequence for Aporia joubini, a peculiar taxon from China known only by its type specimen collected in the early twentieth century. Our phylogenetic analysis placed A. joubini sister to A. oberthuri, although further taxon sampling may reveal a different scheme. Within A. crataegi, we observed a shallow and weak mitogenomic structure with only a few distinct lineages in North Africa, Sicily, Iran, and Japan. Eurasian populations, including those extinct in Britain and Korea, clustered into a large set of closely allied lineages, consistent with a recent expansion during the Late Pleistocene glacial period. This study highlights the importance of museum collections and the unique opportunities they provide in documenting species diversity and helping conservation efforts

    Data from: DNA barcodes from century-old type specimens using next generation sequencing

    No full text
    Type specimens have high scientific importance because they provide the only certain connection between the application of a Linnean name and a physical specimen. Many other individuals may have been identified as a particular species, but their linkage to the taxon concept is inferential. Because type specimens are often more than a century old and have experienced conditions unfavorable for DNA preservation, success in sequence recovery has been uncertain. The present study addresses this challenge by employing next generation sequencing (NGS) to recover sequences for the barcode region of the cytochrome c oxidase 1 gene from small amounts of template DNA. DNA quality was first screened in more than 1800 century-old type specimens of Lepidoptera by attempting to recover 164bp and 94bp reads via Sanger sequencing. This analysis permitted the assignment of each specimen to one of three DNA quality categories – high (164bp sequence), medium (94bp sequence), or low (no sequence). Ten specimens from each category were subsequently analyzed via a PCR-based NGS protocol requiring very little template DNA. It recovered sequence information from all specimens with average read lengths ranging from 458bp to 610bp for the three DNA categories. By sequencing ten specimens in each NGS run, costs were similar to Sanger analysis. Future increases in the number of specimens processed in each run promise substantial reductions in cost, making it possible to anticipate a future where barcode sequences are available from most type specimens

    Sequence alignments and tree files

    No full text
    The file contains sequence alignments of sequences generated using Sanger sequencing and next generation sequencing. The NJ tree files associated with Figures 3 and 4 are also included in this file

    DNA barcoding as an aid for species identification in Austral black flies (Insecta: Diptera: Simuliidae)

    No full text
    ABSTRACT. In this paper, the utility of a partial sequence of the COI gene, the DNA barcoding region, for the identification of species of black flies in the Austral Region was assessed. Twenty eight morphospecies were analysed, 8 in the genus Austrosimulium (4 species in the subgenus Austrosimulium s.str., 3 in the subgenus Novaustrosimulium, and 1 species unassigned to subgenus), 2 of the genus Cnesia, 8 of Gigantodax, 3 of Paracnephia, 1 Paraustrosimulium and 4 species to Simulium, subgenera (Morops, Nevermannia, and Pternaspatha). The Neighbour Joining tree derived from the DNA barcodes sequences grouped most specimens according to species or species groups recognized by morphotaxonomic studies. Intraspecific sequence divergences within morphologically distinct species ranged from 0% to 1.8%, while higher divergences (2% - 4.2%) in certain species suggested the presence of cryptic diversity. The existence of well-defined groups within S. simile revealed the likely inclusion of cryptic diversity. DNA barcodes also showed that specimens of species identified as C. dissimilis, C. nr. pussilla and C. ornata might be conspecific, suggesting possible synonymy. DNA barcoding combined with a sound morphotaxonomic framework would provide an effective approach for the identification of black flies in the region.The accepted manuscript in pdf format is listed with the files at the bottom of this page. The presentation of the authors' names and (or) special characters in the title of the manuscript may differ slightly between what is listed on this page and what is listed in the pdf file of the accepted manuscript; that in the pdf file of the accepted manuscript is what was submitted by the author

    DNA barcoding as an aid for species identification in austral black flies (Insecta: Diptera: Simuliidae)

    No full text
    In this paper, the utility of a partial sequence of the COI gene, the DNA barcoding region, for the identification of species of black flies in the austral region was assessed. Twenty-eight morphospecies were analyzed: eight of the genus Austrosimulium (four species in the subgenus Austrosimulium s. str., three species in the subgenus Novaustrosimulium, and one species unassigned to subgenus), two of the genus Cnesia, eight of Gigantodax, three of Paracnephia, one of Paraustrosimulium, and six of Simulium (subgenera Morops, Nevermannia, and Pternaspatha). The neighbour-joining tree derived from the DNA barcode sequences grouped most specimens according to species or species groups recognized by morphotaxonomic studies. Intraspecific sequence divergences within morphologically distinct species ranged from 0% to 1.8%, while higher divergences (2%-4.2%) in certain species suggested the presence of cryptic diversity. The existence of well-defined groups within S. simile revealed the likely inclusion of cryptic diversity. DNA barcodes also showed that specimens identified as C. dissimilis, C. nr. pussilla, and C. ornata might be conspecific, suggesting possible synonymy. DNA barcoding combined with a sound morphotaxonomic framework would provide an effective approach for the identification of black flies in the region.Fil: Hernández Triana, Luis M.. Animal and Plant Health Agency; Reino UnidoFil: Montes de Oca, Fernanda. Consejo Nacional de Investigaciones Científicas y Técnicas; Argentina. Administración de Parques Nacionales. Parque Nacional "Nahuel Huapi"; ArgentinaFil: Prosser, Sean W. J.. University of Guelph; CanadáFil: Hebert, Paul David Neil. University of Guelph; CanadáFil: Gregory, T. Ryan. University of Guelph; CanadáFil: McMurtrie, Shelley. EOS Ecology; Nueva Zeland

    Calibrating the taxonomy of a megadiverse insect family: 3000 DNA barcodes from geometrid type specimens (Lepidoptera, Geometridae)

    No full text
    It is essential that any DNA barcode reference library be based upon correctly identified specimens. The Barcode of Life Data Systems (BOLD) requires information such as images, geo-referencing, and details on the museum holding the voucher specimen for each barcode record to aid recognition of potential misidentifications. Nevertheless, there are misidentifications and incomplete identifications (e.g. to a genus or family) on BOLD, mainly for species from tropical regions. Unfortunately, experts are often unavailable to correct taxonomic assignments due to time constraints and the lack of specialists for many groups and regions. However, considerable progress could be made if barcode records were available for all type specimens. As a result of recent improvements in analytical protocols, it is now possible to recover barcode sequences from museum specimens that date to the start of taxonomic work in the 18th Century. The present study discusses success in the recovery of DNA barcode sequences from 2805 type specimens of geometrid moths which represent 1965 species, corresponding to about 9% of the 23,000 described species in this family worldwide and including 1875 taxa represented by name-bearing types. Sequencing success was high (73% of specimens), even for specimens that were more than a century old. Several case studies are discussed to show the efficiency, reliability, and sustainability of this approach.The accepted manuscript in pdf format is listed with the files at the bottom of this page. The presentation of the authors' names and (or) special characters in the title of the manuscript may differ slightly between what is listed on this page and what is listed in the pdf file of the accepted manuscript; that in the pdf file of the accepted manuscript is what was submitted by the author

    A Sequel to Sanger: amplicon sequencing that scales

    No full text
    Abstract Background Although high-throughput sequencers (HTS) have largely displaced their Sanger counterparts, the short read lengths and high error rates of most platforms constrain their utility for amplicon sequencing. The present study tests the capacity of single molecule, real-time (SMRT) sequencing implemented on the SEQUEL platform to overcome these limitations, employing 658 bp amplicons of the mitochondrial cytochrome c oxidase I gene as a model system. Results By examining templates from more than 5000 species and 20,000 specimens, the performance of SMRT sequencing was tested with amplicons showing wide variation in GC composition and varied sequence attributes. SMRT and Sanger sequences were very similar, but SMRT sequencing provided more complete coverage, especially for amplicons with homopolymer tracts. Because it can characterize amplicon pools from 10,000 DNA extracts in a single run, the SEQUEL can reduce greatly reduce sequencing costs in comparison to first (Sanger) and second generation platforms (Illumina, Ion). Conclusions SMRT analysis generates high-fidelity sequences from amplicons with varying GC content and is resilient to homopolymer tracts. Analytical costs are low, substantially less than those for first or second generation sequencers. When implemented on the SEQUEL platform, SMRT analysis enables massive amplicon characterization because each instrument can recover sequences from more than 5 million DNA extracts a year

    Phylogenetic reassignment of basal cyclostome braconid parasitoid wasps (Hymenoptera) with description of a new, enigmatic Afrotropical tribe with a highly anomalous 28S D2 secondary structure

    No full text
    A new tribe of braconid wasps provisionally included in the Rhyssalinae, Laibaleini trib. nov., type genus Laibalea gen. nov. (type species Laibalea enigmatica sp. nov.), from Kenya and the Central African Republic, is described. A molecular dataset, with emphasis on basally derived taxa based on four gene fragments (28S D2–D3 expansion region, COI barcode, elongation factor 1-alpha and 16S ribosomal DNA), was analysed both alone and in combination with a morphological dataset. Molecular phylogenetic placement of the new species into an existing subfamily is complicated by the extreme sequence divergence of the three sequences obtained for Laibalea. In both the combined sequence analysis and the combined DNA plus morphological tree, Laibalea is recovered as a sister group to the Rhyssalinae plus all non-cyclostome lineage braconids excluding Mesostoinae, Maxfischeriinae and Aphidiinae. A consensus of morphological characters and molecular analyses suggests inclusion of Laibalea either in the otherwise principally Holarctic subfamily Rhyssalinae or perhap more basally, in the principally Gondwanan Mesostoinae s.l., although we cannot exclude the possibility that it might represent a separate basal lineage. We place Laibalea in its own tribe, provisionally included in Rhyssalinae. The DNA sequence data are presented for several genera for the first time. Avga, the type genus of Avgini, is shown not to belong to Mesostoinae s.l. or Hormiinae, but its exact relationships remain uncertain. The generic compositions of Rhyssalinae and Mesostoinae s.l. are revised. Anachyra, Apoavga, Neptihormius, Neoavga and Opiopterus are shown to belong to Mesostoinae s.s. A key to the tribes of Rhyssalinae is provided
    corecore