435 research outputs found

    jMOTU and Taxonerator: Turning DNA Barcode Sequences into Annotated Operational Taxonomic Units

    BACKGROUND: DNA barcoding and other DNA sequence-based techniques for investigating and estimating biodiversity require explicit methods for associating individual sequences with taxa, as it is at the taxon level that biodiversity is assessed. For many projects, the bioinformatic analyses required pose problems for laboratories whose prime expertise is not in bioinformatics. User-friendly tools are required for both clustering sequences into molecular operational taxonomic units (MOTU) and for associating these MOTU with known organismal taxonomies. RESULTS: Here we present jMOTU, a Java program for the analysis of DNA barcode datasets that uses an explicit, determinate algorithm to define MOTU. We demonstrate its usefulness for both individual specimen-based Sanger sequencing surveys and bulk-environment metagenetic surveys using long-read next-generation sequencing data. jMOTU is driven through a graphical user interface, and can analyse tens of thousands of sequences in a short time on a desktop computer. A companion program, Taxonerator, that adds traditional taxonomic annotation to MOTU, is also presented. Clustering and taxonomic annotation data are stored in a relational database, and are thus amenable to subsequent data mining and web presentation. CONCLUSIONS: jMOTU efficiently and robustly identifies the molecular taxa present in survey datasets, and Taxonerator decorates the MOTU with putative identifications. jMOTU and Taxonerator are freely available from http://www.nematodes.org/
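The idea of grouping barcode sequences into MOTU under an explicit cutoff can be illustrated with a minimal Python sketch. This is not jMOTU's actual algorithm, which the abstract does not specify. It assumes pre-aligned sequences of equal length, a simple mismatch count as the distance, and single-linkage grouping; the names `mismatches`, `cluster_motu`, and `cutoff` are hypothetical.

```python
from itertools import combinations


def mismatches(a: str, b: str) -> int:
    """Count differing positions between two equal-length, pre-aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for x, y in zip(a, b) if x != y)


def cluster_motu(seqs: dict[str, str], cutoff: int) -> list[set[str]]:
    """Single-linkage grouping: IDs within `cutoff` mismatches end up in one cluster.

    Assumption: this stands in for a real MOTU definition purely for illustration.
    """
    parent = {name: name for name in seqs}

    def find(name: str) -> str:
        # Follow parent links to the cluster representative, compressing the path.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for a, b in combinations(seqs, 2):
        if mismatches(seqs[a], seqs[b]) <= cutoff:
            parent[find(a)] = find(b)  # merge the two clusters

    clusters: dict[str, set[str]] = {}
    for name in seqs:
        clusters.setdefault(find(name), set()).add(name)
    # Sort clusters by their member names so the output order is deterministic.
    return sorted(clusters.values(), key=lambda c: sorted(c))


example = {
    "s1": "ACGTACGT",
    "s2": "ACGTACGA",
    "s3": "TTGTACCA",
    "s4": "TTGTACCC",
}
print(cluster_motu(example, cutoff=1))
```

Because the grouping is single-linkage, two sequences can share a cluster without being within the cutoff of each other, as long as a chain of close neighbours connects them. The all-pairs comparison is quadratic in the number of sequences, so this sketch would not scale to large datasets without further work.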

    Barcoding Bugs: DNA-Based Identification of the True Bugs (Insecta: Hemiptera: Heteroptera)

    oxidase I (COI) gene, has been shown to provide an efficient method for the identification of species in a wide range of animal taxa. In order to assess the effectiveness of barcodes in the discrimination of Heteroptera, we examined 344 species belonging to 178 genera, drawn from specimens in the Canadian National Collection of Insects. Analysis of the COI gene revealed less than 2% intra-specific divergence in 90% of the taxa examined, while minimum interspecific distances exceeded 3% in 77% of congeneric species pairs. Instances where barcodes fail to distinguish species represented clusters of morphologically similar species, except one case of barcode identity between species in different genera. Several instances of deep intraspecific divergence were detected suggesting possible cryptic species. Although this analysis encompasses 0.8% of the described global fauna, our results indicate that DNA barcodes will aid the identification of Heteroptera. This advance will be useful in pest management, regulatory and environmental applications and will also reveal species that require further taxonomic research

    Patterns of primary care and mortality among patients with schizophrenia or diabetes: a cluster analysis approach to the retrospective study of healthcare utilization

    Background: Patients with schizophrenia have difficulty managing their medical healthcare needs, possibly resulting in delayed treatment and poor outcomes. We analyzed whether patients reduced primary care use over time, differentially by diagnosis with schizophrenia, diabetes, or both schizophrenia and diabetes. We also assessed whether such patterns of primary care use were a significant predictor of mortality over a 4-year period. Methods: The Veterans Healthcare Administration (VA) is the largest integrated healthcare system in the United States. Administrative extracts of the VA's all-electronic medical records were studied. Patients over age 50 and diagnosed with schizophrenia in 2002 were age-matched 1:4 to diabetes patients. All patients were followed through 2005. Cluster analysis explored trajectories of primary care use. Proportional hazards regression modelled the impact of these primary care utilization trajectories on survival, controlling for demographic and clinical covariates. Results: Patients comprised three diagnostic groups: diabetes only (n = 188,332), schizophrenia only (n = 40,109), and schizophrenia with diabetes (Scz-DM, n = 13,025). Cluster analysis revealed four distinct trajectories of primary care use: consistent over time, increasing over time, high and decreasing, low and decreasing. Patients with schizophrenia only were likely to have low-decreasing use (73% schizophrenia-only vs 54% Scz-DM vs 52% diabetes). Increasing use was least common among schizophrenia patients (4% vs 8% Scz-DM vs 7% diabetes) and was associated with improved survival. Low-decreasing primary care, compared to consistent use, was associated with shorter survival controlling for demographics and case-mix. The observational study was limited by reliance on administrative data. Conclusion: Regular primary care and high levels of primary care were associated with better survival for patients with chronic illness, whether psychiatric or medical. 
For schizophrenia patients, with or without comorbid diabetes, primary care offers a survival benefit, suggesting that innovations in treatment retention targeting at-risk groups can offer significant promise of improving outcomes. http://deepblue.lib.umich.edu/bitstream/2027.42/78274/1/1472-6963-9-127.xml http://deepblue.lib.umich.edu/bitstream/2027.42/78274/2/1472-6963-9-127.pdf Peer Reviewed

    A Two-Locus Global DNA Barcode for Land Plants: The Coding rbcL Gene Complements the Non-Coding trnH-psbA Spacer Region

    BACKGROUND: A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. METHODOLOGY/PRINCIPAL FINDINGS: Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. CONCLUSIONS/SIGNIFICANCE: A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination

    Evolutionary factors affecting Lactate dehydrogenase A and B variation in the Daphnia pulex species complex

    Background: Evidence for historical, demographic and selective factors affecting enzyme evolution can be obtained by examining nucleotide sequence variation in candidate genes such as Lactate dehydrogenase (Ldh). Two closely related Daphnia species can be distinguished by their electrophoretic Ldh genotype and habitat. Daphnia pulex populations are fixed for the S allele and inhabit temporary ponds, while D. pulicaria populations are fixed for the F allele and inhabit large stratified lakes. One locus is detected in most allozyme surveys, but genome sequencing has revealed two genes, LdhA and LdhB. Results: We sequenced both Ldh genes from 70 isolates of these two species from North America to determine if the association between Ldh genotype and habitat shows evidence for selection, and to elucidate the evolutionary history of the two genes. We found that alleles in the pond-dwelling D. pulex and in the lake-dwelling D. pulicaria form distinct groups at both loci, and the substitution of Glutamine (S) for Glutamic acid (F) at amino acid 229 likely causes the electrophoretic mobility shift in the LDHA protein. Nucleotide diversity in both Ldh genes is much lower in D. pulicaria than in D. pulex. Moreover, the lack of spatial structuring of the variation in both genes over a wide geographic area is consistent with a recent demographic expansion of lake populations. Neutrality tests indicate that both genes are under purifying selection, but the intensity is much stronger on LdhA. Conclusions: Although lake-dwelling D. pulicaria hybridizes with the other lineages in the pulex species complex, it remains distinct ecologically and genetically. This ecological divergence, coupled with the intensity of purifying selection on LdhA and the strong association between its genotype and habitat, suggests that experimental studies would be useful to determine if variation in molecular function provides evidence that LDHA variants are adaptive

    The Genotype Specific Competitive Ability Does Not Correlate with Infection in Natural Daphnia magna Populations

    Different evolutionary hypotheses predict a correlation between the fitness of a genotype in the absence of infection and its likelihood of becoming infected. The cost of resistance hypothesis predicts that resistant genotypes pay a cost of being resistant and are less fit in the absence of parasites. The inbreeding-infection hypothesis predicts that the susceptible individuals are less fit due to inbreeding depression. Here we tested if a host's natural infection status was associated with its fitness. First, we experimentally confirmed that cured but formerly infected Daphnia magna are genetically more susceptible to reinfections with Octosporea bayeri than naturally uninfected D. magna. We then collected from each of 22 populations both uninfected and infected D. magna genotypes. All were treated against parasites and kept in their asexual phase. We estimated their relative fitness in an experiment against a tester genotype and in another experiment in direct competition. Consistently, we found no difference in competitive abilities between uninfected and cured but formerly infected genotypes. This was the case both in the presence and in the absence of sympatric parasites during the competition trials. Our data do not support the inbreeding-infection hypothesis. They also do not support a cost of resistance, although this conclusion does not take other parasite strains or parasite species into account. We suggest as a possible explanation for our results that resistance genes might segregate largely independently of other fitness associated genes in this system

    Taxonomic Reliability of DNA Sequences in Public Sequence Databases: A Fungal Perspective

    BACKGROUND: DNA sequences are increasingly seen as one of the primary information sources for species identification in many organism groups. Such approaches, popularly known as barcoding, are underpinned by the assumption that the reference databases used for comparison are sufficiently complete and feature correctly and informatively annotated entries. METHODOLOGY/PRINCIPAL FINDINGS: The present study uses a large set of fungal DNA sequences from the inclusive International Nucleotide Sequence Database to show that the taxon sampling of fungi is far from complete, that about 20% of the entries may be incorrectly identified to species level, and that the majority of entries lack descriptive and up-to-date annotations. CONCLUSIONS: The problems with taxonomic reliability and insufficient annotations in public DNA repositories form a tangible obstacle to sequence-based species identification, and it is manifest that the greatest challenges to biological barcoding will be of taxonomical, rather than technical, nature

    Epidemiology of a Daphnia-Multiparasite System and Its Implications for the Red Queen

    The Red Queen hypothesis can explain the maintenance of host and parasite diversity. However, the Red Queen requires genetic specificity for infection risk (i.e., that infection depends on the exact combination of host and parasite genotypes) and strongly virulent effects of infection on host fitness. A European crustacean (Daphnia magna) - bacterium (Pasteuria ramosa) system typifies such specificity and high virulence. We studied the North American host Daphnia dentifera and its natural parasite Pasteuria ramosa, and also found strong genetic specificity for infection success and high virulence. These results suggest that Pasteuria could promote Red Queen dynamics with D. dentifera populations as well. However, the Red Queen might be undermined in this system by selection from a more common yeast parasite (Metschnikowia bicuspidata). Resistance to the yeast did not correlate with resistance to Pasteuria among host genotypes, suggesting that selection by Metschnikowia should proceed relatively independently of selection by Pasteuria

    A Cytoplasmic Domain Mutation in ClC-Kb Affects Long-Distance Communication Across the Membrane

    BACKGROUND: ClC-Kb and ClC-Ka are homologous chloride channels that facilitate chloride homeostasis in the kidney and inner ear. Disruption of ClC-Kb leads to Bartter's Syndrome, a kidney disease. A point mutation in ClC-Kb, R538P, linked to Bartter's Syndrome and located in the C-terminal cytoplasmic domain was hypothesized to alter electrophysiological properties due to its proximity to an important membrane-embedded helix. METHODOLOGY/PRINCIPAL FINDINGS: Two-electrode voltage clamp experiments were used to examine the electrophysiological properties of the mutation R538P in both ClC-Kb and ClC-Ka. R538P selectively abolishes extracellular calcium activation of ClC-Kb but not ClC-Ka. In attempting to determine the reason for this specificity, we hypothesized that the ClC-Kb C-terminal domain had either a different oligomeric status or dimerization interface than that of ClC-Ka, for which a crystal structure has been published. We purified a recombinant protein corresponding to the ClC-Kb C-terminal domain and used multi-angle light scattering together with a cysteine-crosslinking approach to show that the dimerization interface is conserved between the ClC-Kb and ClC-Ka C-terminal domains, despite the fact that there are several differences in the amino acids that occur at this interface. CONCLUSIONS: The R538P mutation in ClC-Kb, which leads to Bartter's Syndrome, abolishes calcium activation of the channel. This suggests that a significant conformational change--ranging from the cytoplasmic side of the protein to the extracellular side of the protein--is involved in the Ca(2+)-activation process for ClC-Kb, and shows that the cytoplasmic domain is important for the channel's electrophysiological properties. In the highly similar ClC-Ka (90% identical), the R538P mutation does not affect activation by extracellular Ca(2+). 
This selective outcome indicates that ClC-Ka and ClC-Kb differ in how conformational changes are translated to the extracellular domain, despite the fact that the cytoplasmic domains share the same quaternary structure

    A Ranking System for Reference Libraries of DNA Barcodes: Application to Marine Fish Species from Portugal

    BACKGROUND: The increasing availability of reference libraries of DNA barcodes (RLDB) offers the opportunity to screen the level of consistency in DNA barcode data among libraries, in order to detect possible disagreements generated from taxonomic uncertainty or operational shortcomings. We propose a ranking system to attribute a confidence level to species identifications associated with DNA barcode records from a RLDB. Here we apply the proposed ranking system to a newly generated RLDB for marine fish of Portugal. METHODOLOGY/PRINCIPAL FINDINGS: Specimens (n = 659) representing 102 marine fish species were collected along the continental shelf of Portugal, morphologically identified and archived in a museum collection. Samples were sequenced at the barcode region of the cytochrome oxidase subunit I gene (COI-5P). Resultant DNA barcodes had average intra-specific and inter-specific Kimura-2-parameter distances (0.32% and 8.84%, respectively) within the range usually observed for marine fishes. All specimens were ranked in five different levels (A-E), according to the reliability of the match between their species identification and the respective diagnostic DNA barcodes. Grades A to E were attributed upon submission of individual specimen sequences to BOLD-IDS and inspection of the clustering pattern in the NJ tree generated. Overall, our study resulted in 73.5% of unambiguous species IDs (grade A), 7.8% taxonomically congruent barcode clusters within our dataset, but awaiting external confirmation (grade B), and 18.7% of species identifications with lower levels of reliability (grades C/E). CONCLUSION/SIGNIFICANCE: We highlight the importance of implementing a system to rank barcode records in RLDB, in order to flag taxa in need of taxonomic revision, or reduce ambiguities of discordant data. 
With increasing DNA barcode records publicly available, this cross-validation system would provide a metric of relative accuracy of barcodes, while enabling the continuous revision and annotation required in taxonomic work
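The Kimura-2-parameter distance mentioned in the abstract above has a standard closed form, d = -1/2 ln(1 - 2P - Q) - 1/4 ln(1 - 2Q), where P and Q are the proportions of sites differing by a transition and by a transversion. The Python sketch below computes it for one pair of pre-aligned sequences. The function name `k2p_distance` and the choice to skip gaps and ambiguity codes are assumptions made for illustration, not the procedure used in the study.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def k2p_distance(a: str, b: str) -> float:
    """Kimura 2-parameter distance between two equal-length, pre-aligned DNA sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")

    sites = transitions = transversions = 0
    for x, y in zip(a.upper(), b.upper()):
        if x not in "ACGT" or y not in "ACGT":
            continue  # assumption: ignore gaps and ambiguity codes
        sites += 1
        if x == y:
            continue
        if {x, y} <= PURINES or {x, y} <= PYRIMIDINES:
            transitions += 1    # A<->G or C<->T
        else:
            transversions += 1  # purine <-> pyrimidine

    if sites == 0:
        raise ValueError("no comparable sites")

    p = transitions / sites
    q = transversions / sites
    w1 = 1 - 2 * p - q
    w2 = 1 - 2 * q
    if w1 <= 0 or w2 <= 0:
        raise ValueError("sequences too divergent for the K2P correction")
    return -0.5 * math.log(w1) - 0.25 * math.log(w2)


# One transition in ten sites gives a distance slightly above the raw 10% difference.
print(round(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"), 4))
```

Averaging this value over within-species pairs and over between-species pairs gives the kind of intra-specific and inter-specific summary figures reported in the abstract.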