551 research outputs found

    jMOTU and Taxonerator: Turning DNA Barcode Sequences into Annotated Operational Taxonomic Units

    Get PDF
    BACKGROUND: DNA barcoding and other DNA sequence-based techniques for investigating and estimating biodiversity require explicit methods for associating individual sequences with taxa, as it is at the taxon level that biodiversity is assessed. For many projects, the bioinformatic analyses required pose problems for laboratories whose prime expertise is not in bioinformatics. User-friendly tools are required for both clustering sequences into molecular operational taxonomic units (MOTU) and for associating these MOTU with known organismal taxonomies. RESULTS: Here we present jMOTU, a Java program for the analysis of DNA barcode datasets that uses an explicit, determinate algorithm to define MOTU. We demonstrate its usefulness for both individual specimen-based Sanger sequencing surveys and bulk-environment metagenetic surveys using long-read next-generation sequencing data. jMOTU is driven through a graphical user interface, and can analyse tens of thousands of sequences in a short time on a desktop computer. A companion program, Taxonerator, that adds traditional taxonomic annotation to MOTU, is also presented. Clustering and taxonomic annotation data are stored in a relational database, and are thus amenable to subsequent data mining and web presentation. CONCLUSIONS: jMOTU efficiently and robustly identifies the molecular taxa present in survey datasets, and Taxonerator decorates the MOTU with putative identifications. jMOTU and Taxonerator are freely available from http://www.nematodes.org/

    Search for High Mass Photon Pairs in p-pbar --> gamma-gamma-jet-jet Events at sqrt(s)=1.8 TeV

    Get PDF
    A search has been carried out for events in the channel p-barp --> gamma gamma jet jet. Such a signature can characterize the production of a non-standard Higgs boson together with a W or Z boson. We refer to this non-standard Higgs, having standard model couplings to vector bosons but no coupling to fermions, as a "bosonic Higgs." With the requirement of two high transverse energy photons and two jets, the diphoton mass (m(gamma gamma)) distribution is consistent with expected background. A 90(95)% C.L. upper limit on the cross section as a function of mass is calculated, ranging from 0.60(0.80) pb for m(gamma gamma) = 65 GeV/c^2 to 0.26(0.34) pb for m(gamma gamma) = 150 GeV/c^2, corresponding to a 95% C.L. lower limit on the mass of a bosonic Higgs of 78.5 GeV/c^2.Comment: 9 pages, 3 figures. Replacement has new H->gamma gamma branching ratios and corresponding new mass limit

    Ratio of the Isolated Photon Cross Sections at \sqrt{s} = 630 and 1800 GeV

    Get PDF
    The inclusive cross section for production of isolated photons has been measured in \pbarp collisions at s=630\sqrt{s} = 630 GeV with the \D0 detector at the Fermilab Tevatron Collider. The photons span a transverse energy (ETE_T) range from 7-49 GeV and have pseudorapidity η<2.5|\eta| < 2.5. This measurement is combined with to previous \D0 result at s=1800\sqrt{s} = 1800 GeV to form a ratio of the cross sections. Comparison of next-to-leading order QCD with the measured cross section at 630 GeV and ratio of cross sections show satisfactory agreement in most of the ETE_T range.Comment: 7 pages. Published in Phys. Rev. Lett. 87, 251805, (2001

    Statin Induced Myopathy and Myalgia: Time Trend Analysis and Comparison of Risk Associated with Statin Class from 1991-2006

    Get PDF
    Background: Statins are widely used as a cholesterol lowering medication, reduce cardiovascular mortality and morbidity in high risk patients; and only rarely cause serious adverse drug reactions (ADRs). UK primary care databases of morbidity and prescription data, which now cover several million people, have potential for more powerful analytical approaches to study ADRs including adjusting for confounders and examining temporal effects.Methods: Case-crossover design in detecting statin associated myopathy ADR in 93, 831 patients, using two independent primary care databases (1991-2006). We analysed risk by drug class, by disease code and cumulative year, exploring different cut-off exposure times and confounding by temporality.Results: Using a 12 and 26 week exposure period, large risk ratios (RR) are associated with all classes of statins and fibrates for myopathy: RR 10.6 (9.8-11.4) and 19.9 (17.6-22.6) respectively. At 26 weeks, the largest risks are with fluvastatin RR 33.3 (95% CI 16.8-66.0) and ciprofibrate (with previous statin use) RR 40.5 (95% CI 13.4-122.0). AT 12 weeks the differences between cerivastatin and atorvastatin RR for myopathy were found to be significant, RR 2.05 (95% CI 1.2-3.5), and for rosuvastatin and fluvastatin RR 3.0 (95% CI 1.6-5.7). After 12 months of statin initiation, the relative risk for myopathy for all statins and fibrates increased to 25.7 (95% CI 21.8-30.3). Furthermore, this signal was detected within 2 years of first events being recorded. Our data suggests an annual incidence of statin induced myopathy or myalgia of around 11.4 for 16, 591 patients or 689 per million per year.Conclusion: There may be differential risks associated with some classes of statin and fibrate. Myopathy related to statin or fibrate use may persist after a long exposure time (12 months or more). These methods could be applied for early detection of harmful drug side effects, using similar primary care diagnostic and prescribing data.</p

    The Barcode of Life Data Portal: Bridging the Biodiversity Informatics Divide for DNA Barcoding

    Get PDF
    With the volume of molecular sequence data that is systematically being generated globally, there is a need for centralized resources for data exploration and analytics. DNA Barcode initiatives are on track to generate a compendium of molecular sequence–based signatures for identifying animals and plants. To date, the range of available data exploration and analytic tools to explore these data have only been available in a boutique form—often representing a frustrating hurdle for many researchers that may not necessarily have resources to install or implement algorithms described by the analytic community. The Barcode of Life Data Portal (BDP) is a first step towards integrating the latest biodiversity informatics innovations with molecular sequence data from DNA barcoding. Through establishment of community driven standards, based on discussion with the Data Analysis Working Group (DAWG) of the Consortium for the Barcode of Life (CBOL), the BDP provides an infrastructure for incorporation of existing and next-generation DNA barcode analytic applications in an open forum

    High Levels of Diversity Uncovered in a Widespread Nominal Taxon: Continental Phylogeography of the Neotropical Tree Frog

    Get PDF
    Species distributed across vast continental areas and across major biomes provide unique model systems for studies of biotic diversification, yet also constitute daunting financial, logistic and political challenges for data collection across such regions. The tree frog Dendropsophus minutus (Anura: Hylidae) is a nominal species, continentally distributed in South America, that may represent a complex of multiple species, each with a more limited distribution. To understand the spatial pattern of molecular diversity throughout the range of this species complex, we obtained DNA sequence data from two mitochondrial genes, cytochrome oxidase I (COI) and the 16S rhibosomal gene (16S) for 407 samples of D. minutus and closely related species distributed across eleven countries, effectively comprising the entire range of the group. We performed phylogenetic and spatially explicit phylogeographic analyses to assess the genetic structure of lineages and infer ancestral areas. We found 43 statistically supported, deep mitochondrial lineages, several of which may represent currently unrecognized distinct species. One major clade, containing 25 divergent lineages, includes samples from the type locality of D. minutus. We defined that clade as the D. minutus complex. The remaining lineages together with the D. minutus complex constitute the D. minutus species group. Historical analyses support an Amazonian origin for the D. minutus species group with a subsequent dispersal to eastern Brazil where the D. minutus complex originated. According to our dataset, a total of eight mtDNA lineages have ranges >100,000 km2. One of them occupies an area of almost one million km2 encompassing multiple biomes. Our results, at a spatial scale and resolution unprecedented for a Neotropical vertebrate, confirm that widespread amphibian species occur in lowland South America, yet at the same time a large proportion of cryptic diversity still remains to be discovered

    RegExpBlasting (REB), a Regular Expression Blasting algorithm based on multiply aligned sequences

    Get PDF
    Background: One of the most frequent uses of bioinformatics tools concerns functional characterization of a newly produced nucleotide sequence (a query sequence) by applying Blast or FASTA against a set of sequences (the subject sequences). However, in some specific contexts, it is useful to compare the query sequence against a cluster such as a MultiAlignment (MA). We present here the RegExpBlasting (REB) algorithm, which compares an unclassified sequence with a dataset of patterns defined by application of Regular Expression rules to a given-as-input MA datasets. The REB algorithm workflow consists in i. the definition of a dataset of multialignments ii. the association of each MA to a pattern, defined by application of regular expression rules; iii. automatic characterization of a submitted biosequence according to the function of the sequences described by the pattern best matching the query sequence. Results: An application of this algorithm is used in the "characterize your sequence" tool available in the PPNEMA resource. PPNEMA is a resource of Ribosomal Cistron sequences from various species, grouped according to nematode genera. It allows the retrieval of plant nematode multialigned sequences or the classification of new nematode rDNA sequences by applying REB. The same algorithm also supports automatic updating of the PPNEMA database. The present paper gives examples of the use of REB within PPNEMA. Conclusion: The use of REB in PPNEMA updating, the PPNEMA "characterize your sequence" option clearly demonstrates the power of the method. Using REB can also rapidly solve any other bioinformatics problem, where the addition of a new sequence to a pre-existing cluster is required. The statistical tests carried out here show the powerful flexibility of the method
    corecore