45 research outputs found

    Evolutionary algorithms for the selection of single nucleotide polymorphisms

    Get PDF
    BACKGROUND: Large databases of single nucleotide polymorphisms (SNPs) are available for use in genomics studies. Typically, investigators must choose a subset of SNPs from these databases to employ in their studies. The choice of subset is influenced by many factors, including estimated or known reliability of the SNP, biochemical factors, intellectual property, cost, and effectiveness of the subset for mapping genes or identifying disease loci. We present an evolutionary algorithm for multiobjective SNP selection. RESULTS: We implemented a modified version of the Strength-Pareto Evolutionary Algorithm (SPEA2) in Java. Our implementation, Multiobjective Analyzer for Genetic Marker Acquisition (MAGMA), approximates the set of optimal trade-off solutions for large problems in minutes. This set is very useful for the design of large studies, including those oriented towards disease identification, genetic mapping, population studies, and haplotype-block elucidation. CONCLUSION: Evolutionary algorithms are particularly suited for optimization problems that involve multiple objectives and a complex search space on which exact methods such as exhaustive enumeration cannot be applied. They provide flexibility with respect to the problem formulation if a problem description evolves or changes. Results are produced as a trade-off front, allowing the user to make informed decisions when prioritizing factors. MAGMA is open source and available at . Evolutionary algorithms are well suited for many other applications in genomics

    PiggyBac-ing on a Primate Genome: Novel Elements, Recent Activity and Horizontal Transfer

    Get PDF
    To better understand the extent of Class II transposable element activity in mammals, we investigated the mouse lemur, Microcebus murinus, whole genome shotgun (2X) draft assembly. Analysis of this strepsirrhine primate extended previous research that targeted anthropoid primates and found no activity within the last 37 Myr. We tested the hypothesis that members of the piggyBac Class II superfamily have been inactive in the strepsirrhine lineage of primates during the same period. Evidence against this hypothesis was discovered in the form of three nonautonomous piggyBac elements with activity periods within the past 40 Myr and possibly into the very recent past. In addition, a novel family of piggyBac transposons was identified, suggesting introduction via horizontal transfer. A second autonomous element was also found with high similarity to an element recently described from the little brown bat, Myotis lucifugus, further implicating horizontal transfer in the evolution of this genome. These findings indicate a more complex history of transposon activity in mammals rather than a uniform shutdown of Class II transposition, which had been suggested by analyses of more common model organisms

    RepeatModeler2 for automated genomic discovery of transposable element families.

    No full text
    The accelerating pace of genome sequencing throughout the tree of life is driving the need for improved unsupervised annotation of genome components such as transposable elements (TEs). Because the types and sequences of TEs are highly variable across species, automated TE discovery and annotation are challenging and time-consuming tasks. A critical first step is the de novo identification and accurate compilation of sequence models representing all of the unique TE families dispersed in the genome. Here we introduce RepeatModeler2, a pipeline that greatly facilitates this process. This program brings substantial improvements over the original version of RepeatModeler, one of the most widely used tools for TE discovery. In particular, this version incorporates a module for structural discovery of complete long terminal repeat (LTR) retroelements, which are widespread in eukaryotic genomes but recalcitrant to automated identification because of their size and sequence complexity. We benchmarked RepeatModeler2 on three model species with diverse TE landscapes and high-quality, manually curated TE libraries

    A call for benchmarking transposable element annotation methods

    No full text
    DNA derived from transposable elements (TEs) constitutes large parts of the genomes of complex eukaryotes, with major impacts not only on genomic research but also on how organisms evolve and function. Although a variety of methods and tools have been developed to detect and annotate TEs, there are as yet no standard benchmarks - that is, no standard way to measure or compare their accuracy. This lack of accuracy assessment calls into question conclusions from a wide range of research that depends explicitly or implicitly on TE annotation. In the absence of standard benchmarks, toolmakers are impeded in improving their tools, annotators cannot properly assess which tools might best suit their needs, and downstream researchers cannot judge how accuracy limitations might impact their studies. We therefore propose that the TE research community create and adopt standard TE annotation benchmarks, and we call for other researchers to join the authors in making this long-overdue effort a success

    Insights into mammalian TE diversity through the curation of 248 genome assemblies.

    No full text
    We examined transposable element (TE) content of 248 placental mammal genome assemblies, the largest de novo TE curation effort in eukaryotes to date. We found that although mammals resemble one another in total TE content and diversity, they show substantial differences with regard to recent TE accumulation. This includes multiple recent expansion and quiescence events across the mammalian tree. Young TEs, particularly long interspersed elements, drive increases in genome size, whereas DNA transposons are associated with smaller genomes. Mammals tend to accumulate only a few types of TEs at any given time, with one TE type dominating. We also found association between dietary habit and the presence of DNA transposon invasions. These detailed annotations will serve as a benchmark for future comparative TE analyses among placental mammals

    Chiropterans are a hotspot for horizontal transfer of DNA transposons in mammalia

    Get PDF
    This project was supported by the National Science Foundation (grant numbers DEB 1838283 and IOS 2032006 to D.M.-S. and Dav.R. and DEB 1838273 and DGE 1633299 to L.D.), National Institutes of Health (grant numbers R01HG002939 and U24HG010136 to J.S., R.H., A.F.A.S., and Je.R.), NHGRI (grant number R01HG008742 to Z.C.), Irish Research Council (grant number IRCLA/ 2017/58 to E.T.), Science Foundation Ireland (grant number 19/FFP/6790 to E.T.), Max Planck Research Group awarded by the Max Planck Gesellschaft to S.V., Human Frontiers Science Program (grant number RGP0058/2016 to S.V.), UK Research and Innovation (grant number MR/T021985/1 to S.V.), and the Swedish Research Council Distinguished Professor Award to K.L.-T.Horizontal transfer of transposable elements (TEs) is an important mechanism contributing to genetic diversity and innovation. Bats (order Chiroptera) have repeatedly been shown to experience horizontal transfer of TEs at what appears to be a high rate compared with other mammals. We investigated the occurrence of horizontally transferred (HT) DNA transposons involving bats. We found over 200 putative HT elements within bats; 16 transposons were shared across distantly related mammalian clades, and 2 other elements were shared with a fish and two lizard species. Our results indicate that bats are a hotspot for horizontal transfer of DNA transposons. These events broadly coincide with the diversification of several bat clades, supporting the hypothesis that DNA transposon invasions have contributed to genetic diversification of bats.Publisher PDFPeer reviewe
    corecore