19 research outputs found

    Discovery of large genomic inversions using long range information

    Get PDF
    Background: Although many algorithms are now available that aim to characterize different classes of structural variation, discovery of balanced rearrangements such as inversions remains an open problem. This is mainly due to the fact that breakpoints of such events typically lie within segmental duplications or common repeats, which reduces the mappability of short reads. The algorithms developed within the 1000 Genomes Project to identify inversions are limited to relatively short inversions, and there are currently no available algorithms to discover large inversions using high throughput sequencing technologies. Results: Here we propose a novel algorithm, Valor, to discover large inversions using new sequencing methods that provide long range information such as 10X Genomics linked-read sequencing, pooled clone sequencing, or other similar technologies that we commonly refer to as long range sequencing. We demonstrate the utility of Valor using both pooled clone sequencing and 10X Genomics linked-read sequencing generated from the genome of an individual from the HapMap project (NA12878). We also provide a comprehensive comparison of Valor against several state-of-the-art structural variation discovery algorithms that use whole genome shotgun sequencing data. Conclusions: In this paper, we show that Valor is able to accurately discover all previously identified and experimentally validated large inversions in the same genome with a low false discovery rate. Using Valor, we also predicted a novel inversion, which we validated using fluorescent in situ hybridization. Valor is available at https://github.com/BilkentCompGen/Valor. © 2017 The Author(s)

    The western painted turtle genome, a model for the evolution of extreme physiological adaptations in a slowly evolving lineage

    Get PDF
    Background: We describe the genome of the western painted turtle, Chrysemys picta bellii, one of the most widespread, abundant, and well-studied turtles. We place the genome into a comparative evolutionary context, and focus on genomic features associated with tooth loss, immune function, longevity, sex differentiation and determination, and the species' physiological capacities to withstand extreme anoxia and tissue freezing.Results: Our phylogenetic analyses confirm that turtles are the sister group to living archosaurs, and demonstrate an extraordinarily slow rate of sequence evolution in the painted turtle. The ability of the painted turtle to withstand complete anoxia and partial freezing appears to be associated with common vertebrate gene networks, and we identify candidate genes for future functional analyses. Tooth loss shares a common pattern of pseudogenization and degradation of tooth-specific genes with birds, although the rate of accumulation of mutations is much slower in the painted turtle. Genes associated with sex differentiation generally reflect phylogeny rather than convergence in sex determination functionality. Among gene families that demonstrate exceptional expansions or show signatures of strong natural selection, immune function and musculoskeletal patterning genes are consistently over-represented.Conclusions: Our comparative genomic analyses indicate that common vertebrate regulatory networks, some of which have analogs in human diseases, are often involved in the western painted turtle's extraordinary physiological capacities. As these regulatory pathways are analyzed at the functional level, the painted turtle may offer important insights into the management of a number of human health disorders

    A second Xenopus

    No full text

    The amphioxus Hox cluster: Characterization, comparative genomics, and evolution

    No full text
    The amphioxus Hox cluster is often viewed as "archetypal" for the chordate lineage. Here, we present a descriptive account of the 448 kb region spanning the Hox cluster of the amphioxus Branchiostoma floridae from Hox14 to Hox1. We provide complete coding sequences of all 14 previously described amphioxus sequences and give a detailed analysis of the conserved noncoding regulatory sequence elements. We find that the posterior part of the Hox cluster is so highly derived that even the complete genomic sequence is insufficient to decide whether the posterior Hox genes arose by independent duplications or whether they are true orthologs of the corresponding gnathostome paralog groups. In contrast, the anterior region is much better conserved. The amphioxus Hox cluster strongly excludes repetitive elements with the exception of two repeat islands in the posterior region. Repeat exclusion is also observed in gnathostomes, but not protostome Hox clusters. We thus hypothesize that the much shorter vertebrate Hox clusters are the result of extensive resolution of the redundancy of regulatory DNA after the genome duplications rather than the consequence of a selection pressure to remove nonfunctional sequence from the Hox cluster
    corecore