149 research outputs found

    The sterlet sturgeon genome sequence and the mechanisms of segmental rediploidization

    Get PDF
    Sturgeons seem to be frozen in time. The archaic characteristics of this ancient fish lineage place it in a key phylogenetic position at the base of the ~30,000 modern teleost fish species. Moreover, sturgeons are notoriously polyploid, providing unique opportunities to investigate the evolution of polyploid genomes. We assembled a high-quality chromosome-level reference genome for the sterlet, Acipenser ruthenus. Our analysis revealed a very low protein evolution rate that is at least as slow as in other deep branches of the vertebrate tree, such as that of the coelacanth. We uncovered a whole-genome duplication that occurred in the Jurassic, early in the evolution of the entire sturgeon lineage. Following this polyploidization, the rediploidization of the genome included the loss of whole chromosomes in a segmental deduplication process. While known adaptive processes helped conserve a high degree of structural and functional tetraploidy over more than 180 million years, the reduction of redundancy of the polyploid genome seems to have been remarkably random

    The piranha genome provides molecular insight associated to its unique feeding behavior

    Get PDF
    The piranha enjoys notoriety due to its infamous predatory behavior but much is still not understood about its evolutionary origins and the underlying molecular mechanisms for its unusual feeding biology. We sequenced and assembled the red-bellied piranha (Pygocentrus nattereri) genome to aid future phenotypic and genetic investigations. The assembled draft genome is similar to other related fishes in repeat composition and gene count. Our evaluation of genes under positive selection suggests candidates for adaptations of piranhas’ feeding behavior in neural functions, behavior, and regulation of energy metabolism. In the fasted brain, we find genes differentially expressed that are involved in lipid metabolism and appetite regulation as well as genes that may control the aggression/boldness behavior of hungry piranhas. Our first analysis of the piranha genome offers new insight and resources for the study of piranha biology and for feeding motivation and starvation in other organisms

    INTEGRATE: Gene fusion discovery using whole genome and transcriptome data

    Get PDF
    While next-generation sequencing (NGS) has become the primary technology for discovering gene fusions, we are still faced with the challenge of ensuring that causative mutations are not missed while minimizing false positives. Currently, there are many computational tools that predict structural variations (SV) and gene fusions using whole genome (WGS) and transcriptome sequencing (RNA-seq) data separately. However, as both WGS and RNA-seq have their limitations when used independently, we hypothesize that the orthogonal validation from integrating both data could generate a sensitive and specific approach for detecting high-confidence gene fusion predictions. Fortunately, decreasing NGS costs have resulted in a growing quantity of patients with both data available. Therefore, we developed a gene fusion discovery tool, INTEGRATE, that leverages both RNA-seq and WGS data to reconstruct gene fusion junctions and genomic breakpoints by split-read mapping. To evaluate INTEGRATE, we compared it with eight additional gene fusion discovery tools using the well-characterized breast cell line HCC1395 and peripheral blood lymphocytes derived from the same patient (HCC1395BL). The predictions subsequently underwent a targeted validation leading to the discovery of 131 novel fusions in addition to the seven previously reported fusions. Overall, INTEGRATE only missed six out of the 138 validated fusions and had the highest accuracy of the nine tools evaluated. Additionally, we applied INTEGRATE to 62 breast cancer patients from The Cancer Genome Atlas (TCGA) and found multiple recurrent gene fusions including a subset involving estrogen receptor. Taken together, INTEGRATE is a highly sensitive and accurate tool that is freely available for academic use
    corecore