733 research outputs found

    A high-resolution map of human evolutionary constraint using 29 mammals

    Get PDF
    The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ~4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ~60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease

    A quantitative framework for characterizing the evolutionary history of mammalian gene expression

    Get PDF
    The evolutionary history of a gene helps predict its function and relationship to phenotypic traits. Although sequence conservation is commonly used to decipher gene function and assess medical relevance, methods for functional inference from comparative expression data are lacking. Here, we use RNA-seq across seven tissues from 17 mammalian species to show that expression evolution across mammals is accurately modeled by the Ornstein–Uhlenbeck process, a commonly proposed model of continuous trait evolution. We apply this model to identify expression pathways under neutral, stabilizing, and directional selection. We further demonstrate novel applications of this model to quantify the extent of stabilizing selection on a gene’s expression, parameterize the distribution of each gene’s optimal expression level, and detect deleterious expression levels in expression data from individual patients. Our work provides a statistical framework for interpreting expression data across species and in disease

    Системний підхід у соціальній адаптації студентів-іноземців

    Get PDF
    SUMMARY: High-throughput genotyping and sequencing technologies facilitate studies of complex genetic traits and provide new research opportunities. The increasing popularity of genome-wide association studies (GWAS) leads to the discovery of new associated loci and a better understanding of the genetic architecture underlying not only diseases, but also other monogenic and complex phenotypes. Several softwares are available for performing GWAS analyses, R environment being one of them. RESULTS: We present cgmisc, an R package that enables enhanced data analysis and visualisation of results from GWAS. The package contains several utilities and modules that complement and enhance the functionality of the existing software. It also provides several tools for advanced visualisation of genomic data and utilises the power of the R language to aid in preparation of publication-quality figures. Some of the package functions are specific for the domestic dog (Canis familiaris) data. AVAILABILITY: The package is operating system-independent and is available from: https://github.com/cgmisc-team/cgmisc CONTACT: [email protected]

    The Naked Mole Rat Genome Resource: facilitating analyses of cancer and longevity-related adaptations

    Get PDF
    Motivation: The naked mole rat (Heterocephalus glaber) is an exceptionally long-lived and cancer-resistant rodent native to East Africa. Although its genome was previously sequenced, here we report a new assembly sequenced by us with substantially higher N50 values for scaffolds and contigs. Results: We analyzed the annotation of this new improved assembly and identified candidate genomic adaptations which may have contributed to the evolution of the naked mole rat’s extraordinary traits, including in regions of p53, and the hyaluronan receptors CD44 and HMMR (RHAMM). Furthermore, we developed a freely available web portal, the Naked Mole Rat Genome Resource (http://www.naked-mole-rat.org), featuring the data and results of our analysis, to assist researchers interested in the genome and genes of the naked mole rat, and also to facilitate further studies on this fascinating species. Availability and implementation: The Naked Mole Rat Genome Resource is freely available online at http://www.naked-mole-rat.org. This resource is open source and the source code is available at https://github.com/maglab/naked-mole-rat-portal. Contact: [email protected]

    A high-resolution map of human evolutionary constraint using 29 mammals

    Get PDF
    The comparison of related genomes has emerged as a powerful lens for genome interpretation. Here we report the sequencing and comparative analysis of 29 eutherian genomes. We confirm that at least 5.5% of the human genome has undergone purifying selection, and locate constrained elements covering ~4.2% of the genome. We use evolutionary signatures and comparisons with experimental data sets to suggest candidate functions for ~60% of constrained bases. These elements reveal a small number of new coding exons, candidate stop codon readthrough events and over 10,000 regions of overlapping synonymous constraint within protein-coding exons. We find 220 candidate RNA structural families, and nearly a million elements overlapping potential promoter, enhancer and insulator regions. We report specific amino acid residues that have undergone positive selection, 280,000 non-coding elements exapted from mobile elements and more than 1,000 primate- and human-accelerated elements. Overlap with disease-associated variants indicates that our findings will be relevant for studies of human biology, health and disease

    Canine Hereditary Ataxia in Old English Sheepdogs and Gordon Setters Is Associated with a Defect in the Autophagy Gene Encoding RAB24

    Get PDF
    Old English Sheepdogs and Gordon Setters suffer from a juvenile onset, autosomal recessive form of canine hereditary ataxia primarily affecting the Purkinje neuron of the cerebellar cortex. The clinical and histological characteristics are analogous to hereditary ataxias in humans. Linkage and genome-wide association studies on a cohort of related Old English Sheepdogs identified a region on CFA4 strongly associated with the disease phenotype. Targeted sequence capture and next generation sequencing of the region identified an A to C single nucleotide polymorphism (SNP) located at position 113 in exon 1 of an autophagy gene, RAB24, that segregated with the phenotype. Genotyping of six additional breeds of dogs affected with hereditary ataxia identified the same polymorphism in affected Gordon Setters that segregated perfectly with phenotype. The other breeds tested did not have the polymorphism. Genome-wide SNP genotyping of Gordon Setters identified a 1.9 MB region with an identical haplotype to affected Old English Sheepdogs. Histopathology, immunohistochemistry and ultrastructural evaluation of the brains of affected dogs from both breeds identified dramatic Purkinje neuron loss with axonal spheroids, accumulation of autophagosomes, ubiquitin positive inclusions and a diffuse increase in cytoplasmic neuronal ubiquitin staining. These findings recapitulate the changes reported in mice with induced neuron-specific autophagy defects. Taken together, our results suggest that a defect in RAB24, a gene associated with autophagy, is highly associated with and may contribute to canine hereditary ataxia in Old English Sheepdogs and Gordon Setters. This finding suggests that detailed investigation of autophagy pathways should be undertaken in human hereditary ataxia.American Kennel Club Canine Health Foundation (grant CHF 0407)American Kennel Club Canine Health Foundation (grant CHF 0925)Old English Sheepdog Club of AmericaTarTan Gordon Setter ClubEuropean Science Foundation (EURYI)Canine Health Information Center (DNA Repository

    FEELnc : a tool for long non-coding RNA annotation and its application to the dog transcriptome

    Get PDF
    Whole transcriptome sequencing (RNA-seq) has become a standard for cataloguing and monitoring RNA populations. One of the main bottlenecks, however, is to correctly identify the different classes of RNAs among the plethora of reconstructed transcripts, particularly those that will be translated (mRNAs) from the class of long non-coding RNAs (lncRNAs). Here, we present FEELnc (FlExible Extraction of LncRNAs), an alignment-free program that accurately annotates lncRNAs based on a Random Forest model trained with general features such as multi k-mer frequencies and relaxed open reading frames. Benchmarking versus five state-of-the-art tools shows that FEELnc achieves similar or better classification performance on GENCODE and NONCODE data sets. The program also provides specific modules that enable the user to fine-tune classification accuracy, to formalize the annotation of lncRNA classes and to identify lncRNAs even in the absence of a training set of non-coding RNAs. We used FEELnc on a real data set comprising 20 canine RNA-seq samples produced by the European LUPA consortium to substantially expand the canine genome annotation to include 10 374 novel lncRNAs and 58 640 mRNA transcripts. FEELnc moves beyond conventional coding potential classifiers by providing a standardized and complete solution for annotating lncRNAs and is freely available at https://github.com/tderrien/FEELnc.Peer reviewe

    How to make a rodent giant: Genomic basis and tradeoffs of gigantism in the capybara, the world\u27s largest rodent

    Get PDF
    Gigantism results when one lineage within a clade evolves extremely large body size relative to its small-bodied ancestors, a common phenomenon in animals. Theory predicts that the evolution of giants should be constrained by two tradeoffs. First, because body size is negatively correlated with population size, purifying selection is expected to be less efficient in species of large body size, leading to increased mutational load. Second, gigantism is achieved through generating a higher number of cells along with higher rates of cell proliferation, thus increasing the likelihood of cancer. To explore the genetic basis of gigantism in rodents and uncover genomic signatures of gigantism-related tradeoffs, we assembled a draft genome of the capybara (Hydrochoerus hydrochaeris), the world\u27s largest living rodent. We found that the genome-wide ratio of non-synonymous to synonymous mutations (omega) is elevated in the capybara relative to other rodents, likely caused by a generation-time effect and consistent with a nearly-neutral model of molecular evolution. A genome-wide scan for adaptive protein evolution in the capybara highlighted several genes controlling post-natal bone growth regulation and musculoskeletal development, which are relevant to anatomical and developmental modifications for an increase in overall body size. Capybara-specific gene-family expansions included a putative novel anticancer adaptation that involves T cell-mediated tumor suppression, offering a potential resolution to the increased cancer risk in this lineage. Our comparative genomic results uncovered the signature of an intragenomic conflict where the evolution of gigantism in the capybara involved selection on genes and pathways that are directly linked to cancer
    corecore