29 research outputs found

    The molecular basis, genetic control and pleiotropic effects of local gene co-expression.

    Get PDF
    Nearby genes are often expressed as a group. Yet, the prevalence, molecular mechanisms and genetic control of local gene co-expression are far from being understood. Here, by leveraging gene expression measurements across 49 human tissues and hundreds of individuals, we find that local gene co-expression occurs in 13% to 53% of genes per tissue. By integrating various molecular assays (e.g. ChIP-seq and Hi-C), we estimate the ability of several mechanisms, such as enhancer-gene interactions, in distinguishing gene pairs that are co-expressed from those that are not. Notably, we identify 32,636 expression quantitative trait loci (eQTLs) which associate with co-expressed gene pairs and often overlap enhancer regions. Due to affecting several genes, these eQTLs are more often associated with multiple human traits than other eQTLs. Our study paves the way to comprehend trait pleiotropy and functional interpretation of QTL and GWAS findings. All local gene co-expression identified here is available through a public database ( https://glcoex.unil.ch/ )

    Transcript- and annotation-guided genome assembly of the European starling

    Get PDF
    First published: 28 June 2022The European starling, Sturnus vulgaris, is an ecologically significant, globally invasive avian species that is also suffering from a major decline in its native range. Here, we present the genome assembly and long- read transcriptome of an Australian-sourced European starling (S. vulgaris vAU), and a second, North American, short- read genome assembly (S. vulgaris vNA), as complementary reference genomes for population genetic and evolutionary characterization. S. vulgaris vAU combined 10× genomics linked- reads, low-coverage Nanopore sequencing, and PacBio Iso-Seq full- length transcript scaffolding to generate a 1050 Mb assembly on 6222 scaffolds (7.6 Mb scaffold N50, 94.6% busco completeness). Further scaffolding against the high-quality zebra finch (Taeniopygia guttata) genome assigned 98.6% of the assembly to 32 puta-tive nuclear chromosome scaffolds. Species-specific transcript mapping and gene an-notation revealed good gene- level assembly and high functional completeness. Using S. vulgaris vAU, we demonstrate how the multifunctional use of PacBio Iso-Seq tran-script data and complementary homology-based annotation of sequential assembly steps (assessed using a new tool, saaga) can be used to assess, inform, and validate assembly workflow decisions. We also highlight some counterintuitive behaviour in traditional busco metrics, and present buscomp, a complementary tool for assembly comparison designed to be robust to differences in assembly size and base-calling quality. This work expands our knowledge of avian genomes and the available toolkit for assessing and improving genome quality. The new genomic resources presented will facilitate further global genomic and transcriptomic analysis on this ecologically important species.Katarina C. Stuart, Richard J. Edwards, Yuanyuan Cheng, Wesley C. Warren, David W. Burt, William B. Sherwin, Natalie R. Hofmeister, Scott J. Werner, Gregory F. Ball, Melissa Bateson, Matthew C. Brandley, Katherine L. Buchanan, Phillip Cassey, David F. Clayton, Tim De Meyer, Simone L. Meddle, Lee A. Rollin

    Dust in Supernovae and Supernova Remnants I : Formation Scenarios

    Get PDF
    Supernovae are considered as prime sources of dust in space. Observations of local supernovae over the past couple of decades have detected the presence of dust in supernova ejecta. The reddening of the high redshift quasars also indicate the presence of large masses of dust in early galaxies. Considering the top heavy IMF in the early galaxies, supernovae are assumed to be the major contributor to these large amounts of dust. However, the composition and morphology of dust grains formed in a supernova ejecta is yet to be understood with clarity. Moreover, the dust masses inferred from observations in mid-infrared and submillimeter wavelength regimes differ by two orders of magnitude or more. Therefore, the mechanism responsible for the synthesis of molecules and dust in such environments plays a crucial role in studying the evolution of cosmic dust in galaxies. This review summarises our current knowledge of dust formation in supernova ejecta and tries to quantify the role of supernovae as dust producers in a galaxy.Peer reviewe

    Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank.

    No full text
    Phasing involves distinguishing the two parentally inherited copies of each chromosome into haplotypes. Here, we introduce SHAPEIT5, a new phasing method that quickly and accurately processes large sequencing datasets and applied it to UK Biobank (UKB) whole-genome and whole-exome sequencing data. We demonstrate that SHAPEIT5 phases rare variants with low switch error rates of below 5% for variants present in just 1 sample out of 100,000. Furthermore, we outline a method for phasing singletons, which, although less precise, constitutes an important step towards future developments. We then demonstrate that the use of UKB as a reference panel improves the accuracy of genotype imputation, which is even more pronounced when phased with SHAPEIT5 compared with other methods. Finally, we screen the UKB data for loss-of-function compound heterozygous events and identify 549 genes where both gene copies are knocked out. These genes complement current knowledge of gene essentiality in the human genome

    Parent-of-Origin inference for biobanks.

    No full text
    Identical genetic variations can have different phenotypic effects depending on their parent of origin. Yet, studies focusing on parent-of-origin effects have been limited in terms of sample size due to the lack of parental genomes or known genealogies. We propose a probabilistic approach to infer the parent-of-origin of individual alleles that does not require parental genomes nor prior knowledge of genealogy. Our model uses Identity-By-Descent sharing with second- and third-degree relatives to assign alleles to parental groups and leverages chromosome X data in males to distinguish maternal from paternal groups. We combine this with robust haplotype inference and haploid imputation to infer the parent-of-origin for 26,393 UK Biobank individuals. We screen 99 phenotypes for parent-of-origin effects and replicate the discoveries of 6 GWAS studies, confirming signals on body mass index, type 2 diabetes, standing height and multiple blood biomarkers, including the known maternal effect at the MEG3/DLK1 locus on platelet phenotypes. We also report a novel maternal effect at the TERT gene on telomere length, thereby providing new insights on the heritability of this phenotype. All our summary statistics are publicly available to help the community to better characterize the molecular mechanisms leading to parent-of-origin effects and their implications for human health

    AlphaBeta: Computational inference of epimutation rates and spectra from high-throughput DNA methylation data in plants.

    Get PDF
    Stochastic changes in DNA methylation (i.e., spontaneous epimutations) contribute to methylome diversity in plants. Here, we describe AlphaBeta, a computational method for estimating the precise rate of such stochastic events using pedigree-based DNA methylation data as input. We demonstrate how AlphaBeta can be employed to study transgenerationally heritable epimutations in clonal or sexually derived mutation accumulation lines, as well as somatic epimutations in long-lived perennials. Application of our method to published and new data reveals that spontaneous epimutations accumulate neutrally at the genome-wide scale, originate mainly during somatic development and that they can be used as a molecular clock for age-dating trees
    corecore