
    An integrated map of structural variation in 2,504 human genomes

    Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association. © 2015 Macmillan Publishers Limited. All rights reserved

    Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel

    A major use of the 1000 Genomes Project (1000GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants. © 2014 Macmillan Publishers Limited. All rights reserved

    HRAS1 and LASS1 with APOE are associated with human longevity and healthy aging

    The search for longevity-determining genes in humans has largely neglected the operation of genetic interactions. We have identified a novel combination of common variants of three genes that has a marked association with human lifespan and healthy aging. Subjects were recruited and stratified according to their genetically inferred ethnic affiliation to account for population structure. Haplotype analysis was performed in three candidate genes, and the haplotype combinations were tested for association with exceptional longevity. An HRAS1 haplotype enhanced the effect of an APOE haplotype on exceptional survival, and a LASS1 haplotype further augmented its magnitude. These results were replicated in a second population. A profile of healthy aging was developed using a deficit accumulation index, which showed that this combination of gene variants is associated with healthy aging. The variation in LASS1 is functional, causing enhanced expression of the gene, and it contributes to healthy aging and greater survival in the tenth decade of life. Thus, rare gene variants need not be invoked to explain complex traits such as aging; instead, a rare congruence of common gene variants readily fulfills this role. The interaction between the three genes described here suggests new models for cellular and molecular mechanisms underlying exceptional survival and healthy aging that involve lipotoxicity.

    Rates and patterns of great ape retrotransposition

    We analyzed 83 fully sequenced great ape genomes for mobile element insertions, predicting a total of 49,452 fixed and polymorphic Alu and long interspersed element 1 (L1) insertions not present in the human reference assembly and assigning each retrotransposition event to a different time point during great ape evolution. We used these homoplasy-free markers to construct a mobile element insertion-based phylogeny of humans and great apes and demonstrate their differential power to discern ape subspecies and populations. Within this context, we find a good correlation between L1 diversity and single-nucleotide polymorphism heterozygosity (r² = 0.65) in contrast to Alu repeats, which show little correlation (r² = 0.07). We estimate that the rate of Alu retrotransposition has differed by a factor of 15 in these lineages. Humans, chimpanzees, and bonobos show the highest rates of Alu accumulation, the latter two since their divergence 1.5 Mya. The L1 insertion rate, in contrast, has remained relatively constant, with rates differing by less than a factor of three. We conclude that Alu retrotransposition has been the most variable form of genetic variation during recent human–great ape evolution, with increases and decreases occurring over very short periods of evolutionary time.