27 research outputs found
An integrated map of structural variation in 2,504 human genomes
Structural variants are implicated in numerous diseases and make up the majority of varying nucleotides among human genomes. Here we describe an integrated set of eight structural variant classes comprising both balanced and unbalanced variants, which we constructed using short-read DNA sequencing data and statistically phased onto haplotype blocks in 26 human populations. Analysing this set, we identify numerous gene-intersecting structural variants exhibiting population stratification and describe naturally occurring homozygous gene knockouts that suggest the dispensability of a variety of human genes. We demonstrate that structural variants are enriched on haplotypes identified by genome-wide association studies and exhibit enrichment for expression quantitative trait loci. Additionally, we uncover appreciable levels of structural variant complexity at different scales, including genic loci subject to clusters of repeated rearrangement and complex structural variants with multiple breakpoints likely to have formed through individual mutational events. Our catalogue will enhance future studies into structural variant demography, functional impact and disease association. © 2015 Macmillan Publishers Limited. All rights reserved
Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel
A major use of the 1000 Genomes Project (1000GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants. © 2014 Macmillan Publishers Limited. All rights reserved
Recommended from our members
The Alu Yc1 subfamily: sorting the wheat from the chaff
Members of the Alu Yc1 subfamily are distinguished from the older Alu Y subfamily by a signature G→A substitution at base 148 of their 281-bp consensus sequence. Members of the much older and larger Alu Y subfamily could have by chance accumulated this signature G→A substitution and be misclassified as belonging to the Alu Yc1 subfamily. Using a Mahanalobis classification method, it was estimated that the “authentic” Alu Yc1 subfamily consists of approximately 262 members in the human genome. PCR amplification and further analysis was successfully completed on 225 of the Yc1 Alu family members. One hundred and seventy-seven Yc1 Alu elements were determined to be monomorphic (fixed for presence) in a panel of diverse human genomes. Forty-eight of the Yc1 Alu elements were polymorphic for insertion presence/absence in diverse human genomes. The insertion polymorphism rate of 21% in the human genome is similar to rates reported previously for other “young” Alu subfamilies. The polymorphic Yc1 Alu elements will be useful genetic loci for the study of human population genetics.
Transposable element (TE) display and rapid detection of TE insertion polymorphism in the Anopheles gambiae species complex
Study of the formation of branched diane epoxide oligomers at advanced stages of synthesis
HRAS1 and LASS1 with APOE are associated with human longevity and healthy aging
The search for longevity-determining genes in human has largely neglected the operation of genetic interactions. We have identified a novel combination of common variants of three genes that has a marked association with human lifespan and healthy aging. Subjects were recruited and stratified according to their genetically inferred ethnic affiliation to account for population structure. Haplotype analysis was performed in three candidate genes, and the haplotype combinations were tested for association with exceptional longevity. An HRAS1 haplotype enhanced the effect of an APOE haplotype on exceptional survival, and a LASS1 haplotype further augmented its magnitude. These results were replicated in a second population. A profile of healthy aging was developed using a deficit accumulation index, which showed that this combination of gene variants is associated with healthy aging. The variation in LASS1 is functional, causing enhanced expression of the gene, and it contributes to healthy aging and greater survival in the tenth decade of life. Thus, rare gene variants need not be invoked to explain complex traits such as aging; instead rare congruence of common gene variants readily fulfills this role. The interaction between the three genes described here suggests new models for cellular and molecular mechanisms underlying exceptional survival and healthy aging that involve lipotoxicity
Rates and patterns of great ape retrotransposition
We analyzed 83 fully sequenced great ape genomes for mobile element insertions, predicting a total of 49,452 fixed and polymorphic Alu and long interspersed element 1 (L1) insertions not present in the human reference assembly and assigning each retrotransposition event to a different time point during great ape evolution. We used these homoplasy-free markers to construct a mobile element insertions-based phylogeny of humans and great apes and demonstrate their differential power to discern ape subspecies and populations. Within this context, we find a good correlation between L1 diversity and single-nucleotide polymorphism heterozygosity (r2 =0.65) in contrast to Alu repeats, which show little correlation (r2 =0.07). We estimate that the rate of Alu retrotransposition has differed by a factor of 15-fold in these lineages. Humans, chimpanzees, and bonobos show the highest rates of Alu accumulation-the latter two since divergence 1.5 Mya. The L1 insertion rate, in contrast, has remained relatively constant, with rates differing by less than a factor of three. We conclude that Alu retrotransposition has been the most variable form of genetic variation during recent human-great ape evolution, with increases and decreases occurring over very short periods of evolutionary time