335 research outputs found

    A Gene-Based Association Method for Mapping Traits Using Reference Transcriptome Data

    Get PDF
    Genome-wide association studies (GWAS) have identified thousands of variants robustly associated with complex traits. However, the biological mechanisms underlying these associations are, in general, not well understood. We propose a gene-based association method called PrediXcan that directly tests the molecular mechanisms through which genetic variation affects phenotype. The approach estimates the component of gene expression determined by an individual’s genetic profile and correlates ‘imputed’ gene expression with the phenotype under investigation to identify genes involved in the etiology of the phenotype. Genetically regulated gene expression is estimated using whole-genome tissue-dependent prediction models trained with reference transcriptome data sets. PrediXcan enjoys the benefits of gene-based approaches such as reduced multiple-testing burden and a principled approach to the design of follow-up experiments. Our results demonstrate that PrediXcan can detect known and new genes associated with disease traits and provide insights into the mechanism of these associations

    Modified penetrance of coding variants by cis-regulatory variation contributes to disease risk

    Get PDF
    Coding variants represent many of the strongest associations between genotype and phenotype; however, they exhibit interindividual differences in effect, termed 'variable penetrance'. Here, we study how cis-regulatory variation modifies the penetrance of coding variants. Using functional genomic and genetic data from the Genotype-Tissue Expression Project (GTEx), we observed that in the general population, purifying selection has depleted haplotype combinations predicted to increase pathogenic coding variant penetrance. Conversely, in cancer and autism patients, we observed an enrichment of penetrance increasing haplotype configurations for pathogenic variants in disease-implicated genes, providing evidence that regulatory haplotype configuration of coding variants affects disease risk. Finally, we experimentally validated this model by editing a Mendelian single-nucleotide polymorphism (SNP) using CRISPR/Cas9 on distinct expression haplotypes with the transcriptome as a phenotypic readout. Our results demonstrate that joint regulatory and coding variant effects are an important part of the genetic architecture of human traits and contribute to modified penetrance of disease-causing variants.Peer reviewe

    Next-gen sequencing identifies non-coding variation disrupting miRNA-binding sites in neurological disorders

    Get PDF
    Understanding the genetic factors underlying neurodevelopmental and neuropsychiatric disorders is a major challenge given their prevalence and potential severity for quality of life. While large-scale genomic screens have made major advances in this area, for many disorders the genetic underpinnings are complex and poorly understood. To date the field has focused predominantly on protein coding variation, but given the importance of tightly controlled gene expression for normal brain development and disorder, variation that affects non-coding regulatory regions of the genome is likely to play an important role in these phenotypes. Herein we show the importance of 3 prime untranslated region (3'UTR) non-coding regulatory variants across neurodevelopmental and neuropsychiatric disorders. We devised a pipeline for identifying and functionally validating putatively pathogenic variants from next generation sequencing (NGS) data. We applied this pipeline to a cohort of children with severe specific language impairment (SLI) and identified a functional, SLI-associated variant affecting gene regulation in cells and post-mortem human brain. This variant and the affected gene (ARHGEF39) represent new putative risk factors for SLI. Furthermore, we identified 3'UTR regulatory variants across autism, schizophrenia and bipolar disorder NGS cohorts demonstrating their impact on neurodevelopmental and neuropsychiatric disorders. Our findings show the importance of investigating non-coding regulatory variants when determining risk factors contributing to neurodevelopmental and neuropsychiatric disorders. In the future, integration of such regulatory variation with protein coding changes will be essential for uncovering the genetic causes of complex neurological disorders and the fundamental mechanisms underlying health and disease

    Joint sequencing of human and pathogen genomes reveals the genetics of pneumococcal meningitis.

    Get PDF
    Streptococcus pneumoniae is a common nasopharyngeal colonizer, but can also cause life-threatening invasive diseases such as empyema, bacteremia and meningitis. Genetic variation of host and pathogen is known to play a role in invasive pneumococcal disease, though to what extent is unknown. In a genome-wide association study of human and pathogen we show that human variation explains almost half of variation in susceptibility to pneumococcal meningitis and one-third of variation in severity, identifying variants in CCDC33 associated with susceptibility. Pneumococcal genetic variation explains a large amount of invasive potential (70%), but has no effect on severity. Serotype alone is insufficient to explain invasiveness, suggesting other pneumococcal factors are involved in progression to invasive disease. We identify pneumococcal genes involved in invasiveness including pspC and zmpD, and perform a human-bacteria interaction analysis. These genes are potential candidates for the development of more broadly-acting pneumococcal vaccines

    Integrative analysis of omics summary data reveals putative mechanisms underlying complex traits

    Get PDF
    The identification of genes and regulatory elements underlying the associations discovered by GWAS is essential to understanding the aetiology of complex traits (including diseases). Here, we demonstrate an analytical paradigm of prioritizing genes and regulatory elements at GWAS loci for follow-up functional studies. We perform an integrative analysis that uses summary-level SNP data from multi-omics studies to detect DNA methylation (DNAm) sites associated with gene expression and phenotype through shared genetic effects (i.e., pleiotropy). We identify pleiotropic associations between 7858 DNAm sites and 2733 genes. These DNAm sites are enriched in enhancers and promoters, and >40% of them are mapped to distal genes. Further pleiotropic association analyses, which link both the methylome and transcriptome to 12 complex traits, identify 149 DNAm sites and 66 genes, indicating a plausible mechanism whereby the effect of a genetic variant on phenotype is mediated by genetic regulation of transcription through DNAm

    Expression QTLs Mapping and Analysis: A Bayesian Perspective.

    Get PDF
    The aim of expression Quantitative Trait Locus (eQTL) mapping is the identification of DNA sequence variants that explain variation in gene expression. Given the recent yield of trait-associated genetic variants identified by large-scale genome-wide association analyses (GWAS), eQTL mapping has become a useful tool to understand the functional context where these variants operate and eventually narrow down functional gene targets for disease. Despite its extensive application to complex (polygenic) traits and disease, the majority of eQTL studies still rely on univariate data modeling strategies, i.e., testing for association of all transcript-marker pairs. However these "one at-a-time" strategies are (1) unable to control the number of false-positives when an intricate Linkage Disequilibrium structure is present and (2) are often underpowered to detect the full spectrum of trans-acting regulatory effects. Here we present our viewpoint on the most recent advances on eQTL mapping approaches, with a focus on Bayesian methodology. We review the advantages of the Bayesian approach over frequentist methods and provide an empirical example of polygenic eQTL mapping to illustrate the different properties of frequentist and Bayesian methods. Finally, we discuss how multivariate eQTL mapping approaches have distinctive features with respect to detection of polygenic effects, accuracy, and interpretability of the results

    The protocadherin 17 gene affects cognition, personality, amygdala structure and function, synapse development and risk of major mood disorders

    Get PDF
    Major mood disorders, which primarily include bipolar disorder and major depressive disorder, are the leading cause of disability worldwide and pose a major challenge in identifying robust risk genes. Here, we present data from independent large-scale clinical data sets (including 29 557 cases and 32 056 controls) revealing brain expressed protocadherin 17 (PCDH17) as a susceptibility gene for major mood disorders. Single-nucleotide polymorphisms (SNPs) spanning the PCDH17 region are significantly associated with major mood disorders; subjects carrying the risk allele showed impaired cognitive abilities, increased vulnerable personality features, decreased amygdala volume and altered amygdala function as compared with non-carriers. The risk allele predicted higher transcriptional levels of PCDH17 mRNA in postmortem brain samples, which is consistent with increased gene expression in patients with bipolar disorder compared with healthy subjects. Further, overexpression of PCDH17 in primary cortical neurons revealed significantly decreased spine density and abnormal dendritic morphology compared with control groups, which again is consistent with the clinical observations of reduced numbers of dendritic spines in the brains of patients with major mood disorders. Given that synaptic spines are dynamic structures which regulate neuronal plasticity and have crucial roles in myriad brain functions, this study reveals a potential underlying biological mechanism of a novel risk gene for major mood disorders involved in synaptic function and related intermediate phenotypes

    Genome-wide association study across European and African American ancestries identifies a SNP in DNMT3B contributing to nicotine dependence

    Get PDF
    Cigarette smoking is a leading cause of preventable mortality worldwide. Nicotine dependence, which reduces the likelihood of quitting smoking, is a heritable trait with firmly established associations with sequence variants in nicotine acetylcholine receptor genes and at other loci. To search for additional loci, we conducted a genome-wide association study (GWAS) meta-analysis of nicotine dependence, totaling 38,602 smokers (28,677 Europeans/European Americans and 9925 African Americans) across 15 studies. In this largest-ever GWAS meta-analysis for nicotine dependence and the largest-ever cross-ancestry GWAS meta-analysis for any smoking phenotype, we reconfirmed the well-known CHRNA5-CHRNA3-CHRNB4 genes and further yielded a novel association in the DNA methyltransferase gene DNMT3B. The intronic DNMT3B rs910083-C allele (frequency = 44-77%) was associated with increased risk of nicotine dependence at P = 3.7 x 10(-8) (odds ratio (OR) = 1.06 and 95% confidence interval (CI) = 1.04-1.07 for severe vs mild dependence). The association was independently confirmed in the UK Biobank (N = 48,931) using heavy vs never smoking as a proxy phenotype (P = 3.6 x 10(-4), OR = 1.05, and 95% CI = 1.02-1.08). Rs910083-C is also associated with increased risk of squamous cell lung carcinoma in the International Lung Cancer Consortium (N = 60,586, meta-analysis P = 0.0095, OR = 1.05, and 95% CI = 1.01-1.09). Moreover, rs910083-C was implicated as a cis-methylation quantitative trait locus (QTL) variant associated with higher DNMT3B methylation in fetal brain (N = 166, P = 2.3 x 10(-26)) and a cis-expression QTL variant associated with higher DNMT3B expression in adult cerebellum from the Genotype-Tissue Expression project (N = 103, P = 3.0 x 10(-6)) and the independent Brain eQTL Almanac (N = 134, P = 0.028). This novel DNMT3B cis-acting QTL variant highlights the importance of genetically influenced regulation in brain on the risks of nicotine dependence, heavy smoking and consequent lung cancer.Peer reviewe

    Genome-wide analysis of differential transcriptional and epigenetic variability across human immune cell types

    Get PDF
    Abstract Background A healthy immune system requires immune cells that adapt rapidly to environmental challenges. This phenotypic plasticity can be mediated by transcriptional and epigenetic variability. Results We apply a novel analytical approach to measure and compare transcriptional and epigenetic variability genome-wide across CD14+CD16− monocytes, CD66b+CD16+ neutrophils, and CD4+CD45RA+ naïve T cells from the same 125 healthy individuals. We discover substantially increased variability in neutrophils compared to monocytes and T cells. In neutrophils, genes with hypervariable expression are found to be implicated in key immune pathways and are associated with cellular properties and environmental exposure. We also observe increased sex-specific gene expression differences in neutrophils. Neutrophil-specific DNA methylation hypervariable sites are enriched at dynamic chromatin regions and active enhancers. Conclusions Our data highlight the importance of transcriptional and epigenetic variability for the key role of neutrophils as the first responders to inflammatory stimuli. We provide a resource to enable further functional studies into the plasticity of immune cells, which can be accessed from: http://blueprint-dev.bioinfo.cnio.es/WP10/hypervariability
    corecore