2,158 research outputs found

    Bivariate causal mixture model quantifies polygenic overlap between complex traits beyond genetic correlation.

    Get PDF
    Accumulating evidence from genome wide association studies (GWAS) suggests an abundance of shared genetic influences among complex human traits and disorders, such as mental disorders. Here we introduce a statistical tool, MiXeR, which quantifies polygenic overlap irrespective of genetic correlation, using GWAS summary statistics. MiXeR results are presented as a Venn diagram of unique and shared polygenic components across traits. At 90% of SNP-heritability explained for each phenotype, MiXeR estimates that 8.3 K variants causally influence schizophrenia and 6.4 K influence bipolar disorder. Among these variants, 6.2 K are shared between the disorders, which have a high genetic correlation. Further, MiXeR uncovers polygenic overlap between schizophrenia and educational attainment. Despite a genetic correlation close to zero, the phenotypes share 8.3 K causal variants, while 2.5 K additional variants influence only educational attainment. By considering the polygenicity, discoverability and heritability of complex phenotypes, MiXeR analysis may improve our understanding of cross-trait genetic architectures

    Dissecting the heritable risk of breast cancer:From statistical methods to susceptibility genes

    Get PDF
    Decades of research have shown that rare highly penetrant mutations can promote tumorigenesis, but it is still unclear whether variants observed at high-frequency in the broader population could modulate the risk of developing cancer. Genome-wide Association Studies (GWAS) have generated a wealth of data linking single nucleotide polymorphisms (SNPs) to increased cancer risk, but the effect of these mutations are usually subtle, leaving most of cancer heritability unexplained. Understanding the role of high-frequency mutations in cancer can provide new intervention points for early diagnostics, patient stratification and treatment in malignancies with high prevalence, such as breast cancer. Here we review state-of-the-art methods to study cancer heritability using GWAS data and provide an updated map of breast cancer susceptibility loci at the SNP and gene level

    Allelic expression mapping across cellular lineages to establish impact of non-coding SNPs

    Get PDF
    This is an open access article under the terms of the Creative Commons Attribution 4.0 License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited

    Second-generation PLINK: rising to the challenge of larger and richer datasets

    Get PDF
    PLINK 1 is a widely used open-source C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics. However, the steady accumulation of data from imputation and whole-genome sequencing studies has exposed a strong need for even faster and more scalable implementations of key functions. In addition, GWAS and population-genetic data now frequently contain probabilistic calls, phase information, and/or multiallelic variants, none of which can be represented by PLINK 1's primary data format. To address these issues, we are developing a second-generation codebase for PLINK. The first major release from this codebase, PLINK 1.9, introduces extensive use of bit-level parallelism, O(sqrt(n))-time/constant-space Hardy-Weinberg equilibrium and Fisher's exact tests, and many other algorithmic improvements. In combination, these changes accelerate most operations by 1-4 orders of magnitude, and allow the program to handle datasets too large to fit in RAM. This will be followed by PLINK 2.0, which will introduce (a) a new data format capable of efficiently representing probabilities, phase, and multiallelic variants, and (b) extensions of many functions to account for the new types of information. The second-generation versions of PLINK will offer dramatic improvements in performance and compatibility. For the first time, users without access to high-end computing resources can perform several essential analyses of the feature-rich and very large genetic datasets coming into use.Comment: 2 figures, 1 additional fil

    Rare and common epilepsies converge on a shared gene regulatory network providing opportunities for novel antiepileptic drug discovery

    Get PDF
    Background The relationship between monogenic and polygenic forms of epilepsy is poorly understood, and the extent to which the genetic and acquired epilepsies share common pathways is unclear. Here, we use an integrated systems-level analysis of brain gene expression data to identify molecular networks disrupted in epilepsy. Results We identify a co-expression network of 320 genes (M30), which is significantly enriched for non-synonymous de novo mutations ascertained from patients with monogenic epilepsy, and for common variants associated with polygenic epilepsy. The genes in M30 network are expressed widely in the human brain under tight developmental control, and encode physically interacting proteins involved in synaptic processes. The most highly connected proteins within M30 network are preferentially disrupted by deleterious de novo mutations for monogenic epilepsy, in line with the centrality-lethality hypothesis. Analysis of M30 expression revealed consistent down-regulation in the epileptic brain in heterogeneous forms of epilepsy including human temporal lobe epilepsy, a mouse model of acquired temporal lobe epilepsy, and a mouse model of monogenic Dravet (SCN1A) disease. These results suggest functional disruption of M30 via gene mutation or altered expression as a convergent mechanism regulating susceptibility to epilepsy broadly. Using the large collection of drug-induced gene expression data from Connectivity Map, several drugs were predicted to preferentially restore the down-regulation of M30 in epilepsy toward health, most notably valproic acid, whose effect on M30 expression was replicated in neurons. Conclusions Taken together, our results suggest targeting the expression of M30 as a potential new therapeutic strategy in epilepsy
    corecore