189 research outputs found

    CollapsABEL: An R library for detecting compound heterozygote alleles in genome-wide association studies

    Get PDF
    Background: Compound Heterozygosity (CH) in classical genetics is the presence of two different recessive mutations at a particular gene locus. A relaxed form of CH alleles may account for an essential proportion of the missing heritability, i.e. heritability of phenotypes so far not accounted for by single genetic variants. Methods to detect CH-like effects in genome-wide association studies (GWAS) may facilitate explaining the missing heritability, but to our knowledge no viable software tools for this purpose are currently available. Results: In this work we present the Generalized Compound Double Heterozygosity (GCDH) test and its implementation in the R package CollapsABEL. Time-consuming procedures are optimized for computational efficiency using Java or C++. Intermediate results are stored either in an SQL database or in a so-called big.matrix file to achieve reasonable memory footprint. Our large scale simulation studies show that GCDH is capable of discovering genetic associations due to CH-like interactions with much higher power than a conventional single-SNP approach under various settings, whether the causal genetic variations are available or not. CollapsABEL provides a user-friendly pipeline for genotype collapsing, statistical testing, power estimation, type I error control and graphics generation in the R language. Conclusions: CollapsABEL provides a computationally efficient solution for screening general forms of CH alleles in densely imputed microarray or whole genome sequencing datasets. The GCDH test provides an improved power over single-SNP based methods in detecting the prevalence of CH in human complex phenotypes, offering an opportunity for tackling the missing heritability problem. Binary and source packages of CollapsABEL are available on CRAN (https://cran.r-project.org/web/packages/CollapsABEL) and the website of the GenABEL project (http://www.genabel.org/packages)

    Метафорична картина світу та її місце у системі світів

    Get PDF
    Статья посвящается исследованию понятия метафорической картины мира, целесообразность выделения которой автор объясняет тем, что по аналогии с языковой и концептуальной картинами мира, термин "метафорическая картина мира" содержит информацию о сложной структуре многосмысловых значений, которые в силу своей метафорической природе гармонически объединяются.У статті йдеться про поняття метафоричної картини світу, доцільність виділення якої авторка пояснює тим, що за аналогією до мовної й концептуальної картин світу, термін "метафорична картина світу" вміщує інформацію про складну структуру багатосмислових значень, що завдяки своїй метафоричній природі гармонійно поєднуються.The article deals with the notion of metaphorical world picture connected with the general principle of conceptualization. The term "metaphorical world picture" consists of a complex structure of various meanings harmonically combined due to their metaphorical nature

    Insight in Genome-Wide Association of Metabolite Quantitative Traits by Exome Sequence Analyses

    Get PDF
    Metabolite quantitative traits carry great promise for epidemiological studies, and their genetic background has been addressed using Genome-Wide Association Studies (GWAS). Thus far, the role of less common variants has not been exhaustively studied. Here, we set out a GWAS for metabolite quantitative traits in serum, followed by exome sequence analysis to zoom in on putative causal variants in the associated genes. 1H Nuclear Magnetic Resonance (1H-NMR) spectroscopy experiments yielded successful quantification of 42 unique metabolites in 2,482 individuals from The Erasmus Rucphen Family (ERF) study. Heritability of metabolites were estimated by SOLAR. GWAS was performed by linear mixed models, using HapMap imputations. Based on physical vicinity and pathway analyses, candidate genes were screened for coding region variation using exome sequence data. Heritability estimates for metabolites ranged between 10% and 52%. GWAS replicated three known loci in the metabolome wide significance: CPS1 with glycine (P-value  = 1.27×10−32), PRODH with proline (P-value  = 1.11×10−19), SLC16A9 with carnitine level (P-value  = 4.81×10−14) and uncovered a novel association between DMGDH and dimethyl-glycine (P-value  = 1.65×10−19) level. In addition, we found three novel, suggestively significant loci: TNP1 with pyruvate (P-value  = 1.26×10−8), KCNJ16 with 3-hydroxybutyrate (P-value  = 1.65×10−8) and 2p12 locus with valine (P-value  = 3.49×10−8). Exome sequence analysis identified potentially causal coding and regulatory variants located in the genes CPS1, KCNJ2 and PRODH, and revealed allelic heterogeneity for CPS1 and PRODH. Combined GWAS and exome analyses of metabolites detected by high-resolution 1H-NMR is a robust approach to uncover metabolite quantitative trait loci (mQTL), and the likely causative variants in these loci. It is anticipated that insight in the genetics of intermediate phenotypes will provide additional insight into the genetics of complex traits

    Omics\u27 biomarkers associated with chronic low back pain: Protocol of a retrospective longitudinal study

    Get PDF
    Introduction Chronic low back pain (CLBP) produces considerable direct costs as well as indirect burdens for society, industry and health systems. CLBP is characterised by heterogeneity, inclusion of several pain syndromes, different underlying molecular pathologies and interaction with psychosocial factors that leads to a range of clinical manifestations. There is still much to understand in the underlying pathological processes and the non-psychosocial factors which account for differences in outcomes. Biomarkers that may be objectively used for diagnosis and personalised, targeted and cost-effective treatment are still lacking. Therefore, any data that may be obtained at the-omics\u27 level (glycomics, Activomics and genome-wide association studies-GWAS) may be helpful to use as dynamic biomarkers for elucidating CLBP pathogenesis and may ultimately provide prognostic information too. By means of a retrospective, observational, case-cohort, multicentre study, we aim to investigate new promising biomarkers potentially able to solve some of the issues related to CLBP. Methods and analysis The study follows a two-phase, 1:2 case-control model. A total of 12 000 individuals (4000 cases and 8000 controls) will be enrolled; clinical data will be registered, with particular attention to pain characteristics and outcomes of pain treatments. Blood samples will be collected to perform-omics studies. The primary objective is to recognise genetic variants associated with CLBP; secondary objectives are to study glycomics and Activomics profiles associated with CLBP. Ethics and dissemination The study is part of the PainOMICS project funded by European Community in the Seventh Framework Programme. The study has been approved from competent ethical bodies and copies of approvals were provided to the European Commission before starting the study. Results of the study will be reviewed by the Scientific Board and Ethical Committee of the PainOMICS Consortium. The scientific results will be disseminated through peer-reviewed journals. Trial registration number NCT02037789; Pre-results

    Improved imputation quality of low-frequency and rare variants in European samples using the 'Genome of the Netherlands'

    Get PDF
    Although genome-wide association studies (GWAS) have identified many common variants associated with complex traits, low-frequency and rare variants have not been interrogated in a comprehensive manner. Imputation from dense reference panels, such as the 1000 Genomes Project (1000G), enables testing of ungenotyped variants for association. Here we present the results of imputation using a large, new population-specific panel: the Genome of The Netherlands (GoNL). We benchmarked the performance of the 1000G and GoNL reference sets by comparing imputation genotypes with 'true' genotypes typed on ImmunoChip in three European populations (Dutch, British, and Italian). GoNL showed significant improvement in the imputation quality for rare variants (MAF 0.05-0.5%) compared with 1000G. In Dutch samples, the mean observed Pearson correlation, r 2, increased from 0.61 to 0.71. W

    The challenges of genome-wide interaction studies: Lessons to learn from the analysis of HDL blood levels

    Get PDF
    Genome-wide association studies (GWAS) have revealed 74 single nucleotide polymorphisms (SNPs) associated with high-density lipoprotein cholesterol (HDL) blood levels. This study is, to our knowledge, the first genome-wide interaction study (GWIS) to identify SNP6SNP interactions associated with HDL levels. We performed a GWIS in the Rotterdam Study (RS) cohort I (RS-I) using the GLIDE tool which leverages the massively parallel computing power of Graphics Processing Units (GPUs) to perform linear regression on all genome-wide pairs of SNPs. By performing a meta-analysis together with Rotterdam Study cohorts II and III (RS-II and RS-III), we were able to filter 181 interaction terms with a p-value, 1 · 1028 that replicated in the two independent cohorts. We were not able to replicate any of these interaction term in the AGES, ARIC, CHS, ERF, FHS and NFBC-66 cohorts (Ntotal = 30, 011) when adjusting for multiple testing. Our GWIS resulted in the consistent finding of a possible interaction between rs774801 in ARMC8 (ENSG00000114098) and rs12442098 in SPATA8 (ENSG00000185594) being associated with HDL levels. However, p-values do not reach the preset Bonferroni correction of the p-values. Our study suggest that even for highly genetically determined traits such as HDL the sample sizes needed to detect SNP6SNP interactions are large and the 2-step filtering approaches do not yield a solution. Here we present our analysis plan and our reservations concerning GWIS

    <Book Reviews> Ingemar Fagerlind and Lawrence J Saha Education and National Development : A Comparative Perspective

    Get PDF
    textabstractVarious modeling methods have been proposed to estimate the potential predictive ability of polygenic risk variants that predispose to various common diseases. However, it is unknown whether differences between them affect their conclusions on predictive ability. We reviewed input parameters, assumptions and output of the five most common methods and compared their estimates of the area under the receiver operating characteristic (ROC) curve (AUC) using hypothetical data representing effect sizes and frequencies of genetic variants, population disease risk and number of variants. To assess the accuracy of the estimated AUCs, we aimed to reproduce the AUCs of published empirical studies. All methods assumed that the combined effect of genetic variants on disease risk followed a multiplicative risk model of independent genetic effects, but they either assumed per allele, per genotype or dominant/recessive effects for the genetic variants. Modeling strategy and input parameters differed. Methods used simulation analysis or analytical formulas with effect sizes quantified by odds ratios (ORs) or relative risks. Estimated AUC values were similar for lower ORs (0.7) due to variants with strong effects, differences in estimated AUCs between methods increased. The simulation methods accurately reproduced the AUC values of empirical studies, but the analytical methods did not. We conclude that despite differences in input parameters, the modeling methods estimate similar AUC for realistic values of the ORs. When one or more variants have stronger effects and AUC values are higher, the simulation methods tend to be more accurate
    corecore