189 research outputs found

    Mutual Information for Testing Gene-Environment Interaction

    Get PDF
    Despite current enthusiasm for investigation of gene-gene interactions and gene-environment interactions, the essential issue of how to define and detect gene-environment interactions remains unresolved. In this report, we define gene-environment interactions as a stochastic dependence in the context of the effects of the genetic and environmental risk factors on the cause of phenotypic variation among individuals. We use mutual information that is widely used in communication and complex system analysis to measure gene-environment interactions. We investigate how gene-environment interactions generate the large difference in the information measure of gene-environment interactions between the general population and a diseased population, which motives us to develop mutual information-based statistics for testing gene-environment interactions. We validated the null distribution and calculated the type 1 error rates for the mutual information-based statistics to test gene-environment interactions using extensive simulation studies. We found that the new test statistics were more powerful than the traditional logistic regression under several disease models. Finally, in order to further evaluate the performance of our new method, we applied the mutual information-based statistics to three real examples. Our results showed that P-values for the mutual information-based statistics were much smaller than that obtained by other approaches including logistic regression models

    Evaluation of presumably disease causing SCN1A variants in a cohort of common epilepsy syndromes

    Get PDF
    Objective: The SCN1A gene, coding for the voltage-gated Na+ channel alpha subunit NaV1.1, is the clinically most relevant epilepsy gene. With the advent of high-throughput next-generation sequencing, clinical laboratories are generating an ever-increasing catalogue of SCN1A variants. Variants are more likely to be classified as pathogenic if they have already been identified previously in a patient with epilepsy. Here, we critically re-evaluate the pathogenicity of this class of variants in a cohort of patients with common epilepsy syndromes and subsequently ask whether a significant fraction of benign variants have been misclassified as pathogenic. Methods: We screened a discovery cohort of 448 patients with a broad range of common genetic epilepsies and 734 controls for previously reported SCN1A mutations that were assumed to be disease causing. We re-evaluated the evidence for pathogenicity of the identified variants using in silico predictions, segregation, original reports, available functional data and assessment of allele frequencies in healthy individuals as well as in a follow up cohort of 777 patients. Results and Interpretation: We identified 8 known missense mutations, previously reported as pathogenic, in a total of 17 unrelated epilepsy patients (17/448; 3.80%). Our re-evaluation indicates that 7 out of these 8 variants (p.R27T; p.R28C; p.R542Q; p.R604H; p.T1250M; p.E1308D; p.R1928G; NP-001159435.1) are not pathogenic. Only the p.T1174S mutation may be considered as a genetic risk factor for epilepsy of small effect size based on the enrichment in patients (P = 6.60 7 10-4; OR = 0.32, fishers exact test), previous functional studies but incomplete penetrance. Thus, incorporation of previous studies in genetic counseling of SCN1A sequencing results is challenging and may produce incorrect conclusions

    A model-based approach to selection of tag SNPs

    Get PDF
    BACKGROUND: Single Nucleotide Polymorphisms (SNPs) are the most common type of polymorphisms found in the human genome. Effective genetic association studies require the identification of sets of tag SNPs that capture as much haplotype information as possible. Tag SNP selection is analogous to the problem of data compression in information theory. According to Shannon's framework, the optimal tag set maximizes the entropy of the tag SNPs subject to constraints on the number of SNPs. This approach requires an appropriate probabilistic model. Compared to simple measures of Linkage Disequilibrium (LD), a good model of haplotype sequences can more accurately account for LD structure. It also provides a machinery for the prediction of tagged SNPs and thereby to assess the performances of tag sets through their ability to predict larger SNP sets. RESULTS: Here, we compute the description code-lengths of SNP data for an array of models and we develop tag SNP selection methods based on these models and the strategy of entropy maximization. Using data sets from the HapMap and ENCODE projects, we show that the hidden Markov model introduced by Li and Stephens outperforms the other models in several aspects: description code-length of SNP data, information content of tag sets, and prediction of tagged SNPs. This is the first use of this model in the context of tag SNP selection. CONCLUSION: Our study provides strong evidence that the tag sets selected by our best method, based on Li and Stephens model, outperform those chosen by several existing methods. The results also suggest that information content evaluated with a good model is more sensitive for assessing the quality of a tagging set than the correct prediction rate of tagged SNPs. Besides, we show that haplotype phase uncertainty has an almost negligible impact on the ability of good tag sets to predict tagged SNPs. This justifies the selection of tag SNPs on the basis of haplotype informativeness, although genotyping studies do not directly assess haplotypes. A software that implements our approach is available

    Exome sequencing utility in defining the genetic landscape of hearing loss and novel-gene discovery in Iran

    Get PDF
    Hearing loss (HL) is one of the most common sensory defects affecting more than 466 million individuals worldwide. It is clinically and genetically heterogeneous with over 120 genes causing non-syndromic HL identified to date. Here, we performed exome sequencing (ES) on a cohort of Iranian families with no disease-causing variants in known deafness-associated genes after screening with a targeted gene panel. We identified likely causal variants in 20 out of 71 families screened. Fifteen families segregated variants in known deafness-associated genes. Eight families segregated variants in novel candidate genes for HL: DBH, TOP3A, COX18, USP31, TCF19, SCP2, TENM1, and CARMIL1. In the three of these families, intrafamilial locus heterogeneity was observed with variants in both known and novel candidate genes. In aggregate, we were able to identify the underlying genetic cause of HL in nearly 30 of our study cohort using ES. This study corroborates the observation that high-throughput DNA sequencing in populations with high rates of consanguineous marriages represents a more appropriate strategy to elucidate the genetic etiology of heterogeneous conditions such as HL. © 2021 John Wiley & Sons A/S. Published by John Wiley & Sons Lt

    Association of toll-interacting protein gene polymorphisms with atopic dermatitis

    Get PDF
    BACKGROUND: Atopic dermatitis (AD) is a common inflammatory skin disorder, affecting up to 15% of children in industrialized countries. Toll-interacting protein (TOLLIP) is an inhibitory adaptor protein within the toll-like receptor (TLR) pathway, a part of the innate immune system that recognizes structurally conserved molecular patterns of microbial pathogens, leading to an inflammatory immune response. METHODS: In order to detect a possible role of TOLLIP variation in the pathogenesis of AD, we screened the entire coding sequence of the TOLLIP gene by SSCP in 50 AD patients. We identified an amino acid exchange in exon 6 (Ala222Ser) and a synonymous variation in exon 4 (Pro139Pro). Subsequently, these two variations and four additional non-coding polymorphisms (-526 C/G, two polymorphisms in intron 1 and one in the 3'UTR) were genotyped in 317 AD patients and 224 healthy controls. RESULTS: The -526G allele showed borderline association with AD in our cohort (p = 0.012; significance level after correction for multiple testing 0.0102). Haplotype analysis did not yield additional information. Evaluation of mRNA expression by quantitative real-time polymerase chain reaction in six probands with the CC and six with the GG genotype at the -526 C/G locus did not reveal significant differences between genotypes. CONCLUSION: Variation in the TOLLIP gene may play a role in the pathogenesis of AD. Yet, replication studies in other cohorts and populations are warranted to confirm these association results

    Evaluation of presumably disease causing SCN1A variants in a cohort of common epilepsy syndromes

    Get PDF
    Objective: The SCN1A gene, coding for the voltage-gated Na+ channel alpha subunit NaV1.1, is the clinically most relevant epilepsy gene. With the advent of high-throughput next-generation sequencing, clinical laboratories are generating an ever-increasing catalogue of SCN1A variants. Variants are more likely to be classified as pathogenic if they have already been identified previously in a patient with epilepsy. Here, we critically re-evaluate the pathogenicity of this class of variants in a cohort of patients with common epilepsy syndromes and subsequently ask whether a significant fraction of benign variants have been misclassified as pathogenic. Methods: We screened a discovery cohort of 448 patients with a broad range of common genetic epilepsies and 734 controls for previously reported SCN1A mutations that were assumed to be disease causing. We re-evaluated the evidence for pathogenicity of the identified variants using in silico predictions, segregation, original reports, available functional data and assessment of allele frequencies in healthy individuals as well as in a follow up cohort of 777 patients. Results and Interpretation: We identified 8 known missense mutations, previously reported as path

    Pipeline for Large-Scale Microdroplet Bisulfite PCR-Based Sequencing Allows the Tracking of Hepitype Evolution in Tumors

    Get PDF
    Cytosine methylation provides an epigenetic level of cellular plasticity that is important for development, differentiation and cancerogenesis. We adopted microdroplet PCR to bisulfite treated target DNA in combination with second generation sequencing to simultaneously assess DNA sequence and methylation. We show measurement of methylation status in a wide range of target sequences (total 34 kb) with an average coverage of 95% (median 100%) and good correlation to the opposite strand (rho = 0.96) and to pyrosequencing (rho = 0.87). Data from lymphoma and colorectal cancer samples for SNRPN (imprinted gene), FGF6 (demethylated in the cancer samples) and HS3ST2 (methylated in the cancer samples) serve as a proof of principle showing the integration of SNP data and phased DNA-methylation information into “hepitypes” and thus the analysis of DNA methylation phylogeny in the somatic evolution of cancer

    Genetic risk profiles for depression and anxiety in adult and elderly cohorts

    Get PDF
    The first generation of genome-wide association studies (GWA studies) for psychiatric disorders has led to new insights regarding the genetic architecture of these disorders. We now start to realize that a larger number of genes, each with a small contribution, are likely to explain the heritability of psychiatric diseases. The contribution of a large number of genes to complex traits can be analyzed with genome-wide profiling. In a discovery sample, a genetic risk profile for depression was defined based on a GWA study of 1738 adult cases and 1802 controls. The genetic risk scores were tested in two population-based samples of elderly participants. The genetic risk profiles were evaluated for depression and anxiety in the Rotterdam Study cohort and the Erasmus Rucphen Family (ERF) study. The genetic risk scores were significantly associated with different measures of depression and explained up to ∼0.7% of the variance in depression in Rotterdam Study and up to ∼1% in ERF study. The genetic score for depression was also significantly associated with anxiety explaining up to 2.1% in Rotterdam study. These findings suggest the presence of many genetic loci of small effect that influence both depression and anxiety. Remarkably, the predictive value of these profiles was as large in the sample of elderly participants as in the middle-aged samples

    Rare coding variants in genes encoding GABA_A receptors in genetic generalised epilepsies: an exome-based case-control study

    Get PDF
    BACKGROUND: Genetic generalised epilepsy is the most common type of inherited epilepsy. Despite a high concordance rate of 80% in monozygotic twins, the genetic background is still poorly understood. We aimed to investigate the burden of rare genetic variants in genetic generalised epilepsy. METHODS: For this exome-based case-control study, we used three different genetic generalised epilepsy case cohorts and three independent control cohorts, all of European descent. Cases included in the study were clinically evaluated for genetic generalised epilepsy. Whole-exome sequencing was done for the discovery case cohort, a validation case cohort, and two independent control cohorts. The replication case cohort underwent targeted next-generation sequencing of the 19 known genes encoding subunits of GABAA receptors and was compared to the respective GABAA receptor variants of a third independent control cohort. Functional investigations were done with automated two-microelectrode voltage clamping in Xenopus laevis oocytes. FINDINGS: Statistical comparison of 152 familial index cases with genetic generalised epilepsy in the discovery cohort to 549 ethnically matched controls suggested an enrichment of rare missense (Nonsyn) variants in the ensemble of 19 genes encoding GABAA receptors in cases (odds ratio [OR] 2·40 [95% CI 1·41-4·10]; pNonsyn=0·0014, adjusted pNonsyn=0·019). Enrichment for these genes was validated in a whole-exome sequencing cohort of 357 sporadic and familial genetic generalised epilepsy cases and 1485 independent controls (OR 1·46 [95% CI 1·05-2·03]; pNonsyn=0·0081, adjusted pNonsyn=0·016). Comparison of genes encoding GABAA receptors in the independent replication cohort of 583 familial and sporadic genetic generalised epilepsy index cases, based on candidate-gene panel sequencing, with a third independent control cohort of 635 controls confirmed the overall enrichment of rare missense variants for 15 GABAA receptor genes in cases compared with controls (OR 1·46 [95% CI 1·02-2·08]; pNonsyn=0·013, adjusted pNonsyn=0·027). Functional studies for two selected genes (GABRB2 and GABRA5) showed significant loss-of-function effects with reduced current amplitudes in four of seven tested variants compared with wild-type receptors. INTERPRETATION: Functionally relevant variants in genes encoding GABAA receptor subunits constitute a significant risk factor for genetic generalised epilepsy. Examination of the role of specific gene groups and pathways can disentangle the complex genetic architecture of genetic generalised epilepsy. FUNDING: EuroEPINOMICS (European Science Foundation through national funding organisations), Epicure and EpiPGX (Sixth Framework Programme and Seventh Framework Programme of the European Commission), Research Unit FOR2715 (German Research Foundation and Luxembourg National Research Fund)
    corecore