27 research outputs found
DNA damage in circulating leukocytes measured with the comet assay may predict the risk of death
The comet assay or single cell gel electrophoresis, is the most common method used to measure strand breaks and a variety of other DNA lesions in human populations. To estimate the risk of overall mortality, mortality by cause, and cancer incidence associated to DNA damage, a cohort of 2,403 healthy individuals (25,978 person-years) screened in 16 laboratories using the comet assay between 1996 and 2016 was followed-up. Kaplan-Meier analysis indicated a worse overall survival in the medium and high tertile of DNA damage (p < 0.001). The effect of DNA damage on survival was modelled according to Cox proportional hazard regression model. The adjusted hazard ratio (HR) was 1.42 (1.06-1.90) for overall mortality, and 1.94 (1.04-3.59) for diseases of the circulatory system in subjects with the highest tertile of DNA damage. The findings of this study provide epidemiological evidence encouraging the implementation of the comet assay in preventive strategies for non-communicable diseases
Constructing Gröbner bases for Noetherian rings
We give a constructive proof showing that every finitely generated polynomial ideal has a Gröbner basis, provided the ring of coefficients is Noetherian in the sense of Richman and Seidenberg. That is, we give a constructive termination proof for a variant of the well-known algorithm for computing the Gröbner basis. In combination with a purely order-theoretic result we have proved in a separate paper, this yields a unified constructive proof of the Hilbert basis theorem for all Noether classes: if a ring belongs to a Noether class, then so does the polynomial ring. Our proof can be seen as a constructive reworking of one of the classical proofs, in the spirit of the partial realisation of Hilbert's programme in algebra put forward by Coquand and Lombardi. The rings under consideration need not be commutative, but are assumed to be coherent and strongly discrete: that is, they admit a membership test for every finitely generated ideal. As a complement to the proof, we provide a prime decomposition for commutative rings possessing the finite-depth property
Noetherian orders
Noether classes of posets arise in a natural way from the constructively meaningful variants
of the notion of a Noetherian ring. Using an axiomatic characterisation of a Noether class,
we prove that if a poset belongs to a Noether class, then so does the poset of the finite
descending chains. When applied to the poset of finitely generated ideals of a ring, this helps
towards a unified constructive proof of the Hilbert basis theorem for all Noether classes
Supplementary Material for: The Use of the Linear Mixed Model in Human Genetics
We give a short but detailed review of the methods used to deal with linear mixed models (restricted likelihood, AIREML algorithm, best linear unbiased predictors, etc.), with a few original points. Then we describe three common applications of the linear mixed model in contemporary human genetics: association testing (pathways analysis or rare variants association tests), genomic heritability estimates, and correction for population stratification in genome-wide association studies. We also consider the performance of best linear unbiased predictors for prediction in this context, through a simulation study for rare variants in a short genomic region, and through a short theoretical development for genome-wide data. For each of these applications, we discuss the relevance and the impact of modeling genetic effects as random effects
Moment estimators of relatedness from low-depth whole-genome sequencing data
International audienceAbstract Background Estimating relatedness is an important step for many genetic study designs. A variety of methods for estimating coefficients of pairwise relatedness from genotype data have been proposed. Both the kinship coefficient φ and the fraternity coefficient ψ for all pairs of individuals are of interest. However, when dealing with low-depth sequencing or imputation data, individual level genotypes cannot be confidently called. To ignore such uncertainty is known to result in biased estimates. Accordingly, methods have recently been developed to estimate kinship from uncertain genotypes. Results We present new method-of-moment estimators of both the coefficients φ and ψ calculated directly from genotype likelihoods. We have simulated low-depth genetic data for a sample of individuals with extensive relatedness by using the complex pedigree of the known genetic isolates of Cilento in South Italy. Through this simulation, we explore the behaviour of our estimators, demonstrate their properties, and show advantages over alternative methods. A demonstration of our method is given for a sample of 150 French individuals with down-sampled sequencing data. Conclusions We find that our method can provide accurate relatedness estimates whilst holding advantages over existing methods in terms of robustness, independence from external software, and required computation time. The method presented in this paper is referred to as LowKi ( Low -depth Ki nship) and has been made available in an R package ( https://github.com/genostats/LowKi )
Supplementary Material for: A Bagged, Partially Linear, Tree-Based Regression Procedure for Prediction and Variable Selection
<b><i>Objectives:</i></b> In genomics, variable selection and prediction accounting for the complex interrelationships between explanatory variables represent major challenges. Tree-based methods are powerful alternatives to classical regression models. We have recently proposed the generalized, partially linear, tree-based regression (GPLTR) procedure that integrates the advantages of generalized linear regression (allowing the incorporation of confounding variables) and of tree-based models. In this work, we use bagging to address a classical concern of tree-based methods: their instability. <b><i>Methods:</i></b> We present a bagged GPLTR procedure and three scores for variable importance. The prediction accuracy and the performance of the scores are assessed by simulation. The use of this procedure is exemplified by the analysis of a lung cancer data set. The aim is to predict the epidermal growth factor receptor (EGFR) mutation based on gene expression measurements, taking into account the ethnicity (confounder variable) and perform variable selection. <b><i>Results:</i></b> The procedure performs well in terms of prediction accuracy. The scores differentiate predictive variables from noise variables. Based on a lung adenocarcinoma data set, the procedure achieves good predictive performance for EGFR mutation and selects relevant genes. <b><i>Conclusion:</i></b> The proposed bagged GPLTR procedure performs well for prediction and variable selection
Determination of the real effect of genes identified in GWAS: The example of IL2RA in multiple sclerosis
Genome-wide association studies (GWAS), although efficient to detect genes involved in complex diseases, are not designed to measure the real effect of the genes. This is illustrated here by the example of IL2RA in multiple sclerosis (MS). Association between IL2RA and MS is clearly established, although the functional variation is still unknown: the effect of IL2RA might be better described by several SNPs than by a single one. This study investigates whether a pair of SNPs better explains the observed linkage and association data than a single SNP. In total, 522 trio families and 244 affected sib-pairs were typed for 26 IL2RA SNPs. For each SNP and pairs of SNPs, the phased genotypes of patients and controls were compared to determine the SNP set offering the best risk discrimination. Consistency between the genotype risks provided by the retained set and the identical by descent allele sharing in affected sib-pairs was assessed. After controlling for multiple testing, the set of SNPs rs2256774 and rs3118470, provides the best discrimination between the case and control genotype distributions (P-corrected0.009). The relative risk between the least and most at-risk genotypes is 3.54 with a 95% confidence interval of 2.14-5.94. Furthermore, the linkage information provided by the allele sharing between affected sibs is consistent with the retained set (P=0.80) but rejects the SNP reported in the literature (P=0.006). Establishing a valid modeling of a disease gene is essential to test its potential interaction with other genes and to reconstruct the pathophysiological pathways. © 2012 Macmillan Publishers Limited All rights reserved