2 research outputs found

    Automatic identification of variables in epidemiological datasets using logic regression

    Get PDF
    textabstractBackground: For an individual participant data (IPD) meta-analysis, multiple datasets must be transformed in a consistent format, e.g. using uniform variable names. When large numbers of datasets have to be processed, this can be a time-consuming and error-prone task. Automated or semi-automated identification of variables can help to reduce the workload and improve the data quality. For semi-automation high sensitivity in the recognition of matching variables is particularly important, because it allows creating software which for a target variable presents a choice of source variables, from which a user can choose the matching one, with only low risk of having missed a correct source variable. Methods: For each variable in a set of target variables, a number of simple rules were manually created. With logic regression, an optimal Boolean combination of these rules was searched for every target variable, using a random subset of a large database of epidemiological and clinical cohort data (construction subset). In a second subset of this database (validation subset), this optimal combination rules were validated. Results: In the construction sample, 41 target variables were allocated on average with a positive predictive value (PPV) of 34%, and a negative predictive value (NPV) of 95%. In the validation sample, PPV was 33%, whereas NPV remained at 94%. In the construction sample, PPV was 50% or less in 63% of all variables, in the validation sample in 71% of all variables. Conclusions: We demonstrated that the application of logic regression in a complex data management task in large epidemiological IPD meta-analyses is feasible. However, the performance of the algorithm is poor, which may require backup strategies

    Evidence for three genetic loci involved in both anorexia nervosa risk and variation of body mass index

    No full text
    The maintenance of normal body weight is disrupted in patients with anorexia nervosa (AN) for prolonged periods of time. Prior to the onset of AN, premorbid body mass index (BMI) spans the entire range from underweight to obese. After recovery, patients have reduced rates of overweight and obesity. As such, loci involved in body weight regulation may also be relevant for AN and vice versa. Our primary analysis comprised a cross-trait analysis of the 1000 single-nucleotide polymorphisms (SNPs) with the lowest P-values in a genome-wide association meta-analysis (GWAMA) of AN (GCAN) for evidence of association in the largest published GWAMA for BMI (GIANT). Subsequently we performed sex-stratified analyses for these 1000 SNPs. Functional ex vivo studies on four genes ensued. Lastly, a look-up of GWAMA-derived BMI-related loci was performed in the AN GWAMA. We detected significant associations (P-values <5 × 10-5, Bonferroni-corrected P<0.05) for nine SNP alleles at three independent loci. Interestingly, all AN susceptibility alleles were consistently associated with increased BMI. None of the genes (chr. 10: CTBP2, chr. 19: CCNE1, chr. 2: CARF and NBEAL1; the latter is a region with high linkage disequilibrium) nearest to these SNPs has previously been associated with AN or obesity. Sex-stratified analyses revealed that the strongest BMI signal originated predominantly from females (chr. 10 rs1561589; Poverall: 2.47 × 10-06/Pfemales: 3.45 × 10-07/Pmales: 0.043). Functional ex vivo studies in mice revealed reduced hypothalamic expression of Ctbp2 and Nbeal1 after fasting. Hypothalamic expression of Ctbp2 was increased in diet-induced obese (DIO) mice as compared with age-matched lean controls. We observed no evidence for associations for the look-up of BMI-related loci in the AN GWAMA. A cross-trait analysis of AN and BMI loci revealed variants at three chromosomal loci with potential joint impact. The chromosome 10 locus is particularly promising given that the association with obesity was primarily driven by females. In addition, the detected altered hypothalamic expression patterns of Ctbp2 and Nbeal1 as a result of fasting and DIO implicate these genes in weight regulation
    corecore