71 research outputs found

    LeafAI: query generator for clinical cohort discovery rivaling a human programmer

    Full text link
    Objective: Identifying study-eligible patients within clinical databases is a critical step in clinical research. However, accurate query design typically requires extensive technical and biomedical expertise. We sought to create a system capable of generating data model-agnostic queries while also providing novel logical reasoning capabilities for complex clinical trial eligibility criteria. Materials and Methods: The task of query creation from eligibility criteria requires solving several text-processing problems, including named entity recognition and relation extraction, sequence-to-sequence transformation, normalization, and reasoning. We incorporated hybrid deep learning and rule-based modules for these, as well as a knowledge base of the Unified Medical Language System (UMLS) and linked ontologies. To enable data-model agnostic query creation, we introduce a novel method for tagging database schema elements using UMLS concepts. To evaluate our system, called LeafAI, we compared the capability of LeafAI to a human database programmer to identify patients who had been enrolled in 8 clinical trials conducted at our institution. We measured performance by the number of actual enrolled patients matched by generated queries. Results: LeafAI matched a mean 43% of enrolled patients with 27,225 eligible across 8 clinical trials, compared to 27% matched and 14,587 eligible in queries by a human database programmer. The human programmer spent 26 total hours crafting queries compared to several minutes by LeafAI. Conclusions: Our work contributes a state-of-the-art data model-agnostic query generation system capable of conditional reasoning using a knowledge base. We demonstrate that LeafAI can rival a human programmer in finding patients eligible for clinical trials

    Does Habitual Physical Activity Increase the Sensitivity of the Appetite Control System? A Systematic Review.

    Get PDF
    BACKGROUND: It has been proposed that habitual physical activity improves appetite control; however, the evidence has never been systematically reviewed. OBJECTIVE: To examine whether appetite control (e.g. subjective appetite, appetite-related peptides, food intake) differs according to levels of physical activity. DATA SOURCES: Medline, Embase and SPORTDiscus were searched for articles published between 1996 and 2015, using keywords pertaining to physical activity, appetite, food intake and appetite-related peptides. STUDY SELECTION: Articles were included if they involved healthy non-smoking adults (aged 18-64 years) participating in cross-sectional studies examining appetite control in active and inactive individuals; or before and after exercise training in previously inactive individuals. STUDY APPRAISAL AND SYNTHESIS: Of 77 full-text articles assessed, 28 studies (14 cross-sectional; 14 exercise training) met the inclusion criteria. RESULTS: Appetite sensations and absolute energy intake did not differ consistently across studies. Active individuals had a greater ability to compensate for high-energy preloads through reductions in energy intake, in comparison with inactive controls. When physical activity level was graded across cross-sectional studies (low, medium, high, very high), a significant curvilinear effect on energy intake (z-scores) was observed. LIMITATIONS: Methodological issues existed concerning the small number of studies, lack of objective quantification of food intake, and various definitions used to define active and inactive individuals. CONCLUSION: Habitually active individuals showed improved compensation for the energy density of foods, but no consistent differences in appetite or absolute energy intake, in comparison with inactive individuals. This review supports a J-shaped relationship between physical activity level and energy intake. Further studies are required to confirm these findings. PROSPERO REGISTRATION NUMBER: CRD42015019696

    Germline HOXB13 mutations p.G84E and p.R217C do not confer an increased breast cancer risk

    Get PDF
    In breast cancer, high levels of homeobox protein Hox-B13 (HOXB13) have been associated with disease progression of ER-positive breast cancer patients and resistance to tamoxifen treatment. Since HOXB13 p.G84E is a prostate cancer risk allele, we evaluated the association between HOXB13 germline mutations and breast cancer risk in a previous study consisting of 3,270 familial non-BRCA1/2 breast cancer cases and 2,327 controls from the Netherlands. Although both recurrent HOXB13 mutations p.G84E and p.R217C were not associated with breast cancer risk, the risk estimation for p.R217C was not very precise. To provide more conclusive evidence regarding the role of HOXB13 in breast cancer susceptibility, we here evaluated the association between HOXB13 mutations and increased breast cancer risk within 81 studies of the international Breast Cancer Association Consortium containing 68,521 invasive breast cancer patients and 54,865 controls. Both HOXB13 p.G84E and p.R217C did not associate with the development of breast cancer in European women, neither in the overall analysis (OR = 1.035, 95% CI = 0.859-1.246, P = 0.718 and OR = 0.798, 95% CI = 0.482-1.322, P = 0.381 respectively), nor in specific high-risk subgroups or breast cancer subtypes. Thus, although involved in breast cancer progression, HOXB13 is not a material breast cancer susceptibility gene.Peer reviewe

    Association of the CHEK2 c.1100delC variant, radiotherapy, and systemic treatment with contralateral breast cancer risk and breast cancer-specific survival

    Get PDF
    Background: Breast cancer (BC) patients with a germline CHEK2 c.1100delC variant have an increased risk of contralateral BC (CBC) and worse BC-specific survival (BCSS) compared to non-carriers.Aim: To assessed the associations of CHEK2 c.1100delC, radiotherapy, and systemic treatment with CBC risk and BCSS.Methods: Analyses were based on 82,701 women diagnosed with a first primary invasive BC including 963 CHEK2 c.1100delC carriers; median follow-up was 9.1 years. Differential associations with treatment by CHEK2 c.1100delC status were tested by including interaction terms in a multivariable Cox regression model. A multi-state model was used for further insight into the relation between CHEK2 c.1100delC status, treatment, CBC risk and death. Results: There was no evidence for differential associations of therapy with CBC risk by CHEK2 c.1100delC status. The strongest association with reduced CBC risk was observed for the combination of chemotherapy and endocrine therapy [HR (95% CI): 0.66 (0.55-0.78)]. No association was observed with radiotherapy.Results from the multi-state model showed shorter BCSS for CHEK2 c.1100delC carriers versus non-carriers also after accounting for CBC occurrence [HR (95% CI): 1.30 (1.09-1.56)].Conclusion: Systemic therapy was associated with reduced CBC risk irrespective of CHEK2 c.1100delC status. Moreover, CHEK2 c.1100delC carriers had shorter BCSS, which appears not to be fully explained by their CBC risk.Peer reviewe

    Indicators of "Healthy Aging" in older women (65-69 years of age). A data-mining approach based on prediction of long-term survival

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Prediction of long-term survival in healthy adults requires recognition of features that serve as early indicators of successful aging. The aims of this study were to identify predictors of long-term survival in older women and to develop a multivariable model based upon longitudinal data from the Study of Osteoporotic Fractures (SOF).</p> <p>Methods</p> <p>We considered only the youngest subjects (<it>n </it>= 4,097) enrolled in the SOF cohort (65 to 69 years of age) and excluded older SOF subjects more likely to exhibit a "frail" phenotype. A total of 377 phenotypic measures were screened to determine which were of most value for prediction of long-term (19-year) survival. Prognostic capacity of individual predictors, and combinations of predictors, was evaluated using a cross-validation criterion with prediction accuracy assessed according to time-specific AUC statistics.</p> <p>Results</p> <p>Visual contrast sensitivity score was among the top 5 individual predictors relative to all 377 variables evaluated (mean AUC = 0.570). A 13-variable model with strong predictive performance was generated using a forward search strategy (mean AUC = 0.673). Variables within this model included a measure of physical function, smoking and diabetes status, self-reported health, contrast sensitivity, and functional status indices reflecting cumulative number of daily living impairments (HR ≥ 0.879 or RH ≤ 1.131; P < 0.001). We evaluated this model and show that it predicts long-term survival among subjects assigned differing causes of death (e.g., cancer, cardiovascular disease; P < 0.01). For an average follow-up time of 20 years, output from the model was associated with multiple outcomes among survivors, such as tests of cognitive function, geriatric depression, number of daily living impairments and grip strength (P < 0.03).</p> <p>Conclusions</p> <p>The multivariate model we developed characterizes a "healthy aging" phenotype based upon an integration of measures that together reflect multiple dimensions of an aging adult (65-69 years of age). Age-sensitive components of this model may be of value as biomarkers in human studies that evaluate anti-aging interventions. Our methodology could be applied to data from other longitudinal cohorts to generalize these findings, identify additional predictors of long-term survival, and to further develop the "healthy aging" concept.</p

    Identification of six new susceptibility loci for invasive epithelial ovarian cancer.

    Get PDF
    Genome-wide association studies (GWAS) have identified 12 epithelial ovarian cancer (EOC) susceptibility alleles. The pattern of association at these loci is consistent in BRCA1 and BRCA2 mutation carriers who are at high risk of EOC. After imputation to 1000 Genomes Project data, we assessed associations of 11 million genetic variants with EOC risk from 15,437 cases unselected for family history and 30,845 controls and from 15,252 BRCA1 mutation carriers and 8,211 BRCA2 mutation carriers (3,096 with ovarian cancer), and we combined the results in a meta-analysis. This new study design yielded increased statistical power, leading to the discovery of six new EOC susceptibility loci. Variants at 1p36 (nearest gene, WNT4), 4q26 (SYNPO2), 9q34.2 (ABO) and 17q11.2 (ATAD5) were associated with EOC risk, and at 1p34.3 (RSPO1) and 6p22.1 (GPX6) variants were specifically associated with the serous EOC subtype, all with P < 5 × 10(-8). Incorporating these variants into risk assessment tools will improve clinical risk predictions for BRCA1 and BRCA2 mutation carriers.COGS project is funded through a European Commission's Seventh Framework Programme grant (agreement number 223175 ] HEALTH ]F2 ]2009 ]223175). The CIMBA data management and data analysis were supported by Cancer Research.UK grants 12292/A11174 and C1287/A10118. The Ovarian Cancer Association Consortium is supported by a grant from the Ovarian Cancer Research Fund thanks to donations by the family and friends of Kathryn Sladek Smith (PPD/RPCI.07). The scientific development and funding for this project were in part supported by the US National Cancer Institute GAME ]ON Post ]GWAS Initiative (U19 ]CA148112). This study made use of data generated by the Wellcome Trust Case Control consortium. Funding for the project was provided by the Wellcome Trust under award 076113. The results published here are in part based upon data generated by The Cancer Genome Atlas Pilot Project established by the National Cancer Institute and National Human Genome Research Institute (dbGap accession number phs000178.v8.p7). The cBio portal is developed and maintained by the Computational Biology Center at Memorial Sloan ] Kettering Cancer Center. SH is supported by an NHMRC Program Grant to GCT. Details of the funding of individual investigators and studies are provided in the Supplementary Note. This study made use of data generated by the Wellcome Trust Case Control consortium, funding for which was provided by the Wellcome Trust under award 076113. The results published here are, in part, based upon data generated by The Cancer Genome Atlas Pilot Project established by the National Cancerhttp://dx.doi.org/10.1038/ng.3185This is the Author Accepted Manuscript of 'Identification of six new susceptibility loci for invasive epithelial ovarian cancer' which was published in Nature Genetics 47, 164–171 (2015) © Nature Publishing Group - content may only be used for academic research

    Assessment of variation in immunosuppressive pathway genes reveals TGFBR2 to be associated with risk of clear cell ovarian cancer.

    Get PDF
    BACKGROUND: Regulatory T (Treg) cells, a subset of CD4+ T lymphocytes, are mediators of immunosuppression in cancer, and, thus, variants in genes encoding Treg cell immune molecules could be associated with ovarian cancer. METHODS: In a population of 15,596 epithelial ovarian cancer (EOC) cases and 23,236 controls, we measured genetic associations of 1,351 SNPs in Treg cell pathway genes with odds of ovarian cancer and tested pathway and gene-level associations, overall and by histotype, for the 25 genes, using the admixture likelihood (AML) method. The most significant single SNP associations were tested for correlation with expression levels in 44 ovarian cancer patients. RESULTS: The most significant global associations for all genes in the pathway were seen in endometrioid ( p = 0.082) and clear cell ( p = 0.083), with the most significant gene level association seen with TGFBR2 ( p = 0.001) and clear cell EOC. Gene associations with histotypes at p < 0.05 included: IL12 ( p = 0.005 and p = 0.008, serous and high-grade serous, respectively), IL8RA ( p = 0.035, endometrioid and mucinous), LGALS1 ( p = 0.03, mucinous), STAT5B ( p = 0.022, clear cell), TGFBR1 ( p = 0.021 endometrioid) and TGFBR2 ( p = 0.017 and p = 0.025, endometrioid and mucinous, respectively). CONCLUSIONS: Common inherited gene variation in Treg cell pathways shows some evidence of germline genetic contribution to odds of EOC that varies by histologic subtype and may be associated with mRNA expression of immune-complex receptor in EOC patients
    corecore