1,212 research outputs found
Integrative DNA methylome analysis of pan-cancer biomarkers in cancer discordant monozygotic twin-pairs
BACKGROUND: A key focus in cancer research is the discovery of biomarkers that accurately diagnose early lesions in non-invasive tissues. Several studies have identified malignancy-associated DNA methylation changes in blood, yet no general cancer biomarker has been identified to date. Here, we explore the potential of blood DNA methylation as a biomarker of pan-cancer (cancer of multiple different origins) in 41 female cancer discordant monozygotic (MZ) twin-pairs sampled before or after diagnosis using the Illumina HumanMethylation450 BeadChip. RESULTS: We analysed epigenome-wide DNA methylation profiles in 41 cancer discordant MZ twin-pairs with affected individuals diagnosed with tumours at different single primary sites: the breast, cervix, colon, endometrium, thyroid gland, skin (melanoma), ovary, and pancreas. No significant global differences in whole blood DNA methylation profiles were observed. Epigenome-wide analyses identified one novel pan-cancer differentially methylated position at false discovery rate (FDR) threshold of 10 % (cg02444695, P = 1.8 × 10(-7)) in an intergenic region 70 kb upstream of the SASH1 tumour suppressor gene, and three suggestive signals in COL11A2, AXL, and LINC00340. Replication of the four top-ranked signals in an independent sample of nine cancer-discordant MZ twin-pairs showed a similar direction of association at COL11A2, AXL, and LINC00340, and significantly greater methylation discordance at AXL compared to 480 healthy concordant MZ twin-pairs. The effects at cg02444695 (near SASH1), COL11A2, and LINC00340 were the most promising in biomarker potential because the DNA methylation differences were found to pre-exist in samples obtained prior to diagnosis and were limited to a 5-year period before diagnosis. Gene expression follow-up at the top-ranked signals in 283 healthy individuals showed correlation between blood methylation and gene expression in lymphoblastoid cell lines at PRL, and in the skin tissue at AXL. A significant enrichment of differential DNA methylation was observed in enhancer regions (P = 0.03). CONCLUSIONS: We identified DNA methylation signatures in blood associated with pan-cancer, at or near SASH1, COL11A2, AXL, and LINC00340. Three of these signals were present up to 5 years prior to cancer diagnosis, highlighting the potential clinical utility of whole blood DNA methylation analysis in cancer surveillance
Prediction of atrial fibrillation and stroke using machine learning models in UK Biobank
Objective: Atrial fibrillation (AF) is the most common cardiac arrythmia, and it is associated with increased risk for ischemic stroke, which is underestimated, as AF can be asymptomatic. The aim of this study was to develop optimal ML models for prediction of AF in the population, and secondly for ischemic stroke in AF patients. Methods: To develop ML models for prediction of 1) AF in the general population and 2) ischemic stroke in patients with AF we constructed XGBoost, LightGBM, Random Forest, Deep Neural Network, Support Vector Machine and Lasso penalised logistic regression models using UK-Biobank's extensive real-world clinical data, questionnaires, as well as biochemical and genetic data, and their predictive performances were compared. Ranking and contribution of the different features was assessed by SHapley Additive exPlanations (SHAP) analysis. The clinical tool CHA2DS2-VASc for prediction of ischemic stroke among AF patients, was used for comparison to the best performing ML model. Findings: The best performing model for AF prediction was LightGBM, with an area-under-the-roc-curve (AUROC) of 0.729 (95% confidence intervals (CI): 0.719, 0.738). The best performing model for ischemic stroke prediction in AF patients was XGBoost with AUROC of 0.631 (95% CI: 0.604, 0.657). The improved AUROC in the XGBoost model compared to CHA2DS2-VASc was statistically significant based on DeLong's test (p-value = 2.20E-06). In addition, the SHAP analysis showed that several peripheral blood biomarkers (e.g. creatinine, glycated haemoglobin, monocytes) were associated with ischemic stroke, which are not considered by CHA2DS2-VASc. Implications: The best performing ML models presented have the potential for clinical use, but further validation in independent studies is required. Our results endorse the incorporation of some routinely measured blood biomarkers for ischemic stroke prediction in AF patients
Applications of machine and deep learning to thyroid cytology and histopathology: a review.
This review synthesises past research into how machine and deep learning can improve the cyto- and histopathology processing pipelines for thyroid cancer diagnosis. The current gold-standard preoperative technique of fine-needle aspiration cytology has high interobserver variability, often returns indeterminate samples and cannot reliably identify some pathologies; histopathology analysis addresses these issues to an extent, but it requires surgical resection of the suspicious lesions so cannot influence preoperative decisions. Motivated by these issues, as well as by the chronic shortage of trained pathologists, much research has been conducted into how artificial intelligence could improve current pipelines and reduce the pressure on clinicians. Many past studies have indicated the significant potential of automated image analysis in classifying thyroid lesions, particularly for those of papillary thyroid carcinoma, but these have generally been retrospective, so questions remain about both the practical efficacy of these automated tools and the realities of integrating them into clinical workflows. Furthermore, the nature of thyroid lesion classification is significantly more nuanced in practice than many current studies have addressed, and this, along with the heterogeneous nature of processing pipelines in different laboratories, means that no solution has proven itself robust enough for clinical adoption. There are, therefore, multiple avenues for future research: examine the practical implementation of these algorithms as pathologist decision-support systems; improve interpretability, which is necessary for developing trust with clinicians and regulators; and investigate multiclassification on diverse multicentre datasets, aiming for methods that demonstrate high performance in a process- and equipment-agnostic manner
COVID-19 susceptibility variants associate with blood clots, thrombophlebitis and circulatory diseases.
Epidemiological studies suggest that individuals with comorbid conditions including diabetes, chronic lung, inflammatory and vascular disease, are at higher risk of adverse COVID-19 outcomes. Genome-wide association studies have identified several loci associated with increased susceptibility and severity for COVID-19. However, it is not clear whether these associations are genetically determined or not. We used a Phenome-Wide Association (PheWAS) approach to investigate the role of genetically determined COVID-19 susceptibility on disease related outcomes. PheWAS analyses were performed in order to identify traits and diseases related to COVID-19 susceptibility and severity, evaluated through a predictive COVID-19 risk score. We utilised phenotypic data in up to 400,000 individuals from the UK Biobank, including Hospital Episode Statistics and General Practice data. We identified a spectrum of associations between both genetically determined COVID-19 susceptibility and severity with a number of traits. COVID-19 risk was associated with increased risk for phlebitis and thrombophlebitis (OR = 1.11, p = 5.36e-08). We also identified significant signals between COVID-19 susceptibility with blood clots in the leg (OR = 1.1, p = 1.66e-16) and with increased risk for blood clots in the lung (OR = 1.12, p = 1.45 e-10). Our study identifies significant association of genetically determined COVID-19 with increased blood clot events in leg and lungs. The reported associations between both COVID-19 susceptibility and severity and other diseases adds to the identification and stratification of individuals at increased risk, adverse outcomes and long-term effects
Linkage Disequilibrium Mapping via Cladistic Analysis of Single-Nucleotide Polymorphism Haplotypes
We present a novel approach to disease-gene mapping via cladistic analysis of single-nucleotide polymorphism (SNP) haplotypes obtained from large-scale, population-based association studies, applicable to whole-genome screens, candidate-gene studies, or fine-scale mapping. Clades of haplotypes are tested for association with disease, exploiting the expected similarity of chromosomes with recent shared ancestry in the region flanking the disease gene. The method is developed in a logistic-regression framework and can easily incorporate covariates such as environmental risk factors or additional unlinked loci to allow for population structure. To evaluate the power of this approach to detect disease-marker association, we have developed a simulation algorithm to generate high-density SNP data with short-range linkage disequilibrium based on empirical patterns of haplotype diversity. The results of the simulation study highlight substantial gains in power over single-locus tests for a wide range of disease models, despite overcorrection for multiple testing
- …