12 research outputs found

    Risk factors, outcomes and healthcare utilisation in individuals with multimorbidity including heart failure, chronic kidney disease and type 2 diabetes mellitus: a national electronic health record study

    Get PDF
    BACKGROUND: Heart failure (HF), type 2 diabetes (T2D) and chronic kidney disease (CKD) commonly coexist. We studied characteristics, prognosis and healthcare utilisation of individuals with two of these conditions. METHODS: We performed a retrospective, population-based linked electronic health records study from 1998 to 2020 in England to identify individuals diagnosed with two of: HF, T2D or CKD. We described cohort characteristics at time of second diagnosis and estimated risk of developing the third condition and mortality using Kaplan-Meier and Cox regression models. We also estimated rates of healthcare utilisation in primary care and hospital settings in follow-up. FINDINGS: We identified cohorts of 64 226 with CKD and HF, 82 431 with CKD and T2D, and 13 872 with HF and T2D. Compared with CKD and T2D, those with CKD and HF and HF and T2D had more severe risk factor profile. At 5 years, incidence of the third condition and all-cause mortality occurred in 37% (95% CI: 35.9%, 38.1%%) and 31.3% (30.4%, 32.3%) in HF+T2D, 8.7% (8.4%, 9.0%) and 51.6% (51.1%, 52.1%) in HF+CKD, and 6.8% (6.6%, 7.0%) and 17.9% (17.6%, 18.2%) in CKD+T2D, respectively. In each of the three multimorbid groups, the order of the first two diagnoses was also associated with prognosis. In multivariable analyses, we identified risk factors for developing the third condition and mortality, such as age, sex, medical history and the order of disease diagnosis. Inpatient and outpatient healthcare utilisation rates were highest in CKD and HF, and lowest in CKD and T2D. INTERPRETATION: HF, CKD and T2D carry significant mortality and healthcare burden in combination. Compared with other disease pairs, individuals with CKD and HF had the most severe risk factor profile, prognosis and healthcare utilisation. Service planning, policy and prevention must take into account and monitor data across conditions

    Identifying subtypes of chronic kidney disease with machine learning: development, internal validation and prognostic validation using linked electronic health records in 350,067 individuals

    Get PDF
    BACKGROUND: Although chronic kidney disease (CKD) is associated with high multimorbidity, polypharmacy, morbidity and mortality, existing classification systems (mild to severe, usually based on estimated glomerular filtration rate, proteinuria or urine albumin-creatinine ratio) and risk prediction models largely ignore the complexity of CKD, its risk factors and its outcomes. Improved subtype definition could improve prediction of outcomes and inform effective interventions. METHODS: We analysed individuals ≥18 years with incident and prevalent CKD (n = 350,067 and 195,422 respectively) from a population-based electronic health record resource (2006-2020; Clinical Practice Research Datalink, CPRD). We included factors (n = 264 with 2670 derived variables), e.g. demography, history, examination, blood laboratory values and medications. Using a published framework, we identified subtypes through seven unsupervised machine learning (ML) methods (K-means, Diana, HC, Fanny, PAM, Clara, Model-based) with 66 (of 2670) variables in each dataset. We evaluated subtypes for: (i) internal validity (within dataset, across methods); (ii) prognostic validity (predictive accuracy for 5-year all-cause mortality and admissions); and (iii) medications (new and existing by British National Formulary chapter). FINDINGS: After identifying five clusters across seven approaches, we labelled CKD subtypes: 1. Early-onset, 2. Late-onset, 3. Cancer, 4. Metabolic, and 5. Cardiometabolic. Internal validity: We trained a high performing model (using XGBoost) that could predict disease subtypes with 95% accuracy for incident and prevalent CKD (Sensitivity: 0.81-0.98, F1 score:0.84-0.97). Prognostic validity: 5-year all-cause mortality, hospital admissions, and incidence of new chronic diseases differed across CKD subtypes. The 5-year risk of mortality and admissions in the overall incident CKD population were highest in cardiometabolic subtype: 43.3% (42.3-42.8%) and 29.5% (29.1-30.0%), respectively, and lowest in the early-onset subtype: 5.7% (5.5-5.9%) and 18.7% (18.4-19.1%). MEDICATIONS: Across CKD subtypes, the distribution of prescription medication classes at baseline varied, with highest medication burden in cardiometabolic and metabolic subtypes, and higher burden in prevalent than incident CKD. INTERPRETATION: In the largest CKD study using ML, to-date, we identified five distinct subtypes in individuals with incident and prevalent CKD. These subtypes have relevance to study of aetiology, therapeutics and risk prediction. FUNDING: AstraZeneca UK Ltd, Health Data Research UK

    Identifying subtypes of heart failure from three electronic health record sources with machine learning: an external, prognostic, and genetic validation study

    Get PDF
    BACKGROUND: Machine learning has been used to analyse heart failure subtypes, but not across large, distinct, population-based datasets, across the whole spectrum of causes and presentations, or with clinical and non-clinical validation by different machine learning methods. Using our published framework, we aimed to discover heart failure subtypes and validate them upon population representative data. METHODS: In this external, prognostic, and genetic validation study we analysed individuals aged 30 years or older with incident heart failure from two population-based databases in the UK (Clinical Practice Research Datalink [CPRD] and The Health Improvement Network [THIN]) from 1998 to 2018. Pre-heart failure and post-heart failure factors (n=645) included demographic information, history, examination, blood laboratory values, and medications. We identified subtypes using four unsupervised machine learning methods (K-means, hierarchical, K-Medoids, and mixture model clustering) with 87 of 645 factors in each dataset. We evaluated subtypes for (1) external validity (across datasets); (2) prognostic validity (predictive accuracy for 1-year mortality); and (3) genetic validity (UK Biobank), association with polygenic risk score (PRS) for heart failure-related traits (n=11), and single nucleotide polymorphisms (n=12). FINDINGS: We included 188 800, 124 262, and 9573 individuals with incident heart failure from CPRD, THIN, and UK Biobank, respectively, between Jan 1, 1998, and Jan 1, 2018. After identifying five clusters, we labelled heart failure subtypes as (1) early onset, (2) late onset, (3) atrial fibrillation related, (4) metabolic, and (5) cardiometabolic. In the external validity analysis, subtypes were similar across datasets (c-statistics: THIN model in CPRD ranged from 0·79 [subtype 3] to 0·94 [subtype 1], and CPRD model in THIN ranged from 0·79 [subtype 1] to 0·92 [subtypes 2 and 5]). In the prognostic validity analysis, 1-year all-cause mortality after heart failure diagnosis (subtype 1 0·20 [95% CI 0·14-0·25], subtype 2 0·46 [0·43-0·49], subtype 3 0·61 [0·57-0·64], subtype 4 0·11 [0·07-0·16], and subtype 5 0·37 [0·32-0·41]) differed across subtypes in CPRD and THIN data, as did risk of non-fatal cardiovascular diseases and all-cause hospitalisation. In the genetic validity analysis the atrial fibrillation-related subtype showed associations with the related PRS. Late onset and cardiometabolic subtypes were the most similar and strongly associated with PRS for hypertension, myocardial infarction, and obesity (p<0·0009). We developed a prototype app for routine clinical use, which could enable evaluation of effectiveness and cost-effectiveness. INTERPRETATION: Across four methods and three datasets, including genetic data, in the largest study of incident heart failure to date, we identified five machine learning-informed subtypes, which might inform aetiological research, clinical risk prediction, and the design of heart failure trials. FUNDING: European Union Innovative Medicines Initiative-2

    A retrospective cohort study measured predicting and validating the impact of the COVID-19 pandemic in individuals with chronic kidney disease

    Get PDF
    Chronic kidney disease (CKD) is associated with increased risk of baseline mortality and severe COVID-19, but analyses across CKD stages, and comorbidities are lacking. In prevalent and incident CKD, we investigated comorbidities, baseline risk, COVID-19 incidence, and predicted versus observed one-year excess death. In a national dataset (NHS Digital Trusted Research Environment (NHSD TRE)) for England encompassing 56 million individuals), we conducted a retrospective cohort study (March 2020 to March 2021) for prevalence of comorbidities by incident and prevalent CKD, SARS-CoV-2 infection and mortality. Baseline mortality risk, incidence and outcome of infection by comorbidities, controlling for age, sex and vaccination were assessed. Observed versus predicted one-year mortality at varying population infection rates and pandemic-related relative risks using our published model in pre-pandemic CKD cohorts (NHSD TRE and Clinical Practice Research Datalink (CPRD)) were compared. Among individuals with CKD (prevalent:1,934,585, incident:144,969), comorbidities were common (73.5% and 71.2% with one or more condition(s) in respective data sets, and 13.2% and 11.2% with three or more conditions, in prevalent and incident CKD), and associated with SARS-CoV-2 infection, particularly dialysis/transplantation (odds ratio 2.08, 95% confidence interval 2.04-2.13) and heart failure(1.73, 1.71-1.76), but not cancer (1.01, 1.01-1.04). One-year all-cause mortality varied by age, sex, multi-morbidity and CKD stage. Compared with 34,265 observed excess deaths, in the NHSD-TRE and CPRD databases respectively, we predicted 28,746 and 24,546 deaths (infection rates 10% and relative risks 3.0), and 23,754 and 20,283 deaths (observed infection rates 6.7% and relative risks 3.7). Thus, in this largest, national-level study, individuals with CKD have a high burden of comorbidities and multi-morbidity, and high risk of pre-pandemic and pandemic mortality. Hence, treatment of comorbidities, non-pharmaceutical measures, and vaccination are priorities for people with CKD and management of long-term conditions is important during and beyond the pandemic

    AZD1222 effectiveness against severe COVID-19 in individuals with comorbidity or frailty: the RAVEN cohort study

    Get PDF
    Objectives Despite being prioritized during initial COVID-19 vaccine rollout, vulnerable individuals at high risk of severe COVID-19 (hospitalization, intensive care unit admission, or death) remain underrepresented in vaccine effectiveness (VE) studies. The RAVEN cohort study (NCT05047822) assessed AZD1222 (ChAdOx1 nCov-19) two-dose primary series VE in vulnerable populations. Methods Using the Oxford-Royal College of General Practitioners Clinical Informatics Digital Hub, linked to secondary care, death registration, and COVID-19 datasets in England, COVID-19 outcomes in 2021 were compared in vaccinated and unvaccinated individuals matched on age, sex, region, and multimorbidity. Results Over 4.5 million AZD1222 recipients were matched (mean follow-up ∼5 months); 68% were ≥50 years, 57% had high multimorbidity. Overall, high VE against severe COVID-19 was demonstrated, with lower VE observed in vulnerable populations. VE against hospitalization was higher in the lowest multimorbidity quartile (91.1%; 95% CI: 90.1, 92.0) than the highest quartile (80.4%; 79.7, 81.1), and among individuals ≥65 years, higher in the ‘fit’ (86.2%; 84.5, 87.6) than the frailest (71.8%; 69.3, 74.2). VE against hospitalization was lowest in immunosuppressed individuals (64.6%; 60.7, 68.1). Conclusions Based on integrated and comprehensive UK health data, overall population-level VE with AZD1222 was high. VEs were notably lower in vulnerable groups, particularly the immunosuppressed

    Using national electronic health records for pandemic preparedness: validation of a parsimonious model for predicting excess deaths among those with COVID-19–a data-driven retrospective cohort study

    Get PDF
    Objectives: To use national, pre- and post-pandemic electronic health records (EHR) to develop and validate a scenario-based model incorporating baseline mortality risk, infection rate (IR) and relative risk (RR) of death for prediction of excess deaths. Design: An EHR-based, retrospective cohort study. Setting: Linked EHR in Clinical Practice Research Datalink (CPRD); and linked EHR and COVID-19 data in England provided in NHS Digital Trusted Research Environment (TRE). Participants: In the development (CPRD) and validation (TRE) cohorts, we included 3.8 million and 35.1 million individuals aged ≥30 years, respectively. Main outcome measures: One-year all-cause excess deaths related to COVID-19 from March 2020 to March 2021. Results: From 1 March 2020 to 1 March 2021, there were 127,020 observed excess deaths. Observed RR was 4.34% (95% CI, 4.31–4.38) and IR was 6.27% (95% CI, 6.26–6.28). In the validation cohort, predicted one-year excess deaths were 100,338 compared with the observed 127,020 deaths with a ratio of predicted to observed excess deaths of 0.79. Conclusions: We show that a simple, parsimonious model incorporating baseline mortality risk, one-year IR and RR of the pandemic can be used for scenario-based prediction of excess deaths in the early stages of a pandemic. Our analyses show that EHR could inform pandemic planning and surveillance, despite limited use in emergency preparedness to date. Although infection dynamics are important in the prediction of mortality, future models should take greater account of underlying conditions

    COVID-19 trajectories among 57 million adults in England: a cohort study using electronic health records

    Get PDF
    BACKGROUND: Updatable estimates of COVID-19 onset, progression, and trajectories underpin pandemic mitigation efforts. To identify and characterise disease trajectories, we aimed to define and validate ten COVID-19 phenotypes from nationwide linked electronic health records (EHR) using an extensible framework. METHODS: In this cohort study, we used eight linked National Health Service (NHS) datasets for people in England alive on Jan 23, 2020. Data on COVID-19 testing, vaccination, primary and secondary care records, and death registrations were collected until Nov 30, 2021. We defined ten COVID-19 phenotypes reflecting clinically relevant stages of disease severity and encompassing five categories: positive SARS-CoV-2 test, primary care diagnosis, hospital admission, ventilation modality (four phenotypes), and death (three phenotypes). We constructed patient trajectories illustrating transition frequency and duration between phenotypes. Analyses were stratified by pandemic waves and vaccination status. FINDINGS: Among 57 032 174 individuals included in the cohort, 13 990 423 COVID-19 events were identified in 7 244 925 individuals, equating to an infection rate of 12·7% during the study period. Of 7 244 925 individuals, 460 737 (6·4%) were admitted to hospital and 158 020 (2·2%) died. Of 460 737 individuals who were admitted to hospital, 48 847 (10·6%) were admitted to the intensive care unit (ICU), 69 090 (15·0%) received non-invasive ventilation, and 25 928 (5·6%) received invasive ventilation. Among 384 135 patients who were admitted to hospital but did not require ventilation, mortality was higher in wave 1 (23 485 [30·4%] of 77 202 patients) than wave 2 (44 220 [23·1%] of 191 528 patients), but remained unchanged for patients admitted to the ICU. Mortality was highest among patients who received ventilatory support outside of the ICU in wave 1 (2569 [50·7%] of 5063 patients). 15 486 (9·8%) of 158 020 COVID-19-related deaths occurred within 28 days of the first COVID-19 event without a COVID-19 diagnoses on the death certificate. 10 884 (6·9%) of 158 020 deaths were identified exclusively from mortality data with no previous COVID-19 phenotype recorded. We observed longer patient trajectories in wave 2 than wave 1. INTERPRETATION: Our analyses illustrate the wide spectrum of disease trajectories as shown by differences in incidence, survival, and clinical pathways. We have provided a modular analytical framework that can be used to monitor the impact of the pandemic and generate evidence of clinical and policy relevance using multiple EHR sources. FUNDING: British Heart Foundation Data Science Centre, led by Health Data Research UK

    Risk factors, outcomes and healthcare utilisation in individuals with multimorbidity including heart failure, chronic kidney disease and type 2 diabetes mellitus: a national electronic health record study

    No full text
    Background Heart failure (HF), type 2 diabetes (T2D) and chronic kidney disease (CKD) commonly coexist. We studied characteristics, prognosis and healthcare utilisation of individuals with two of these conditions.Methods We performed a retrospective, population-based linked electronic health records study from 1998 to 2020 in England to identify individuals diagnosed with two of: HF, T2D or CKD. We described cohort characteristics at time of second diagnosis and estimated risk of developing the third condition and mortality using Kaplan-Meier and Cox regression models. We also estimated rates of healthcare utilisation in primary care and hospital settings in follow-up.Findings We identified cohorts of 64 226 with CKD and HF, 82 431 with CKD and T2D, and 13 872 with HF and T2D. Compared with CKD and T2D, those with CKD and HF and HF and T2D had more severe risk factor profile. At 5 years, incidence of the third condition and all-cause mortality occurred in 37% (95% CI: 35.9%, 38.1%%) and 31.3% (30.4%, 32.3%) in HF+T2D, 8.7% (8.4%, 9.0%) and 51.6% (51.1%, 52.1%) in HF+CKD, and 6.8% (6.6%, 7.0%) and 17.9% (17.6%, 18.2%) in CKD+T2D, respectively. In each of the three multimorbid groups, the order of the first two diagnoses was also associated with prognosis. In multivariable analyses, we identified risk factors for developing the third condition and mortality, such as age, sex, medical history and the order of disease diagnosis. Inpatient and outpatient healthcare utilisation rates were highest in CKD and HF, and lowest in CKD and T2D.Interpretation HF, CKD and T2D carry significant mortality and healthcare burden in combination. Compared with other disease pairs, individuals with CKD and HF had the most severe risk factor profile, prognosis and healthcare utilisation. Service planning, policy and prevention must take into account and monitor data across conditions
    corecore