9 research outputs found

    Mapping Chronic Disease Prevalence based on Medication Use and Socio-demographic variables: an Application of LASSO in healthcare in the Netherlands

    Get PDF
    BACKGROUND: Policymakers generally lack sufficiently detailed health information to develop localized health policy plans. Chronic disease prevalence mapping is difficult as accurate direct sources are often lacking. Improvement is possible by adding extra information such as medication use and demographic information to identify disease. The aim of the current study was to obtain small geographic area prevalence estimates for four common chronic diseases by modelling based on medication use and socio-economic variables and next to investigate regional patterns of disease. METHODS: Administrative hospital records and general practitioner registry data were linked to medication use and socio-economic characteristics. The training set (n = 707,021) contained GP diagnosis and/or hospital admission diagnosis as the standard for disease prevalence. For the entire Dutch population (n = 16,777,888), all information except GP diagnosis and hospital admission was available. LASSO regression models for binary outcomes were used to select variables strongly associated with disease. Dutch municipality (non-)standardized prevalence estimates for stroke, CHD, COPD and diabetes were then based on averages of predicted probabilities for each individual inhabitant. RESULTS: Adding medication use data as a predictor substantially improved model performance. Estimates at the municipality level performed best for diabetes with a weighted percentage error (WPE) of 6.8%, and worst for COPD (WPE 14.5%)Disease prevalence showed clear regional patterns, also after standardization for age. CONCLUSION: Adding medication use as an indicator of disease prevalence next to socio-economic variables substantially improved estimates at the municipality level. The resulting individual disease probabilities could be aggregated into any desired regional level and provide a useful tool to identify regional patterns and inform local policy. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12889-021-10754-4

    Physical activity guidelines and cardiovascular risk in children: a cross-sectional analysis to determine whether 60 minutes is enough

    Get PDF
    Background Physical activity reduces cardiovascular mortality and morbidity. The World Health Organisation (WHO) recommends children engage in 60 min daily moderate-to-vigorous physical activity (MVPA). The effect of compliance with this recommendation on childhood cardiovascular risk has not been empirically tested. To evaluate whether achieving recommendations results in reduced composite-cardiovascular risk score (CCVR) in children, and to examine if vigorous PA (VPA) has independent risk-reduction effects. Methods PA was measured using accelerometry in 182 children (9–11 years). Subjects were grouped according to achievement of 60 min daily MVPA (active) or not (inactive). CCVR was calculated (sum of z-scores: DXA body fat %, blood pressure, VO2peak, flow mediated dilation, left ventricular diastolic function; CVR score ≥1SD indicated ‘higher risk’). The cohort was further split into quintiles for VPA and odds ratios (OR) calculated for each quintile. Results Active children (92 (53 boys)) undertook more MVPA (38 ± 11 min, P  0.05). CCVR in the lowest VPA quintile was significantly greater than the highest quintile (3.9 ± 0.6, P < 0.05), and the OR was 4.7 times higher. Conclusion Achievement of current guidelines has positive effects on body composition and cardiorespiratory fitness, but not CCVR. Vigorous physical activity appears to have beneficial effects on CVD risk, independent of moderate PA, implying a more prescriptive approach may be needed for future VPA guidelines

    Mapping chronic disease prevalence based on medication use and socio-demographic variables: an application of LASSO on administrative data sources in healthcare in the Netherlands.

    Get PDF
    BACKGROUND: Policymakers generally lack sufficiently detailed health information to develop localized health policy plans. Chronic disease prevalence mapping is difficult as accurate direct sources are often lacking. Improvement is possible by adding extra information such as medication use and demographic information to identify disease. The aim of the current study was to obtain small geographic area prevalence estimates for four common chronic diseases by modelling based on medication use and socio-economic variables and next to investigate regional patterns of disease. METHODS: Administrative hospital records and general practitioner registry data were linked to medication use and socio-economic characteristics. The training set (n = 707,021) contained GP diagnosis and/or hospital admission diagnosis as the standard for disease prevalence. For the entire Dutch population (n = 16,777,888), all information except GP diagnosis and hospital admission was available. LASSO regression models for binary outcomes were used to select variables strongly associated with disease. Dutch municipality (non-)standardized prevalence estimates for stroke, CHD, COPD and diabetes were then based on averages of predicted probabilities for each individual inhabitant. RESULTS: Adding medication use data as a predictor substantially improved model performance. Estimates at the municipality level performed best for diabetes with a weighted percentage error (WPE) of 6.8%, and worst for COPD (WPE 14.5%)Disease prevalence showed clear regional patterns, also after standardization for age. CONCLUSION: Adding medication use as an indicator of disease prevalence next to socio-economic variables substantially improved estimates at the municipality level. The resulting individual disease probabilities could be aggregated into any desired regional level and provide a useful tool to identify regional patterns and inform local policy. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12889-021-10754-4

    Estimating disease prevalence from drug utilization data using the Random Forest algorithm.

    Get PDF
    Aggregated claims data on medication are often used as a proxy for the prevalence of diseases, especially chronic diseases. However, linkage between medication and diagnosis tend to be theory based and not very precise. Modelling disease probability at an individual level using individual level data may yield more accurate results. Individual probabilities of having a certain chronic disease were estimated using the Random Forest (RF) algorithm. A training set was created from a general practitioners database of 276 723 cases that included diagnosis and claims data on medication. Model performance for 29 chronic diseases was evaluated using Receiver-Operator Curves, by measuring the Area Under the Curve (AUC). The diseases for which model performance was best were Parkinson's disease (AUC = .89, 95% CI = .77-1.00), diabetes (AUC = .87, 95% CI = .85-.90), osteoporosis (AUC = .87, 95% CI = .81-.92) and heart failure (AUC = .81, 95% CI = .74-.88). Five other diseases had an AUC >.75: asthma, chronic enteritis, COPD, epilepsy and HIV/AIDS. For 16 of 17 diseases tested, the medication categories used in theory-based algorithms were also identified by our method, however the RF models included a broader range of medications as important predictors. Data on medication use can be a useful predictor when estimating the prevalence of several chronic diseases. To improve the estimates, for a broader range of chronic diseases, research should use better training data, include more details concerning dosages and duration of prescriptions, and add related predictors like hospitalizations
    corecore