336 research outputs found
Dynamic Prediction of Survival in Cystic Fibrosis: A Landmarking Analysis Using UK Patient Registry Data.
BACKGROUND: Cystic fibrosis (CF) is an inherited, chronic, progressive condition affecting around 10,000 individuals in the United Kingdom and over 70,000 worldwide. Survival in CF has improved considerably over recent decades, and it is important to provide up-to-date information on patient prognosis. METHODS: The UK Cystic Fibrosis Registry is a secure centralized database, which collects annual data on almost all CF patients in the United Kingdom. Data from 43,592 annual records from 2005 to 2015 on 6181 individuals were used to develop a dynamic survival prediction model that provides personalized estimates of survival probabilities given a patient's current health status using 16 predictors. We developed the model using the landmarking approach, giving predicted survival curves up to 10 years from 18 to 50 years of age. We compared several models using cross-validation. RESULTS: The final model has good discrimination (C-indexes: 0.873, 0.843, and 0.804 for 2-, 5-, and 10-year survival prediction) and low prediction error (Brier scores: 0.036, 0.076, and 0.133). It identifies individuals at low and high risk of short- and long-term mortality based on their current status. For patients 20 years of age during 2013-2015, for example, over 80% had a greater than 95% probability of 2-year survival and 40% were predicted to survive 10 years or more. CONCLUSIONS: Dynamic personalized prediction models can guide treatment decisions and provide personalized information for patients. Our application illustrates the utility of the landmarking approach for making the best use of longitudinal and survival data and shows how models can be defined and compared in terms of predictive performance.US NIH Grant K25 HL12595
Integrating latent classes in the Bayesian shared parameter joint model of longitudinal and survival outcomes
Cystic fibrosis is a chronic lung disease requiring frequent lung-function monitoring to track acute respiratory events (pulmonary exacerbations). The association between lung-function trajectory and time-to-first exacerbation can be characterized using joint longitudinal-survival modeling. Joint models specified through the shared parameter framework quantify the strength of association between such outcomes but do not incorporate latent sub-populations reflective of heterogeneous disease progression. Conversely, latent class joint models explicitly postulate the existence of sub-populations but do not directly quantify the strength of association. Furthermore, choosing the optimal number of classes using established metrics like deviance information criterion is computationally intensive in complex models. To overcome these limitations, we integrate latent classes in the shared parameter joint model through a fully Bayesian approach. To choose the optimal number of classes, we construct a mixture model assuming more latent classes than present in the data, thereby asymptotically “emptying” superfluous latent classes, provided the Dirichlet prior on class proportions is sufficiently uninformative. Model properties are evaluated in simulation studies. Application to data from the US Cystic Fibrosis Registry supports the existence of three sub-populations corresponding to lung-function trajectories with high initial forced expiratory volume in 1 s (FEV1), rapid FEV1 decline, and low but steady FEV1 progression. The association between FEV1 and hazard of exacerbation was negative in each class, but magnitude varied
Efficiently analyzing large patient registries with Bayesian joint models for longitudinal and time-to-event data
The joint modeling of longitudinal and time-to-event outcomes has become a popular tool infollow-up studies. However, fitting Bayesian joint models to large datasets, such as patientregistries, can require extended computing times. To speed up sampling, we divided a patient registry dataset into subsamples, analyzed them in parallel, and combined the resultingMarkov chain Monte Carlo draws into a consensus distribution. We used a simulation studyto investigate how different consensus strategies perform with joint models. In particular,we compared grouping all draws together with using equal- and precision-weighted averages.We considered scenarios reflecting different sample sizes, numbers of data splits, and processor characteristics. Parallelization of the sampling process substantially decreased the timerequired to run the model. We found that the weighted-average consensus distributions forlarge sample sizes were nearly identical to the target posterior distribution. The proposedalgorithm has been made available in an R package for joint models, JMbayes2. This workwas motivated by the clinical interest in investigating the association between ppFEV1, acommonly measured marker of lung function, and the risk of lung transplant or death, using data from the US Cystic Fibrosis Foundation Patient Registry (35,153 individuals with372,366 years of cumulative follow-up). Splitting the registry into five subsamples resultedin an 85% decrease in computing time, from 9.22 to 1.39 hours. Splitting the data and finding a consensus distribution by precision-weighted averaging proved to be a computationallyefficient and robust approach to handling large datasets under the joint modeling framework
Efficiently analyzing large patient registries with Bayesian joint models for longitudinal and time-to-event data
The joint modeling of longitudinal and time-to-event outcomes has become a popular tool infollow-up studies. However, fitting Bayesian joint models to large datasets, such as patientregistries, can require extended computing times. To speed up sampling, we divided a patient registry dataset into subsamples, analyzed them in parallel, and combined the resultingMarkov chain Monte Carlo draws into a consensus distribution. We used a simulation studyto investigate how different consensus strategies perform with joint models. In particular,we compared grouping all draws together with using equal- and precision-weighted averages.We considered scenarios reflecting different sample sizes, numbers of data splits, and processor characteristics. Parallelization of the sampling process substantially decreased the timerequired to run the model. We found that the weighted-average consensus distributions forlarge sample sizes were nearly identical to the target posterior distribution. The proposedalgorithm has been made available in an R package for joint models, JMbayes2. This workwas motivated by the clinical interest in investigating the association between ppFEV1, acommonly measured marker of lung function, and the risk of lung transplant or death, using data from the US Cystic Fibrosis Foundation Patient Registry (35,153 individuals with372,366 years of cumulative follow-up). Splitting the registry into five subsamples resultedin an 85% decrease in computing time, from 9.22 to 1.39 hours. Splitting the data and finding a consensus distribution by precision-weighted averaging proved to be a computationallyefficient and robust approach to handling large datasets under the joint modeling framework
A joint model for (un)bounded longitudinal markers, competing risks, and recurrent events using patient registry data
Joint models for longitudinal and survival data have become a popular framework for studying the association between repeatedly measured biomarkers and clinical events. Nevertheless, addressing complex survival data structures, especially handling both recurrent and competing event times within a single model, remains a challenge. This causes important information to be disregarded. Moreover, existing frameworks rely on a Gaussian distribution for continuous markers, which may be unsuitable for bounded biomarkers, resulting in biased estimates of associations. To address these limitations, we propose a Bayesian shared-parameter joint model that simultaneously accommodates multiple (possibly bounded) longitudinal markers, a recurrent event process, and competing risks. We use the beta distribution to model responses bounded within any interval (a,b) without sacrificing the interpretability of the association. The model offers various forms of association, discontinuous risk intervals, and both gap and calendar timescales. A simulation study shows that it outperforms simpler joint models. We utilize the US Cystic Fibrosis Foundation Patient Registry to study the associations between changes in lung function and body mass index, and the risk of recurrent pulmonary exacerbations, while accounting for the competing risks of death and lung transplantation. Our efficient implementation allows fast fitting of the model despite its complexity and the large sample size from this patient registry. Our comprehensive approach provides new insights into cystic fibrosis disease progression by quantifying the relationship between the most important clinical markers and events more precisely than has been possible before. The model implementation is available in the R package JMbayes2
A joint model for (un)bounded longitudinal markers, competing risks, and recurrent events using patient registry data
Joint models for longitudinal and survival data have become a popular framework for studying the association between repeatedly measured biomarkers and clinical events. Nevertheless, addressing complex survival data structures, especially handling both recurrent and competing event times within a single model, remains a challenge. This causes important information to be disregarded. Moreover, existing frameworks rely on a Gaussian distribution for continuous markers, which may be unsuitable for bounded biomarkers, resulting in biased estimates of associations. To address these limitations, we propose a Bayesian shared-parameter joint model that simultaneously accommodates multiple (possibly bounded) longitudinal markers, a recurrent event process, and competing risks. We use the beta distribution to model responses bounded within any interval (a,b) without sacrificing the interpretability of the association. The model offers various forms of association, discontinuous risk intervals, and both gap and calendar timescales. A simulation study shows that it outperforms simpler joint models. We utilize the US Cystic Fibrosis Foundation Patient Registry to study the associations between changes in lung function and body mass index, and the risk of recurrent pulmonary exacerbations, while accounting for the competing risks of death and lung transplantation. Our efficient implementation allows fast fitting of the model despite its complexity and the large sample size from this patient registry. Our comprehensive approach provides new insights into cystic fibrosis disease progression by quantifying the relationship between the most important clinical markers and events more precisely than has been possible before. The model implementation is available in the R package JMbayes2
Lung function in children with cystic fibrosis in the USA and UK: a comparative longitudinal analysis of national registry data
RATIONALE: A previous analysis found significantly higher lung function in the US paediatric cystic fibrosis (CF) population compared with the UK with this difference apparently decreasing in adolescence and adulthood. However, the cross-sectional nature of the study makes it hard to interpret these results. OBJECTIVES: To compare longitudinal trajectories of lung function in children with CF between the USA and UK and to explore reasons for any differences. METHODS: We used mixed effects regression analysis to model lung function trajectories in the study populations. Using descriptive statistics, we compared early growth and nutrition (height, weight, body mass index), infections (Pseudomonas aeruginosa, Staphylococcus aureus) and treatments (rhDnase, hypertonic saline, inhaled antibiotics). RESULTS: We included 9463 children from the USA and 3055 children from the UK with homozygous F508del genotype. Lung function was higher in the USA than in the UK when first measured at age six and remained higher throughout childhood. We did not find important differences in early growth and nutrition, or P.aeruginosa infection. Prescription of rhDNase and hypertonic saline was more common in the USA. Inhaled antibiotics were prescribed at similar levels in both countries, but Tobramycin was prescribed more in the USA and colistin in the UK. S. aureus infection was more common in the USA than the UK. CONCLUSIONS: Children with CF and homozygous F508del genotype in the USA had better lung function than UK children. These differences do not appear to be explained by early growth or nutrition, but differences in the use of early treatments need further investigation
- …