An early warning risk prediction tool (RECAP-V1) for patients diagnosed with COVID-19: the protocol for a statistical analysis plan

Abstract

Background: Since the start of the Covid-19 pandemic efforts have been made to develop early warning risk scores to help clinicians decide which patient is likely to deteriorate and require hospitalisation. The RECAP (Remote COVID Assessment in Primary Care) study investigates the predictive risk of hospitalisation, deterioration, and death of patients with confirmed COVID-19, based on a set of parameters chosen through a Delphi process done by clinicians. The study aims to use rich data collected remotely through the use of electronic data templates integrated in the electronic health systems of a number of general practices across the UK to construct accurate predictive models that will use pre-existing conditions and monitoring data of a patient’s clinical parameters such as blood oxygen saturation to make reliable predictions as to the patient’s risk of hospital admission, deterioration, and death. Objective: We outline the statistical methods to build the prediction model to be used in the prioritisation of patients in the primary care setting. The statistical analysis plan for the RECAP study includes as primary outcome the development and validation of the RECAP-V1 prediction model. Such prediction model will be adapted as a three-category risk score split into red (high risk), amber (medium risk), and green (low risk) for any patient with suspected covid-19. The model will predict risk of deterioration, hospitalisation, and death. Methods: After the data has been collected, we will assess the degree of missingness and use a combination of traditional data imputation using multiple imputation by chained equations, as well as more novel machine learning approaches to impute the missing data for the final analysis. For predictive model development we will use multiple logistic regressions to construct the model on a training dataset, as well as validating the model on an independent dataset. The model will also be applied for multiple different datasets to assess both its performance in different patient groups, and applicability for different methods of data collection. Results: As of 5th of May 2021 we have recruited 2280 patients for the main dataset for model development, as well as a further 1741 patients for the validation dataset. Final analysis will commence as soon as data for 2880 are collected. Conclusions: We believe that the methodology for the development of the RECAP V1 prediction model as well as the risk score will provide clinicians with a statistically robust tool to help prioritise Covid-19 patients. Clinical Trial: Trial registration number: NCT0443504

    Similar works