3 research outputs found

    Bivariate network meta-analysis for surrogate endpoint evaluation

    Get PDF
    Surrogate endpoints are very important in regulatory decision-making in healthcare, in particular if they can be measured early compared to the long-term final clinical outcome and act as good predictors of clinical benefit. Bivariate meta-analysis methods can be used to evaluate surrogate endpoints and to predict the treatment effect on the final outcome from the treatment effect measured on a surrogate endpoint. However, candidate surrogate endpoints are often imperfect, and the level of association between the treatment effects on the surrogate and final outcomes may vary between treatments. This imposes a limitation on methods which do not differentiate between the treatments. We develop bivariate network meta-analysis (bvNMA) methods which combine data on treatment effects on the surrogate and final outcomes, from trials investigating multiple treatment contrasts. The bvNMA methods estimate the effects on both outcomes for all treatment contrasts individually in a single analysis. At the same time, they allow us to model the trial-level surrogacy patterns within each treatment contrast and treatment-level surrogacy, thus enabling predictions of the treatment effect on the final outcome either for a new study in a new population or for a new treatment. Modelling assumptions about the betweenstudies heterogeneity and the network consistency, and their impact on predictions, are investigated using an illustrative example in advanced colorectal cancer and in a simulation study. When the strength of the surrogate relationships varies across treatment contrasts, bvNMA has the advantage of identifying treatment comparisons for which surrogacy holds, thus leading to better predictions

    The peppermint breath test: A benchmarking protocol for breath sampling and analysis using GC-MS

    No full text
    © 2021 The Author(s). Published by IOP Publishing Ltd Exhaled breath contains hundreds of volatile organic compounds (VOCs) which offer the potential for diagnosing and monitoring a wide range of diseases. As the breath research field has grown, sampling and analytical practices have become highly varied between groups. Standardisation would allow meta-analyses of data from multiple studies and greater confidence in published results. Washout of VOCs from ingestion into the blood and subsequently breath could provide data for an initial assessment of inter-group performance. The Peppermint Initiative has been formed to address this task of standardisation. In the current study we aimed to generate initial benchmark values for thermal desorption-gas chromatography-mass spectrometry (TD-GC-MS) analysis of breath samples containing peppermint-derived VOCs using data from three independent European research groups. Initially, headspace analysis of peppermint oil capsules was performed to determine compounds of interest. Ten healthy participants were recruited by each three groups across Europe. The standard Peppermint protocol was followed. In brief, each participant provided a baseline breath sample prior to taking a peppermint capsule, with further samples collected at 60, 90, 165, 285 and 360 min following ingestion. Sampling and analytical protocols were different for each group, in line with their usual practice. Samples were analysed by TD-GC-MS and benchmarking values determined for the time taken for detected peppermint VOCs to return to baseline values. Sixteen compounds were identified in the capsule headspace, and all were confirmed in breath following ingestion of the peppermint capsules. Additionally, 2,3-dehydro-1,8-cineole was uniquely found in the breath samples, with a washout profile that suggested it was a product of metabolism of peppermint compounds. Five compounds (α-pinene, β-pinene, eucalyptol, menthol and menthone) were quantified by all three groups. Differences were observed between the groups, particularly for the recovery of menthone and menthol. The average time taken for VOCs to return to baseline was selected as the benchmark and were 377, 423, 533, 418 and 336 min for α-pinene, β-pinene, eucalyptol, menthone and menthol respectively. We have presented an initial set of easy-to-measure benchmarking values for assessing the performance of TD-GC-MS systems for the analysis of VOCs in breath. These values will be updated when more groups provide additional data

    Equalization of four cardiovascular risk algorithms after systematic recalibration: individual-participant meta-analysis of 86 prospective studies.

    No full text
    Aims: There is debate about the optimum algorithm for cardiovascular disease (CVD) risk estimation. We conducted head-to-head comparisons of four algorithms recommended by primary prevention guidelines, before and after 'recalibration', a method that adapts risk algorithms to take account of differences in the risk characteristics of the populations being studied. Methods and results: Using individual-participant data on 360 737 participants without CVD at baseline in 86 prospective studies from 22 countries, we compared the Framingham risk score (FRS), Systematic COronary Risk Evaluation (SCORE), pooled cohort equations (PCE), and Reynolds risk score (RRS). We calculated measures of risk discrimination and calibration, and modelled clinical implications of initiating statin therapy in people judged to be at 'high' 10 year CVD risk. Original risk algorithms were recalibrated using the risk factor profile and CVD incidence of target populations. The four algorithms had similar risk discrimination. Before recalibration, FRS, SCORE, and PCE over-predicted CVD risk on average by 10%, 52%, and 41%, respectively, whereas RRS under-predicted by 10%. Original versions of algorithms classified 29-39% of individuals aged ≥40 years as high risk. By contrast, recalibration reduced this proportion to 22-24% for every algorithm. We estimated that to prevent one CVD event, it would be necessary to initiate statin therapy in 44-51 such individuals using original algorithms, in contrast to 37-39 individuals with recalibrated algorithms. Conclusion: Before recalibration, the clinical performance of four widely used CVD risk algorithms varied substantially. By contrast, simple recalibration nearly equalized their performance and improved modelled targeting of preventive action to clinical need
    corecore