113 research outputs found

    Mathematical modelling of SARS-CoV-2 variant outbreaks reveals their probability of extinction

    Get PDF
    When a virus spreads, it may mutate into, e.g., vaccine resistant or fast spreading lineages, as was the case for the Danish Cluster-5 mink variant (belonging to the B.1.1.298 lineage), the British B.1.1.7 lineage, and the South African B.1.351 lineage of the SARS-CoV-2 virus. A way to handle such spreads is through a containment strategy, where the population in the affected area is isolated until the spread has been stopped. Under such circumstances, it is important to monitor whether the mutated virus is extinct via massive testing for the virus sub-type. If successful, the strategy will lead to lower and lower numbers of the sub-type, and it will eventually die out. An important question is, for how long time one should wait to be sure the sub-type is extinct? We use a hidden Markov model for infection spread and an approximation of a two stage sampling scheme to infer the probability of extinction. The potential of the method is illustrated via a simulation study. Finally, the model is used to assess the Danish containment strategy when SARS-CoV-2 spread from mink to man during the summer of 2020, including the Cluster-5 sub-type. In order to avoid further spread and mink being a large animal virus reservoir, this situation led to the isolation of seven municipalities in the Northern part of the country, the culling of the entire Danish 17 million large mink population, and a bill to interim ban Danish mink production until the end of 2021

    Genome position specific priors for genomic prediction

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The accuracy of genomic prediction is highly dependent on the size of the reference population. For small populations, including information from other populations could improve this accuracy. The usual strategy is to pool data from different populations; however, this has not proven as successful as hoped for with distantly related breeds. BayesRS is a novel approach to share information across populations for genomic predictions. The approach allows information to be captured even where the phase of SNP alleles and casuative mutation alleles are reversed across populations, or the actual casuative mutation is different between the populations but affects the same gene. Proportions of a four-distribution mixture for SNP effects in segments of fixed size along the genome are derived from one population and set as location specific prior proportions of distributions of SNP effects for the target population. The model was tested using dairy cattle populations of different breeds: 540 Australian Jersey bulls, 2297 Australian Holstein bulls and 5214 Nordic Holstein bulls. The traits studied were protein-, fat- and milk yield. Genotypic data was Illumina 777K SNPs, real or imputed.</p> <p>Results</p> <p>Results showed an increase in accuracy of up to 3.5% for the Jersey population when using BayesRS with a prior derived from Australian Holstein compared to a model without location specific priors. The increase in accuracy was however lower than was achieved when reference populations were combined to estimate SNP effects, except in the case of fat yield. The small size of the Jersey validation set meant that these improvements in accuracy were not significant using a Hotelling-Williams t-test at the 5% level. An increase in accuracy of 1-2% for all traits was observed in the Australian Holstein population when using a prior derived from the Nordic Holstein population compared to using no prior information. These improvements were significant (P<0.05) using the Hotelling Williams t-test for protein- and fat yield.</p> <p>Conclusion</p> <p>For some traits the method might be advantageous compared to pooling of reference data for distantly related populations, but further investigation is needed to confirm the results. For closely related populations the method does not perform better than pooling reference data. However, it does give an increased accuracy compared to analysis based on only one reference population, without an increased computational burden. The approach described here provides a general setup for inclusion of location specific priors: the approach could be used to include biological information in genomic predictions.</p

    Dynamic Risk Prediction of 30-Day Mortality in Patients With Advanced Lung Cancer:Comparing Five Machine Learning Approaches

    Get PDF
    International audiencePURPOSE Administering systemic anticancer treatment (SACT) to patients near death can negatively affect their health-related quality of life. Late SACT administrations should be avoided in these cases. Machine learning techniques could be used to build decision support tools leveraging registry data for clinicians to limit late SACT administration. MATERIALS AND METHODS Patients with advanced lung cancer who were treated at the Department of Oncology, Aalborg University Hospital and died between 2010 and 2019 were included (N = 2,368). Diagnoses, treatments, biochemical data, and histopathologic results were used to train predictive models of 30-day mortality using logistic regression with elastic net penalty, random forest, gradient tree boosting, multilayer perceptron, and long short-term memory network. The importance of the variables and the clinical utility of the models were evaluated. RESULTS The random forest and gradient tree boosting models outperformed other models, whereas the artificial neural network–based models underperformed. Adding summary variables had a modest effect on performance with an increase in average precision from 0.500 to 0.505 and from 0.498 to 0.509 for the gradient tree boosting and random forest models, respectively. Biochemical results alone contained most of the information with a limited degradation of the performances when fitting models with only these variables. The utility analysis showed that by applying a simple threshold to the predicted risk of 30-day mortality, 40% of late SACT administrations could have been prevented at the cost of 2% of patients stopping their treatment 90 days before death. CONCLUSION This study demonstrates the potential of a decision support tool to limit late SACT administration in patients with cancer. Further work is warranted to refine the model, build an easy-to-use prototype, and conduct a prospective validation study

    A clinical proteomics study of exhaled breath condensate and biomarkers for pulmonary embolism

    Get PDF
    Pulmonary Embolism (PE) can be a diagnostic challenge. Current diagnostic markers for PE are unspecific and new diagnostic tools are needed. The air we exhale is a possible new source for biomarkers which can be tapped into by analysing the exhaled breath condensate (EBC). We analysed the EBC from patients with PE and controls to investigate if the EBC is a useful source for new diagnostic biomarkers of PE. We collected and analysed EBC samples from patients with suspected PE and controls matched on age and sex. Patients in whom PE was ruled out after diagnostic work-up were included in the control group to increase the sensitivity and generalizability of the identified markers. EBC samples were collected using an RTube™. The protein composition of the EBCs were analysed using data dependent label-free quantitative nano liquid chromatography-tandem mass spectrometry. EBC samples from 28 patients with confirmed PE, and 49 controls were analysed. A total of 928 EBC proteins were identified in the 77 EBC samples. As expected, a low protein concentration was determined which resulted in many proteins with unmeasurable levels in several samples. The levels of HSPA5, PEBP1 and SFTPA2 were higher and levels of POF1B, EPPK1, PSMA4, ALDOA, and CFL1 were lower in PE compared with controls. In conclusion, the human EBC contained a variety of endogenous proteins and may be a source for new diagnostic markers of PE and other diseases. The project is registered at ClinicalTrials.gov (Identifier NCT04010760).</p
    corecore