10 research outputs found

    muSignAl: An algorithm to search for multiple omic signatures with similar predictive performance

    Get PDF
    Multidimensional omic datasets often have correlated features leading to the possibility of discovering multiple biological signatures with similar predictive performance for a phenotype. However, their exploration is limited by low sample size and the exponential nature of the combinatorial search leading to high computational cost. To address these issues, we have developed an algorithm muSignAl (multiple signature algorithm) which selects multiple signatures with similar predictive performance while systematically bypassing the requirement of exploring all the combinations of features. We demonstrated the workflow of this algorithm with an example of proteomics dataset. muSignAl is applicable in various bioinformatics driven explorations, such as understanding the relationship between multiple biological feature sets and phenotypes, and discovery and development of biomarker panels while providing the opportunity of optimising their development cost with the help of equally good multiple signatures. Source code of muSignAl is freely available at https://github.com/ShuklaLab/muSignAl

    Data-driven patient stratification of UK Biobank cohort suggests five endotypes of multimorbidity

    Get PDF
    Multimorbidity generally refers to concurrent occurrence of multiple chronic conditions. These patients are inherently at high risk and often lead a poor quality of life due to delayed treatments. With the emergence of personalized medicine and stratified healthcare, there is a need to stratify patients right at the primary care setting. Here we developed multimorbidity analysis pipeline (MulMorPip), which can stratify patients into multimorbid subgroups or endotypes based on their lifetime disease diagnosis and characterize them based on demographic features and underlying disease–disease interaction networks. By implementing MulMorPip on UK Biobank cohort, we report five distinct molecular subclasses or endotypes of multimorbidity. For each patient, we calculated the existence of broad disease classes defined by Charlson's comorbidity classification using the International Classification of Diseases-10 encoding. We then applied multiple correspondence analysis in 77 524 patients from UK Biobank, who had multimorbidity of more than one disease, which resulted in five multimorbid clusters. We further validated these clusters using machine learning and were able to classify 20% model-blind test set patients with an accuracy of 97% and an average Jaccard similarity of 84%. This was followed by demographic characterization and development of interlinking disease network for each cluster to understand disease–disease interactions. Our identified five endotypes of multimorbidity draw attention to dementia, stroke and paralysis as important drivers of multimorbidity stratification. Inclusion of such patient stratification at the primary care setting can help general practitioners to better observe patients’ multiple chronic conditions, their risk stratification and personalization of treatment strategies

    Simvastatin is associated with superior lipid and glycaemic control to atorvastatin and reduced levels of incident Type 2 diabetes, in men and women, in the UK Biobank

    Get PDF
    INTRODUCTION: Cardiovascular disease (CVD) is the leading cause of mortality in people with Type 2 diabetes mellitus (T2DM). Statins reduce low‐density lipoproteins and positively affect CVD outcomes. Statin type and dose have differential effects on glycaemia and risk of incident T2DM; however, the impact of gender, and of individual drugs within the statin class, remains unclear. AIM: To compare effects of simvastatin and atorvastatin on lipid and glycaemic control in men and women with and without T2DM, and their association with incident T2DM. METHODS: The effect of simvastatin and atorvastatin on lipid and glycaemic control was assessed in the T2DM DiaStrat cohort. Prescribed medications, gender, age, BMI, diabetes duration, blood lipid profile and HbA1c were extracted from Electronic Care Record, and compared in men and women prescribed simvastatin and atorvastatin. Analyses were replicated in the UKBiobank in those with and without T2DM. The association of simvastatin and atorvastatin with incident T2DM was also investigated in the UKBiobank. Cohorts where matched for age, BMI and diabetes duration in men and women, in the UKBioBank analysis, where possible. RESULTS: Simvastatin was associated with better LDL (1.6 ± 0.6 vs 2.1 ± 0.9 mmol/L, p < .01) and total cholesterol (3.6 ± 0.7 vs 4.2 ± 1.0 mmol/L, p < .05), and glycaemic control (62 ± 17 vs 67 ± 19 mmol/mol, p < .059) than atorvastatin specifically in women in the DiaStrat cohort. In the UKBiobank, both men and women prescribed simvastatin had better LDL (Women: 2.6 ± 0.6 vs 2.6 ± 0.7 mmol/L, p < .05; Men: 2.4 ± 0.6 vs 2.4 ± 0.6, p < .01) and glycaemic control (Women:54 ± 14 vs 56 ± 15mmol/mol, p < .05; Men, 54 ± 14 vs 55 ± 15 mmol/mol, p < .01) than those prescribed atorvastatin. Simvastatin was also associated with reduced risk of incident T2DM in both men and women (p < .0001) in the UKBiobank. CONCLUSIONS: Simvastatin is associated with superior lipid and glycaemic control to atorvastatin in those with and without T2DM, and with fewer incident T2DM cases. Given the importance of lipid and glycaemic control in preventing secondary complications of T2DM, these findings may help inform prescribing practices

    Immuno-informatics analysis predicts B and T cell consensus epitopes for designing peptide vaccine against SARS-CoV-2 with 99.82% global population coverage.

    Get PDF
    The current global pandemic due to Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has taken a substantial number of lives across the world. Although few vaccines have been rolled-out, a number of vaccine candidates are still under clinical trials at various pharmaceutical companies and laboratories around the world. Considering the intrinsic nature of viruses in mutating and evolving over time, persistent efforts are needed to develop better vaccine candidates. In this study, various immuno-informatics tools and bioinformatics databases were deployed to derive consensus B-cell and T-cell epitope sequences of SARS-CoV-2 spike glycoprotein. This approach has identified four potential epitopes which have the capability to initiate both antibody and cell-mediated immune responses, are non-allergenic and do not trigger autoimmunity. These peptide sequences were also evaluated to show 99.82% of global population coverage based on the genotypic frequencies of HLA binding alleles for both MHC class-I and class-II and are unique for SARS-CoV-2 isolated from human as a host species. Epitope number 2 alone had a global population coverage of 98.2%. Therefore, we further validated binding and interaction of its constituent T-cell epitopes with their corresponding HLA proteins using molecular docking and molecular dynamics simulation experiments, followed by binding free energy calculations with molecular mechanics Poisson-Boltzmann surface area, essential dynamics analysis and free energy landscape analysis. The immuno-informatics pipeline described and the candidate epitopes discovered herein could have significant impact upon efforts to develop globally effective SARS-CoV-2 vaccines

    Immuno-informatics analysis predicts B and T cell consensus epitopes for designing peptide vaccine against SARS-CoV-2 with 99.82% global population coverage.

    Get PDF
    The current global pandemic due to Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) has taken a substantial number of lives across the world. Although few vaccines have been rolled-out, a number of vaccine candidates are still under clinical trials at various pharmaceutical companies and laboratories around the world. Considering the intrinsic nature of viruses in mutating and evolving over time, persistent efforts are needed to develop better vaccine candidates. In this study, various immuno-informatics tools and bioinformatics databases were deployed to derive consensus B-cell and T-cell epitope sequences of SARS-CoV-2 spike glycoprotein. This approach has identified four potential epitopes which have the capability to initiate both antibody and cell-mediated immune responses, are non-allergenic and do not trigger autoimmunity. These peptide sequences were also evaluated to show 99.82% of global population coverage based on the genotypic frequencies of HLA binding alleles for both MHC class-I and class-II and are unique for SARS-CoV-2 isolated from human as a host species. Epitope number 2 alone had a global population coverage of 98.2%. Therefore, we further validated binding and interaction of its constituent T-cell epitopes with their corresponding HLA proteins using molecular docking and molecular dynamics simulation experiments, followed by binding free energy calculations with molecular mechanics Poisson-Boltzmann surface area, essential dynamics analysis and free energy landscape analysis. The immuno-informatics pipeline described and the candidate epitopes discovered herein could have significant impact upon efforts to develop globally effective SARS-CoV-2 vaccines

    Leukaemia exposure alters the transcriptional profile and function of BCR::ABL1 negative macrophages in the bone marrow niche

    Get PDF
    Macrophages are fundamental cells of the innate immune system that support normal haematopoiesis and play roles in both anti-cancer immunity and tumour progression. Here we use a chimeric mouse model of chronic myeloid leukaemia (CML) and human bone marrow (BM) derived macrophages to study the impact of the dysregulated BM microenvironment on bystander macrophages. Utilising single-cell RNA sequencing (scRNA-seq) of Philadelphia chromosome (Ph) negative macrophages we reveal unique subpopulations of immature macrophages residing in the CML BM microenvironment. CML exposed macrophages separate from their normal counterparts by reduced expression of the surface marker CD36, which significantly reduces clearance of apoptotic cells. We uncover aberrant production of CML-secreted factors, including the immune modulatory protein lactotransferrin (LTF), that suppresses efferocytosis, phagocytosis, and CD36 surface expression in BM macrophages, indicating that the elevated secretion of LTF is, at least partially responsible for the supressed clearance function of Ph- macrophages

    ATRPred: A machine learning based tool for clinical decision making of anti-TNF treatment in rheumatoid arthritis patients

    Get PDF
    Rheumatoid arthritis (RA) is a chronic autoimmune condition, characterised by joint pain, damage and disability, which can be addressed in a high proportion of patients by timely use of targeted biologic treatments. However, the patients, non-responsive to the treatments often suffer from refractoriness of the disease, leading to poor quality of life. Additionally, the biologic treatments are expensive. We obtained plasma samples from N = 144 participants with RA, who were about to commence anti-tumour necrosis factor (anti-TNF) therapy. These samples were sent to Olink Proteomics, Uppsala, Sweden, where proximity extension assays of 4 panels, containing 92 proteins each, were performed. A total of n = 89 samples of patients passed the quality control of anti-TNF treatment response data. The preliminary analysis of plasma protein expression values suggested that the RA population could be divided into two distinct molecular sub-groups (endotypes). However, these broad groups did not predict response to anti-TNF treatment, but were significantly different in terms of gender and their disease activity. We then labelled these patients as responders (n = 60) and non-responders (n = 29) based on the change in disease activity score (DAS) after 6 months of anti-TNF treatment and applied machine learning (ML) with a rigorous 5-fold nested cross-validation scheme to filter 17 proteins that were significantly associated with the treatment response. We have developed a ML based classifier ATRPred (anti-TNF treatment response predictor), which can predict anti-TNF treatment response in RA patients with 81% accuracy, 75% sensitivity and 86% specificity. ATRPred may aid clinicians to direct anti-TNF therapy to patients most likely to receive benefit, thus save cost as well as prevent non-responsive patients from refractory consequences. ATRPred is implemented in R

    Arginine dependency is a therapeutically exploitable vulnerability in chronic myeloid leukaemic stem cells

    Get PDF
    To fuel accelerated proliferation, leukaemic cells undergo metabolic deregulation, which can result in specific nutrient dependencies. Here, we perform an amino acid drop-out screen and apply pre-clinical models of chronic phase chronic myeloid leukaemia (CML) to identify arginine as a nutrient essential for primary human CML cells. Analysis of the Microarray Innovations in Leukaemia (MILE) dataset uncovers reduced ASS1 levels in CML compared to most other leukaemia types. Stable isotope tracing reveals repressed activity of all urea cycle enzymes in patient-derived CML CD34+ cells, rendering them arginine auxotrophic. Thus, arginine deprivation completely blocks proliferation of CML CD34+ cells and induces significantly higher levels of apoptosis when compared to arginine-deprived cell lines. Similarly, primary CML cells, but not normal CD34+ samples, are particularly sensitive to treatment with the arginine-depleting enzyme, BCT-100, which induces apoptosis and reduces clonogenicity. Moreover, BCT-100 is highly efficacious in a patient-derived xenograft model, causing &gt; 90% reduction in the number of human leukaemic stem cells (LSCs). These findings indicate arginine depletion to be a promising and novel strategy to eradicate therapy resistant LSCs
    corecore