8 research outputs found

    A novel approach for the effective prediction of cardiovascular disease using applied artificial intelligence techniques

    Get PDF
    Aims: The objective of this research is to develop an effective cardiovascular disease prediction framework using machine learning techniques and to achieve high accuracy for the prediction of cardiovascular disease. Methods: In this paper, we have utilized machine learning algorithms to predict cardiovascular disease on the basis of symptoms such as chest pain, age and blood pressure. This study incorporated five distinct datasets: Heart UCI, Stroke, Heart Statlog, Framingham and Coronary Heart dataset obtained from online sources. For the implementation of the framework, RapidMiner tool was used. The three‐step approach includes pre‐processing of the dataset, applying feature selection method on pre‐processed dataset and then applying classification methods for prediction of results. We addressed missing values by replacing them with mean, and class imbalance was handled using sample bootstrapping. Various machine learning classifiers were applied out of which random forest with AdaBoost dataset using 10‐fold cross‐validation provided the high accuracy. Results: The proposed model provides the highest accuracy of 99.48% on Heart Statlog, 93.90% on Heart UCI, 96.25% on Stroke dataset, 86% on Framingham dataset and 78.36% on Coronary heart disease dataset, respectively. Conclusions: In conclusion, the results of the study have shown remarkable potential of the proposed framework. By handling imbalance and missing values, a significantly accurate framework has been established that could effectively contribute to the prediction of cardiovascular disease at early stages

    A Machine Learning-Based Framework for Accurate and Early Diagnosis of Liver Diseases: A Comprehensive Study on Feature Selection, Data Imbalance, and Algorithmic Performance

    Get PDF
    The liver is the largest organ of the human body with more than 500 vital functions. In recent decades, a large number of liver patients have been reported with diseases such as cirrhosis, fibrosis, or other liver disorders. There is a need for effective, early, and accurate identification of individuals suffering from such disease so that the person may recover before the disease spreads and becomes fatal. For this, applications of machine learning are playing a significant role. Despite the advancements, existing systems remain inconsistent in performance due to limited feature selection and data imbalance. In this article, we reviewed 58 articles extracted from 5 different electronic repositories published from January 2015 to 2023. After a systematic and protocol-based review, we answered 6 research questions about machine learning algorithms. The identification of effective feature selection techniques, data imbalance management techniques, accurate machine learning algorithms, a list of available data sets with their URLs and characteristics, and feature importance based on usage has been identified for diagnosing liver disease. The reason to select this research question is, in any machine learning framework, the role of dimensionality reduction, data imbalance management, machine learning algorithm with its accuracy, and data itself is very significant. Based on the conducted review, a framework, machine learning-based liver disease diagnosis (MaLLiDD), has been proposed and validated using three datasets. The proposed framework classified liver disorders with 99.56%, 76.56%, and 76.11% accuracy. In conclusion, this article addressed six research questions by identifying effective feature selection techniques, data imbalance management techniques, algorithms, datasets, and feature importance based on usage. It also demonstrated a high accuracy with the framework for early diagnosis, marking a significant advancement

    STREAK: A supervised cell surface receptor abundance estimation strategy for single cell RNA-sequencing data using feature selection and thresholded gene set scoring.

    No full text
    The accurate estimation of cell surface receptor abundance for single cell transcriptomics data is important for the tasks of cell type and phenotype categorization and cell-cell interaction quantification. We previously developed an unsupervised receptor abundance estimation technique named SPECK (Surface Protein abundance Estimation using CKmeans-based clustered thresholding) to address the challenges associated with accurate abundance estimation. In that paper, we concluded that SPECK results in improved concordance with Cellular Indexing of Transcriptomes and Epitopes by Sequencing (CITE-seq) data relative to comparative unsupervised abundance estimation techniques using only single-cell RNA-sequencing (scRNA-seq) data. In this paper, we outline a new supervised receptor abundance estimation method called STREAK (gene Set Testing-based Receptor abundance Estimation using Adjusted distances and cKmeans thresholding) that leverages associations learned from joint scRNA-seq/CITE-seq training data and a thresholded gene set scoring mechanism to estimate receptor abundance for scRNA-seq target data. We evaluate STREAK relative to both unsupervised and supervised receptor abundance estimation techniques using two evaluation approaches on six joint scRNA-seq/CITE-seq datasets that represent four human and mouse tissue types. We conclude that STREAK outperforms other abundance estimation strategies and provides a more biologically interpretable and transparent statistical model

    Usual presentation of an unusual pathogen – Cryptococcus laurentii meningitis: a case report

    No full text
    Infections caused by non-neoformans Cryptococcus spp., including Cryptococcus laurentii, previously thought to be saprophyte and non-pathogenic, have become more common during the past few years, particularly in immunocompromised hosts. To the best of our knowledge here, we present the first case of meningitis in an immunocompromised patient due to a fungus that has never been reported in Pakistan. Our patient, a 40-year old male, who had acquired immunodeficiency syndrome (AIDS) was diagnosed as Cryptococcus laurentti meningitis, with a rare neurological manifestation i.e., cryptococcomas and lepto-meningitis. We presume that exposure to pigeon droppings and acquired immunodeficiency syndrome were the risk factors for this case report. He was treated with liposomal Amphotericin (LAMB) and fluconazole but unfortunately, he rapidly deteriorated and ultimately succumbed to the infection. This case underscores the significance of prompt diagnosis and vigorous treatment of Cryptococcus laurentii meningitis, as well as the need for continued surveillance in immunocompromised individuals

    Increased Aortic Stiffness is Associated with Higher Rates of Stroke, GI-bleeding and Pump Thrombosis in CF-LVAD Patients

    No full text
    Background In the general population, increased aortic stiffness is associated with an increased risk of cardiovascular events. Previous studies have demonstrated an increase in aortic stiffness in patients with a continuous flow left ventricular assist device (CF-LVAD). However, the association between aortic stiffness and common adverse events is unknown. Methods and results Forty patients with a HeartMate II (HMII) (51 $ 11 years; 20% female; 25% ischemic) implanted between January 2011 and September 2017 were included. Two-dimensional transthoracic echocardiograms of the ascending aorta, obtained before HMII placement and early after heart transplant, were analyzed to calculate the aortic stiffness index (AO-SI). The study cohort was divided into patients who had an increased vs decreased AO-SI after LVAD support. A composite outcome of gastrointestinal bleeding, stroke, and pump thrombosis was defined as the primary end point and compared between the groups. While median AO-SI increased significantly after HMII support (AO-SI 4.4-6.5, P = .012), 16 patients had a lower AO-SI. Patients with increased (n = 24) AO-SI had a significantly higher rate of the composite end point (58% vs 12%, odds ratio 9.8, P < .01). Similarly, those with increased AO-SI tended to be on LVAD support for a longer duration, had higher LVAD speed and reduced use of angiotensin-converting enzyme inhibitors or angiotensin II receptor blockers. Conclusions Increased aortic stiffness in patients with a HMII is associated with a significantly higher rates of adverse events. Further studies are warranted to determine the causality between aortic stiffness and adverse events, as well as the effect of neurohormonal modulation on the conduit vasculature in patients with a CF-LVAD

    Same data, different conclusions : radical dispersion in empirical results when independent analysts operationalize and test the same hypothesis

    No full text
    In this crowdsourced initiative, independent analysts used the same dataset to test two hypotheses regarding the effects of scientists’ gender and professional status on verbosity during group meetings. Not only the analytic approach but also the operationalizations of key variables were left unconstrained and up to individual analysts. For instance, analysts could choose to operationalize status as job title, institutional ranking, citation counts, or some combination. To maximize transparency regarding the process by which analytic choices are made, the analysts used a platform we developed called DataExplained to justify both preferred and rejected analytic paths in real time. Analyses lacking sufficient detail, reproducible code, or with statistical errors were excluded, resulting in 29 analyses in the final sample. Researchers reported radically different analyses and dispersed empirical outcomes, in a number of cases obtaining significant effects in opposite directions for the same research question. A Boba multiverse analysis demonstrates that decisions about how to operationalize variables explain variability in outcomes above and beyond statistical choices (e.g., covariates). Subjective researcher decisions play a critical role in driving the reported empirical results, underscoring the need for open data, systematic robustness checks, and transparency regarding both analytic paths taken and not taken. Implications for organizations and leaders, whose decision making relies in part on scientific findings, consulting reports, and internal analyses by data scientists, are discussed

    Same data, different conclusions: Radical dispersion in empirical results when independent analysts operationalize and test the same hypothesis

    Get PDF
    In this crowdsourced initiative, independent analysts used the same dataset to test two hypotheses regarding the effects of scientists’ gender and professional status on verbosity during group meetings. Not only the analytic approach but also the operationalizations of key variables were left unconstrained and up to individual analysts. For instance, analysts could choose to operationalize status as job title, institutional ranking, citation counts, or some combination. To maximize transparency regarding the process by which analytic choices are made, the analysts used a platform we developed called DataExplained to justify both preferred and rejected analytic paths in real time. Analyses lacking sufficient detail, reproducible code, or with statistical errors were excluded, resulting in 29 analyses in the final sample. Researchers reported radically different analyses and dispersed empirical outcomes, in a number of cases obtaining significant effects in opposite directions for the same research question. A Boba multiverse analysis demonstrates that decisions about how to operationalize variables explain variability in outcomes above and beyond statistical choices (e.g., covariates). Subjective researcher decisions play a critical role in driving the reported empirical results, underscoring the need for open data, systematic robustness checks, and transparency regarding both analytic paths taken and not taken. Implications for organizations and leaders, whose decision making relies in part on scientific findings, consulting reports, and internal analyses by data scientists, are discussed
    corecore