7,090 research outputs found

    Comparative analysis of classification algorithms for chronic kidney disease diagnosis

    Get PDF
    Chronic Kidney Disease (CKD) is one of the leading cause of death contributed by other illnesses such as diabetes, hypertension, lupus, anemia or weak bones that lead to bone fractures. Early prediction of CKD is important in order to contain the disesase. However, instead of predicting the severity of CKD, the objective of this paper is to predict the diagnosis of CKD based on the symptoms or attributes observed in a particular case, whether the stage is acute or chronic. To achieve this, a classification model is proposed to label stage of severity for kidney diseases patients. The experiments then investigated the performance of the proposed classification model based on eight supervised classification algorithms, which are ZeroR, Rule Induction, Support Vector Machine, NaĂŻve Bayes, Decision Tree, Decision Stump, k-Nearest Neighbour, and Classification via Regression. The performance of the all classifiers is evaluated based on accuracy, precision, and recall. The results showed that the regression classifier perform best in the kidney diagnostic procedure

    Accuracy Prediction of Classification and Forecast using WEKA Tool by Example: Chronic Kidney Disease

    Get PDF
    Introduction: Renal failure due to kidney disease can be avoided with early diagnosis. Disease markers able to anticipate renal failure at an asymptomatic stage and thus, the onset of chronic kidney disease in a human subject can be predicted using, for example, data mining techniques. The present study focuses on building a decision tree and predicting the accuracy of machine learning classifiers to forecast kidney disease using the CKD dataset. Methods: The dataset in the current study includes information from 400 samples (instances) and 25 attributes retrieved from the freely available UCI machine learning repository. The accuracy of prediction of classifiers was conducted with the WEKA software tool using 14 algorithms. The performance evaluation of the models was done with accuracy, precision, recall and F-measure. Results: The lowest performance was given by Stacking and Vote classifiers. Moderate performance evaluation was observed for Logistic, NaĂŻve Bayes, Random Tree, and Voted Perceptron. The best performances were observed for Random Forest, Multilayer Perceptron, Logit Boost, J48, Decision Table, Bagging, PART, and SMO. The following two were statistically significant: Random Forest and Multilayer Perceptron . Conclusion: The decision tree could successively depict the contribution of serum creatine, pedal edema, diabetes, hemoglobin, and specific gravity of blood in tracing the prevalence of CKD in a prospective patient

    Establishment of a integrative multi-omics expression database CKDdb in the context of chronic kidney disease (CKD)

    Get PDF
    Complex human traits such as chronic kidney disease (CKD) are a major health and financial burden in modern societies. Currently, the description of the CKD onset and progression at the molecular level is still not fully understood. Meanwhile, the prolific use of high-throughput omic technologies in disease biomarker discovery studies yielded a vast amount of disjointed data that cannot be easily collated. Therefore, we aimed to develop a molecule-centric database featuring CKD-related experiments from available literature publications. We established the Chronic Kidney Disease database CKDdb, an integrated and clustered information resource that covers multi-omic studies (microRNAs, genomics, peptidomics, proteomics and metabolomics) of CKD and related disorders by performing literature data mining and manual curation. The CKDdb database contains differential expression data from 49395 molecule entries (redundant), of which 16885 are unique molecules (non-redundant) from 377 manually curated studies of 230 publications. This database was intentionally built to allow disease pathway analysis through a systems approach in order to yield biological meaning by integrating all existing information and therefore has the potential to unravel and gain an in-depth understanding of the key molecular events that modulate CKD pathogenesis

    Mining heterogeneous information graph for health status classification

    Get PDF
    In the medical domain, there exists a large volume of data from multiple sources such as electronic health records, general health examination results, and surveys. The data contain useful information reflecting people’s health and provides great opportunities for studies to improve the quality of healthcare. However, how to mine these data effectively and efficiently still remains a critical challenge. In this paper, we propose an innovative classification model for knowledge discovery from patients’ personal health repositories. By based on analytics of massive data in the National Health and Nutrition Examination Survey, the study builds a classification model to classify patients’health status and reveal the specific disease potentially suffered by the patient. This paper makes significant contributions to the advancement of knowledge in data mining with an innovative classification model specifically crafted for domain-based data. Moreover, this research contributes to the healthcare community by providing a deep understanding of people’s health with accessibility to the patterns in various observations

    IMPROVING THE PERFORMANCE OF SUPPORT VECTOR MACHINE WITH FORWARD SELECTION FOR PREDICTION OF CHRONIC KIDNEY DISEASE

    Get PDF
    Chronic kidney disease is a disorder that affects the kidneys and arises due to various factors. Chronic kidney disease, usually develops slowly and is chronic. For prevention and control, proper treatment is needed, so that detection of this disease can play a very important role. This study aims to determine the level of accuracy in predicting chronic kidney disease through SVM based on forward selection and to determine the performance of Feature Selection which is applied to the SVM method in solving problems in chronic kidney disease. This research was conducted an experiment on the SVM method using various kinds of kernels and it was seen that SVM with the dot kernel was 98.50% with AUC 1,000 which was superior to the polynominal kernel and RBF. However, when the experiment was carried out again by applying FS to SVM, it was found that SVM+FS with the RBF kernel outperformed the other kernels by 99.75% with AUC 1,000. So it can be concluded that the Forward Selection of SVM has succeeded in improving its performance, especially in this case, namely the prediction of chronic kidney diseas
    • …
    corecore