799 research outputs found

    Comparative Analysis of Some Prominent Machine Learning Algorithm for the Prediction of Chronic Kidney Disease

    Get PDF
    Chronic Kidney Disease (CKD) is a disorder against proper function regarding kidneys, as kidneys filter our blood whenever CKD gets worse, our blood receives wastes at a higher level, which results in sickness. It also has a substantial financial problem for families of subjects with a medical issue in  Nigeria. Among the necessary measures that need action concerning the increase of CKD is detecting the disease early and with different data mining  techniques. Data mining is gradually becoming more prevalent nowadays in healthcare, as also in fraud, abuse detection etc. Classification is a more  useful data mining function to handle items in a collection to class or target categories. For obtaining essential information from medical database,  machine learning and statistical analysis can assist in extracting hidden patterns and identify relationships from vast among of data. In this study, we  compared five (5) different models namely: Deep Neural Network (DNN), Artificial Neural Network (ANN), Naïve Bayes (NB), Logistic Regression (LR), and  K-Neighbor Nearest (KNN) to predict CKD on Gashua General Hospital (GGH) dataset. The study achieved an accuracy of 98% for DNN, KNN: 96%, NB:  97%, LR: 96% and ANN: 96%. The best performance was obtained with DNN with the highest accuracy and can be applied in real world application. &nbsp

    Big data analytics for preventive medicine

    Get PDF
    © 2019, Springer-Verlag London Ltd., part of Springer Nature. Medical data is one of the most rewarding and yet most complicated data to analyze. How can healthcare providers use modern data analytics tools and technologies to analyze and create value from complex data? Data analytics, with its promise to efficiently discover valuable pattern by analyzing large amount of unstructured, heterogeneous, non-standard and incomplete healthcare data. It does not only forecast but also helps in decision making and is increasingly noticed as breakthrough in ongoing advancement with the goal is to improve the quality of patient care and reduces the healthcare cost. The aim of this study is to provide a comprehensive and structured overview of extensive research on the advancement of data analytics methods for disease prevention. This review first introduces disease prevention and its challenges followed by traditional prevention methodologies. We summarize state-of-the-art data analytics algorithms used for classification of disease, clustering (unusually high incidence of a particular disease), anomalies detection (detection of disease) and association as well as their respective advantages, drawbacks and guidelines for selection of specific model followed by discussion on recent development and successful application of disease prevention methods. The article concludes with open research challenges and recommendations

    Sensor-based datasets for human activity recognition - a systematic review of literature

    Get PDF
    The research area of ambient assisted living has led to the development of activity recognition systems (ARS) based on human activity recognition (HAR). These systems improve the quality of life and the health care of the elderly and dependent people. However, before making them available to end users, it is necessary to evaluate their performance in recognizing activities of daily living, using data set benchmarks in experimental scenarios. For that reason, the scientific community has developed and provided a huge amount of data sets for HAR. Therefore, identifying which ones to use in the evaluation process and which techniques are the most appropriate for prediction of HAR in a specific context is not a trivial task and is key to further progress in this area of research. This work presents a systematic review of the literature of the sensor-based data sets used to evaluate ARS. On the one hand, an analysis of different variables taken from indexed publications related to this field was performed. The sources of information are journals, proceedings, and books located in specialized databases. The analyzed variables characterize publications by year, database, type, quartile, country of origin, and destination, using scientometrics, which allowed identification of the data set most used by researchers. On the other hand, the descriptive and functional variables were analyzed for each of the identified data sets: occupation, annotation, approach, segmentation, representation, feature selection, balancing and addition of instances, and classifier used for recognition. This paper provides an analysis of the sensor-based data sets used in HAR to date, identifying the most appropriate dataset to evaluate ARS and the classification techniques that generate better results

    A New Method for Solving Supervised Data Classification Problems

    Get PDF
    Supervised data classification is one of the techniques used to extract nontrivial information from data. Classification is a widely used technique in various fields, including data mining, industry, medicine, science, and law. This paper considers a new algorithm for supervised data classification problems associated with the cluster analysis. The mathematical formulations for this algorithm are based on nonsmooth, nonconvex optimization. A new algorithm for solving this optimization problem is utilized. The new algorithm uses a derivative-free technique, with robustness and efficiency. To improve classification performance and efficiency in generating classification model, a new feature selection algorithm based on techniques of convex programming is suggested. Proposed methods are tested on real-world datasets. Results of numerical experiments have been presented which demonstrate the effectiveness of the proposed algorithms
    • …
    corecore