24 research outputs found

    Two-stage hybrid feature selection algorithms for diagnosing erythemato-squamous diseases

    Get PDF
    This paper proposes two-stage hybrid feature selection algorithms to build the stable and efficient diagnostic models where a new accuracy measure is introduced to assess the models. The two-stage hybrid algorithms adopt Support Vector Machines (SVM) as a classification tool, and the extended Sequential Forward Search (SFS), Sequential Forward Floating Search (SFFS), and Sequential Backward Floating Search (SBFS), respectively, as search strategies, and the generalized F-score (GF) to evaluate the importance of each feature. The new accuracy measure is used as the criterion to evaluated the performance of a temporary SVM to direct the feature selection algorithms. These hybrid methods combine the advantages of filters and wrappers to select the optimal feature subset from the original feature set to build the stable and efficient classifiers. To get the stable, statistical and optimal classifiers, we conduct 10-fold cross validation experiments in the first stage; then we merge the 10 selected feature subsets of the 10-cross validation experiments, respectively, as the new full feature set to do feature selection in the second stage for each algorithm. We repeat the each hybrid feature selection algorithm in the second stage on the one fold that has got the best result in the first stage. Experimental results show that our proposed two-stage hybrid feature selection algorithms can construct efficient diagnostic models which have got better accuracy than that built by the corresponding hybrid feature selection algorithms without the second stage feature selection procedures. Furthermore our methods have got better classification accuracy when compared with the available algorithms for diagnosing erythemato-squamous diseases

    Differential diagnosis of Erythmato-Squamous Diseases using classification and regression tree

    Get PDF
    Introduction: Differential diagnosis of Erythmato-Squamous Diseases (ESD) is a major challenge in the field of dermatology. The ESD diseases are placed into six different classes. Data mining is the process for detection of hidden patterns. In the case of ESD, data mining help us to predict the diseases. Different algorithms were developed for this purpose. Objective: we aimed to use the Classification and Regression Tree (CART) to predict differential diagnosis of ESD. Methods: we used the Cross Industry Standard Process for Data Mining (CRISP-DM) methodology. For this purpose, the dermatology data set from machine learning repository, UCI was obtained. The Clementine 12.0 software from IBM Company was used for modelling. In order to evaluation of the model we calculate the accuracy, sensitivity and specificity of the model. Results: The proposed model had an accuracy of 94.84 (Standard Deviation: 24.42) in order to correct prediction of the ESD disease. Conclusions: Results indicated that using of this classifier could be useful. But, it would be strongly recommended that the combination of machine learning methods could be more useful in terms of prediction of ESD. © 2016 Keivan Maghooli, Mostafa Langarizadeh, Leila Shahmoradi, Mahdi Habibi-koolaee, Mohamad Jebraeily, and Hamid Bouraghi

    A Novel Clinical Decision Support System Using Improved Adaptive Genetic Algorithm for the Assessment of Fetal Well-Being

    Get PDF
    A novel clinical decision support system is proposed in this paper for evaluating the fetal well-being from the cardiotocogram (CTG) dataset through an Improved Adaptive Genetic Algorithm (IAGA) and Extreme Learning Machine (ELM). IAGA employs a new scaling technique (called sigma scaling) to avoid premature convergence and applies adaptive crossover and mutation techniques with masking concepts to enhance population diversity. Also, this search algorithm utilizes three different fitness functions (two single objective fitness functions and multi-objective fitness function) to assess its performance. The classification results unfold that promising classification accuracy of 94% is obtained with an optimal feature subset using IAGA. Also, the classification results are compared with those of other Feature Reduction techniques to substantiate its exhaustive search towards the global optimum. Besides, five other benchmark datasets are used to gauge the strength of the proposed IAGA algorithm

    Analysis and Design of Detection for Liver Cancer using Particle Swarm Optimization and Decision Tree

    Get PDF
    Liver cancer is taken as a major cause of death all over the world. According to WHO (World Health Organization) every year 9.6 million peoples are died due to cancer worldwide. It is one of the eighth most leading causes of death in women and fifth in men as reported by the American Cancer Society. The number of death rate due to cancer is projected to increase by45 percent in between 2008 to 2030. The most common cancers are lung, breast, and liver, colorectal. Approximately 7, 82,000 peoples are died due to liver cancer each year. The most efficient way to decrease the death rate cause of liver cancer is to treat the diseases in the initial stage. Early treatment depends upon the early diagnosis, which depends on reliable diagnosis methods. CT imaging is one of the most common and important technique and it acts as an imaging tool for evaluating the patients with intuition of liver cancer. The diagnosis of liver cancer has historically been made manually by a skilled radiologist, who relied on their expertise and personal judgement to reach a conclusion. The main objective of this paper is to develop the automatic methods based on machine learning approach for accurate detection of liver cancer in order to help radiologists in the clinical practice. The paper primary contribution to the process of liver cancer lesion classification and automatic detection for clinical diagnosis. For the purpose of detecting liver cancer lesions, the best approaches based on PSO and DPSO have been given. With the help of the C4.5 decision tree classifier, wavelet-based statistical and morphological features were retrieved and categorised

    Comparative study on the performance of Au/F-TiO2 photocatalyst synthesized from Zamzam water and distilled water under blue light irradiation

    Get PDF
    Recurring problems of titanium dioxide (TiO2) for needing UV light to be activated and high electron-hole recombination rate limit the application of TiO2 as a prolific photocatalyst. By modifying the morphology and introducing electron trapping species into TiO2, the photocatalytic activity of TiO2 could be improved. Solvents of two different kinds; distilled water and Zamzam water were used in peroxotitanic acid synthesis of TiO2 and the photocatalyst was utilized to degrade Reactive Blue 19 (RB19) dye under blue light irradiation (475 nm) to assess the visible light activity of synthesized TiO2. Fluorine was incorporated to control the morphology while gold nanoparticles (GNP) stabilized by arabic gum were deposited to trap electrons. The morphology of F-TiO2 which appeared to be in ovoid shape was confirmed by Field Emission-Scanning Electron Microscope (FE-SEM) and Transmission Electron Microscope (TEM). Brunauer-Emmett-Teller (BET) surface area and crystallite size estimated from X-ray Diffraction (XRD) data revealed that F-TiO2 modified using HF was smaller in size and exhibited single anatase phase. The band gap of Au-TiO2 synthesized by distilled and Zamzam water was 2.78 eV and 2.89 eV respectively; shifted from 3.08 eV in blank TiO2. Peroxo Au/F-TiO2 synthesized with the incorporation of arabic gum as GNP stabilizer and HF as fluorine modifier degraded up to 49.23% of RB19 within two hours of reaction. The addition of fluorine and gold demonstrated high ability to enhance visible light activity of TiO2 with distilled water used as solvent displayed higher photocatalytic performance compared to Zamzam water

    Implementing decision tree-based algorithms in medical diagnostic decision support systems

    Get PDF
    As a branch of healthcare, medical diagnosis can be defined as finding the disease based on the signs and symptoms of the patient. To this end, the required information is gathered from different sources like physical examination, medical history and general information of the patient. Development of smart classification models for medical diagnosis is of great interest amongst the researchers. This is mainly owing to the fact that the machine learning and data mining algorithms are capable of detecting the hidden trends between features of a database. Hence, classifying the medical datasets using smart techniques paves the way to design more efficient medical diagnostic decision support systems. Several databases have been provided in the literature to investigate different aspects of diseases. As an alternative to the available diagnosis tools/methods, this research involves machine learning algorithms called Classification and Regression Tree (CART), Random Forest (RF) and Extremely Randomized Trees or Extra Trees (ET) for the development of classification models that can be implemented in computer-aided diagnosis systems. As a decision tree (DT), CART is fast to create, and it applies to both the quantitative and qualitative data. For classification problems, RF and ET employ a number of weak learners like CART to develop models for classification tasks. We employed Wisconsin Breast Cancer Database (WBCD), Z-Alizadeh Sani dataset for coronary artery disease (CAD) and the databanks gathered in Ghaem Hospital’s dermatology clinic for the response of patients having common and/or plantar warts to the cryotherapy and/or immunotherapy methods. To classify the breast cancer type based on the WBCD, the RF and ET methods were employed. It was found that the developed RF and ET models forecast the WBCD type with 100% accuracy in all cases. To choose the proper treatment approach for warts as well as the CAD diagnosis, the CART methodology was employed. The findings of the error analysis revealed that the proposed CART models for the applications of interest attain the highest precision and no literature model can rival it. The outcome of this study supports the idea that methods like CART, RF and ET not only improve the diagnosis precision, but also reduce the time and expense needed to reach a diagnosis. However, since these strategies are highly sensitive to the quality and quantity of the introduced data, more extensive databases with a greater number of independent parameters might be required for further practical implications of the developed models

    Κατηγοριοποίηση Δεδομένων Κυτταρολογικής Βιοψίας Δερματικών Παθήσεων

    Get PDF
    Οι δερματοπάθειες είναι αρκετά συνηθισμένες, εμφανίζουν μεγάλο βαθμό περιπλοκότητας και επηρεάζουν σημαντικά την ποιότητα ζωής των ασθενών. Καθώς αρκετά συχνά διαφορετικές δερματικές ασθένειες- παθήσεις παρουσιάζουν παρόμοια συμπτώματα, είναι αρκετά δύσκολη η αξιόπιστη και έγκυρη διάγνωσή τους. Συνήθως μια βιοψία είναι απαραίτητη για τη διάγνωση. Η συγκεκριμένη διπλωματική εργασία εστιάζει στην ανάπτυξη και στην εφαρμογή μεθόδων αναγνώρισης προτύπων για την κατηγοριοποίηση δεδομένων κυτταρολογικής βιοψίας δερματικών ασθενειών. Συγκεκριμένα, χρησιμοποιήθηκαν δεδομένα κυτταρολογικής βιοψίας από 113 περιπτώσεις από δύο διαφορετικές δερματικές παθήσεις. Η πρώτη κατηγορία είναι η seboreicdermatitis με 61 περιπτώσεις και η δεύτερη κατηγορία είναι η chronicdermatitis με 52 περιπτώσεις. Για κάθε περίπτωση, είναι διαθέσιμα 33 χαρακτηριστικά (π.χ. erythema, scaling, itching, melanin incontinence, acanthosis).Πραγματοποιήθηκε επιλογή χαρακτηριστικών των προτύπων, με χρήση μεθόδων όπως εξαντλητική αναζήτηση και η διαχωριστική ικανότητα των χαρακτηριστικών δοκιμάστηκε με διάφορους ταξινομητές (π.χ. MDC, k-NN, SVM, PNN) και με μεθόδους αξιολόγησης όπως η Leave One Out (LOO). Τέλος, οι αναπτυχθείσες μεθοδολογίες συγκρίθηκαν μεταξύ τους ώστε να βρεθεί ο συνδυασμός μεθόδου επιλογής χαρακτηριστικών και μεθόδου κατηγοριοποίησης που παρέχει τα αποτελέσματα με τη μεγαλύτερη δυνατή ακρίβεια.Dermatoses are quite common, exhibit large degree of complexity and affect the patients` life. As quite often different skin diseases have similar symptoms, a reliable and valid diagnosis is difficult enough. Usually a biopsy is necessary for a diagnosis. This thesis focuses on the development and application of pattern recognition methods forskin diseases cytology biopsydata classification. Specifically, cytological biopsy of 113 cases of two different skin disease data was used. The first category is seboreic dermatitis in 61 cases, and the second group is chronic dermatitis with 52 cases. For each case, 33 features are available (e.g. erythema, scaling, itching, melanin incontinence, acanthosis). Standards featureselection was performed, using techniques such as exhaustive search and the resolution of the characteristics was tested by using different classifiers (e.g. MDC, k-NN, SVM, PNN) and assessment methods such as Leave One Out (LOO). Finally, the developed methods were compared, so that we find the combination of feature selection and classification methodsthat provide results with utmost accuracy

    Application of Hierarchical Temporal Memory to Anomaly Detection of Vital Signs for Ambient Assisted Living

    Get PDF
    This thesis presents the development of a framework for anomaly detection of vital signs for an Ambient Assisted Living (AAL) health monitoring scenario. It is driven by spatiotemporal reasoning of vital signs that Cortical Learning Algorithms (CLA) based on Hierarchal Temporal Memory (HTM) theory undertakes in an AAL health monitoring scenario to detect anomalous data points preceding cardiac arrest. This thesis begins with a literature review on the existing Ambient intelligence (AmI) paradigm, AAL technologies and anomaly detection algorithms used in a health monitoring scenario. The research revealed the significance of the temporal and spatial reasoning in the vital signs monitoring as the spatiotemporal patterns of vital signs provide a basis to detect irregularities in the health status of elderly people. The HTM theory is yet to be adequately deployed in an AAL health monitoring scenario. Hence HTM theory, network and core operations of the CLA are explored. Despite the fact that standard implementation of the HTM theory comprises of a single-level hierarchy, multiple vital signs, specifically the correlation between them is not sufficiently considered. This insufficiency is of particular significance considering that vital signs are correlated in time and space, which are used in the health monitoring applications for diagnosis and prognosis tasks. This research proposes a novel framework consisting of multi-level HTM networks. The lower level consists of four models allocated to the four vital signs, Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP), Heart Rate (HR) and peripheral capillary oxygen saturation (SpO2) in order to learn the spatiotemporal patterns of each vital sign. Additionally, a higher level is introduced to learn spatiotemporal patterns of the anomalous data point detected from the four vital signs. The proposed hierarchical organisation improves the model’s performance by using the semantically improved representation of the sensed data because patterns learned at each level of the hierarchy are reused when combined in novel ways at higher levels. To investigate and evaluate the performance of the proposed framework, several data selection techniques are studied, and accordingly, a total record of 247 elderly patients is extracted from the MIMIC-III clinical database. The performance of the proposed framework is evaluated and compared against several state-of-the-art anomaly detection algorithms using both online and traditional metrics. The proposed framework achieved 83% NAB score which outperforms the HTM and k-NN algorithms by 15%, the HBOS and INFLO SVD by 16% and the k-NN PCA by 21% while the SVM scored 34%. The results prove that multiple HTM networks can achieve better performance when dealing with multi-dimensional data, i.e. data collected from more than one source/sensor
    corecore