Search CORE

24 research outputs found

Two-stage hybrid feature selection algorithms for diagnosing erythemato-squamous diseases

Author: Lei J
Liu X
Shi Y
Xie J
Xie W
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 30/05/2013
Field of study

This paper proposes two-stage hybrid feature selection algorithms to build the stable and efficient diagnostic models where a new accuracy measure is introduced to assess the models. The two-stage hybrid algorithms adopt Support Vector Machines (SVM) as a classification tool, and the extended Sequential Forward Search (SFS), Sequential Forward Floating Search (SFFS), and Sequential Backward Floating Search (SBFS), respectively, as search strategies, and the generalized F-score (GF) to evaluate the importance of each feature. The new accuracy measure is used as the criterion to evaluated the performance of a temporary SVM to direct the feature selection algorithms. These hybrid methods combine the advantages of filters and wrappers to select the optimal feature subset from the original feature set to build the stable and efficient classifiers. To get the stable, statistical and optimal classifiers, we conduct 10-fold cross validation experiments in the first stage; then we merge the 10 selected feature subsets of the 10-cross validation experiments, respectively, as the new full feature set to do feature selection in the second stage for each algorithm. We repeat the each hybrid feature selection algorithm in the second stage on the one fold that has got the best result in the first stage. Experimental results show that our proposed two-stage hybrid feature selection algorithms can construct efficient diagnostic models which have got better accuracy than that built by the corresponding hybrid feature selection algorithms without the second stage feature selection procedures. Furthermore our methods have got better classification accuracy when compared with the available algorithms for diagnosing erythemato-squamous diseases

Springer - Publisher Connector

PubMed Central

Brunel University Research Archive

Differential diagnosis of Erythmato-Squamous Diseases using classification and regression tree

Author: Bouraghi H.
Habibikoolaee M.
Jebraeily M.
Langarizadeh M.
Maghooli K.
Shahmoradi L.
Publication venue
Publication date: 01/01/2016
Field of study

Introduction: Differential diagnosis of Erythmato-Squamous Diseases (ESD) is a major challenge in the field of dermatology. The ESD diseases are placed into six different classes. Data mining is the process for detection of hidden patterns. In the case of ESD, data mining help us to predict the diseases. Different algorithms were developed for this purpose. Objective: we aimed to use the Classification and Regression Tree (CART) to predict differential diagnosis of ESD. Methods: we used the Cross Industry Standard Process for Data Mining (CRISP-DM) methodology. For this purpose, the dermatology data set from machine learning repository, UCI was obtained. The Clementine 12.0 software from IBM Company was used for modelling. In order to evaluation of the model we calculate the accuracy, sensitivity and specificity of the model. Results: The proposed model had an accuracy of 94.84 (Standard Deviation: 24.42) in order to correct prediction of the ESD disease. Conclusions: Results indicated that using of this classifier could be useful. But, it would be strongly recommended that the combination of machine learning methods could be more useful in terms of prediction of ESD. Â© 2016 Keivan Maghooli, Mostafa Langarizadeh, Leila Shahmoradi, Mahdi Habibi-koolaee, Mohamad Jebraeily, and Hamid Bouraghi

eprints Iran University of Medical Sciences

PubMed Central

Golestan University of Medical Sciences Repository

A Novel Clinical Decision Support System Using Improved Adaptive Genetic Algorithm for the Assessment of Fetal Well-Being

Author: Asral Bahari Jambek
Hariharan Muthusamy
Siew-Chin Neoh
Sindhu Ravindran
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2015
Field of study

A novel clinical decision support system is proposed in this paper for evaluating the fetal well-being from the cardiotocogram (CTG) dataset through an Improved Adaptive Genetic Algorithm (IAGA) and Extreme Learning Machine (ELM). IAGA employs a new scaling technique (called sigma scaling) to avoid premature convergence and applies adaptive crossover and mutation techniques with masking concepts to enhance population diversity. Also, this search algorithm utilizes three different fitness functions (two single objective fitness functions and multi-objective fitness function) to assess its performance. The classification results unfold that promising classification accuracy of 94% is obtained with an optimal feature subset using IAGA. Also, the classification results are compared with those of other Feature Reduction techniques to substantiate its exhaustive search towards the global optimum. Besides, five other benchmark datasets are used to gauge the strength of the proposed IAGA algorithm

Crossref

Directory of Open Access Journals

PubMed Central

Analysis and Design of Detection for Liver Cancer using Particle Swarm Optimization and Decision Tree

Author: Kaloria Seema
Saini S.S.
Publication venue: Auricle Global Society of Education and Research
Publication date: 01/03/2023
Field of study

Liver cancer is taken as a major cause of death all over the world. According to WHO (World Health Organization) every year 9.6 million peoples are died due to cancer worldwide. It is one of the eighth most leading causes of death in women and fifth in men as reported by the American Cancer Society. The number of death rate due to cancer is projected to increase by45 percent in between 2008 to 2030. The most common cancers are lung, breast, and liver, colorectal. Approximately 7, 82,000 peoples are died due to liver cancer each year. The most efficient way to decrease the death rate cause of liver cancer is to treat the diseases in the initial stage. Early treatment depends upon the early diagnosis, which depends on reliable diagnosis methods. CT imaging is one of the most common and important technique and it acts as an imaging tool for evaluating the patients with intuition of liver cancer. The diagnosis of liver cancer has historically been made manually by a skilled radiologist, who relied on their expertise and personal judgement to reach a conclusion. The main objective of this paper is to develop the automatic methods based on machine learning approach for accurate detection of liver cancer in order to help radiologists in the clinical practice. The paper primary contribution to the process of liver cancer lesion classification and automatic detection for clinical diagnosis. For the purpose of detecting liver cancer lesions, the best approaches based on PSO and DPSO have been given. With the help of the C4.5 decision tree classifier, wavelet-based statistical and morphological features were retrieved and categorised

International Journal on Future Revolution in Computer Science & Communication Engineering

Comparative study on the performance of Au/F-TiO2 photocatalyst synthesized from Zamzam water and distilled water under blue light irradiation

Author: Abu Bakar F.
Zayadi R.A.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Recurring problems of titanium dioxide (TiO2) for needing UV light to be activated and high electron-hole recombination rate limit the application of TiO2 as a proliﬁc photocatalyst. By modifying the morphology and introducing electron trapping species into TiO2, the photocatalytic activity of TiO2 could be improved. Solvents of two different kinds; distilled water and Zamzam water were used in peroxotitanic acid synthesis of TiO2 and the photocatalyst was utilized to degrade Reactive Blue 19 (RB19) dye under blue light irradiation (475 nm) to assess the visible light activity of synthesized TiO2. Fluorine was incorporated to control the morphology while gold nanoparticles (GNP) stabilized by arabic gum were deposited to trap electrons. The morphology of F-TiO2 which appeared to be in ovoid shape was conﬁrmed by Field Emission-Scanning Electron Microscope (FE-SEM) and Transmission Electron Microscope (TEM). Brunauer-Emmett-Teller (BET) surface area and crystallite size estimated from X-ray Diffraction (XRD) data revealed that F-TiO2 modiﬁed using HF was smaller in size and exhibited single anatase phase. The band gap of Au-TiO2 synthesized by distilled and Zamzam water was 2.78 eV and 2.89 eV respectively; shifted from 3.08 eV in blank TiO2. Peroxo Au/F-TiO2 synthesized with the incorporation of arabic gum as GNP stabilizer and HF as ﬂuorine modiﬁer degraded up to 49.23% of RB19 within two hours of reaction. The addition of ﬂuorine and gold demonstrated high ability to enhance visible light activity of TiO2 with distilled water used as solvent displayed higher photocatalytic performance compared to Zamzam water

UTHM Institutional Repository

Future Applications of Electronic-Nose Technologies in Healthcare and Biomedicine

Author: Alphus Dan Wilson
Publication venue: 'IntechOpen'
Publication date: 09/11/2011
Field of study

IntechOpen

Implementing decision tree-based algorithms in medical diagnostic decision support systems

Author: Ghiasi Mohammad M.
Publication venue: Memorial University of Newfoundland
Publication date: 01/05/2020
Field of study

As a branch of healthcare, medical diagnosis can be defined as finding the disease based on the signs and symptoms of the patient. To this end, the required information is gathered from different sources like physical examination, medical history and general information of the patient. Development of smart classification models for medical diagnosis is of great interest amongst the researchers. This is mainly owing to the fact that the machine learning and data mining algorithms are capable of detecting the hidden trends between features of a database. Hence, classifying the medical datasets using smart techniques paves the way to design more efficient medical diagnostic decision support systems. Several databases have been provided in the literature to investigate different aspects of diseases. As an alternative to the available diagnosis tools/methods, this research involves machine learning algorithms called Classification and Regression Tree (CART), Random Forest (RF) and Extremely Randomized Trees or Extra Trees (ET) for the development of classification models that can be implemented in computer-aided diagnosis systems. As a decision tree (DT), CART is fast to create, and it applies to both the quantitative and qualitative data. For classification problems, RF and ET employ a number of weak learners like CART to develop models for classification tasks. We employed Wisconsin Breast Cancer Database (WBCD), Z-Alizadeh Sani dataset for coronary artery disease (CAD) and the databanks gathered in Ghaem Hospital’s dermatology clinic for the response of patients having common and/or plantar warts to the cryotherapy and/or immunotherapy methods. To classify the breast cancer type based on the WBCD, the RF and ET methods were employed. It was found that the developed RF and ET models forecast the WBCD type with 100% accuracy in all cases. To choose the proper treatment approach for warts as well as the CAD diagnosis, the CART methodology was employed. The findings of the error analysis revealed that the proposed CART models for the applications of interest attain the highest precision and no literature model can rival it. The outcome of this study supports the idea that methods like CART, RF and ET not only improve the diagnosis precision, but also reduce the time and expense needed to reach a diagnosis. However, since these strategies are highly sensitive to the quality and quantity of the introduced data, more extensive databases with a greater number of independent parameters might be required for further practical implications of the developed models

Memorial University Research Repository

Κατηγοριοποίηση Δεδομένων Κυτταρολογικής Βιοψίας Δερματικών Παθήσεων

Author: Γιάννου Μελετία
Publication venue
Publication date: 01/01/2014
Field of study

Οι δερματοπάθειες είναι αρκετά συνηθισμένες, εμφανίζουν μεγάλο βαθμό περιπλοκότητας και επηρεάζουν σημαντικά την ποιότητα ζωής των ασθενών. Καθώς αρκετά συχνά διαφορετικές δερματικές ασθένειες- παθήσεις παρουσιάζουν παρόμοια συμπτώματα, είναι αρκετά δύσκολη η αξιόπιστη και έγκυρη διάγνωσή τους. Συνήθως μια βιοψία είναι απαραίτητη για τη διάγνωση. Η συγκεκριμένη διπλωματική εργασία εστιάζει στην ανάπτυξη και στην εφαρμογή μεθόδων αναγνώρισης προτύπων για την κατηγοριοποίηση δεδομένων κυτταρολογικής βιοψίας δερματικών ασθενειών. Συγκεκριμένα, χρησιμοποιήθηκαν δεδομένα κυτταρολογικής βιοψίας από 113 περιπτώσεις από δύο διαφορετικές δερματικές παθήσεις. Η πρώτη κατηγορία είναι η seboreicdermatitis με 61 περιπτώσεις και η δεύτερη κατηγορία είναι η chronicdermatitis με 52 περιπτώσεις. Για κάθε περίπτωση, είναι διαθέσιμα 33 χαρακτηριστικά (π.χ. erythema, scaling, itching, melanin incontinence, acanthosis).Πραγματοποιήθηκε επιλογή χαρακτηριστικών των προτύπων, με χρήση μεθόδων όπως εξαντλητική αναζήτηση και η διαχωριστική ικανότητα των χαρακτηριστικών δοκιμάστηκε με διάφορους ταξινομητές (π.χ. MDC, k-NN, SVM, PNN) και με μεθόδους αξιολόγησης όπως η Leave One Out (LOO). Τέλος, οι αναπτυχθείσες μεθοδολογίες συγκρίθηκαν μεταξύ τους ώστε να βρεθεί ο συνδυασμός μεθόδου επιλογής χαρακτηριστικών και μεθόδου κατηγοριοποίησης που παρέχει τα αποτελέσματα με τη μεγαλύτερη δυνατή ακρίβεια.Dermatoses are quite common, exhibit large degree of complexity and affect the patients` life. As quite often different skin diseases have similar symptoms, a reliable and valid diagnosis is difficult enough. Usually a biopsy is necessary for a diagnosis. This thesis focuses on the development and application of pattern recognition methods forskin diseases cytology biopsydata classification. Specifically, cytological biopsy of 113 cases of two different skin disease data was used. The first category is seboreic dermatitis in 61 cases, and the second group is chronic dermatitis with 52 cases. For each case, 33 features are available (e.g. erythema, scaling, itching, melanin incontinence, acanthosis). Standards featureselection was performed, using techniques such as exhaustive search and the resolution of the characteristics was tested by using different classifiers (e.g. MDC, k-NN, SVM, PNN) and assessment methods such as Leave One Out (LOO). Finally, the developed methods were compared, so that we find the combination of feature selection and classification methodsthat provide results with utmost accuracy

Pergamos : Unified Institutional Repository / Digital Library Platform of the National and Kapodistrian University of Athens

Application of Hierarchical Temporal Memory to Anomaly Detection of Vital Signs for Ambient Assisted Living

Author: Bastaki Benhur Bakhtiari
Publication venue
Publication date
Field of study

This thesis presents the development of a framework for anomaly detection of vital signs for an Ambient Assisted Living (AAL) health monitoring scenario. It is driven by spatiotemporal reasoning of vital signs that Cortical Learning Algorithms (CLA) based on Hierarchal Temporal Memory (HTM) theory undertakes in an AAL health monitoring scenario to detect anomalous data points preceding cardiac arrest. This thesis begins with a literature review on the existing Ambient intelligence (AmI) paradigm, AAL technologies and anomaly detection algorithms used in a health monitoring scenario. The research revealed the significance of the temporal and spatial reasoning in the vital signs monitoring as the spatiotemporal patterns of vital signs provide a basis to detect irregularities in the health status of elderly people. The HTM theory is yet to be adequately deployed in an AAL health monitoring scenario. Hence HTM theory, network and core operations of the CLA are explored. Despite the fact that standard implementation of the HTM theory comprises of a single-level hierarchy, multiple vital signs, specifically the correlation between them is not sufficiently considered. This insufficiency is of particular significance considering that vital signs are correlated in time and space, which are used in the health monitoring applications for diagnosis and prognosis tasks. This research proposes a novel framework consisting of multi-level HTM networks. The lower level consists of four models allocated to the four vital signs, Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP), Heart Rate (HR) and peripheral capillary oxygen saturation (SpO2) in order to learn the spatiotemporal patterns of each vital sign. Additionally, a higher level is introduced to learn spatiotemporal patterns of the anomalous data point detected from the four vital signs. The proposed hierarchical organisation improves the model’s performance by using the semantically improved representation of the sensed data because patterns learned at each level of the hierarchy are reused when combined in novel ways at higher levels. To investigate and evaluate the performance of the proposed framework, several data selection techniques are studied, and accordingly, a total record of 247 elderly patients is extracted from the MIMIC-III clinical database. The performance of the proposed framework is evaluated and compared against several state-of-the-art anomaly detection algorithms using both online and traditional metrics. The proposed framework achieved 83% NAB score which outperforms the HTM and k-NN algorithms by 15%, the HBOS and INFLO SVD by 16% and the k-NN PCA by 21% while the SVM scored 34%. The results prove that multiple HTM networks can achieve better performance when dealing with multi-dimensional data, i.e. data collected from more than one source/sensor

STORE - Staffordshire Online Repository

Recommended from our members

The role of classifiers in feature selection: Number vs nature

Author: Chrysostomou Kyriacos Andrews
Publication venue: Brunel University, School of Information Systems, Computing and Mathematics
Publication date: 01/01/2008
Field of study

This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.Wrapper feature selection approaches are widely used to select a small subset of relevant features from a dataset. However, Wrappers suffer from the fact that they only use a single classifier when selecting the features. The problem of using a single classifier is that each classifier is of a different nature and will have its own biases. This means that each classifier will select different feature subsets. To address this problem, this thesis aims to investigate the effects of using different classifiers for Wrapper feature selection. More specifically, it aims to investigate the effects of using different number of classifiers and classifiers of different nature. This aim is achieved by proposing a new data mining method called Wrapper-based Decision Trees (WDT). The WDT method has the ability to combine multiple classifiers from four different families, including Bayesian Network, Decision Tree, Nearest Neighbour and Support Vector Machine, to select relevant features and visualise the relationships among the selected features using decision trees. Specifically, the WDT method is applied to investigate three research questions of this thesis: (1) the effects of number of classifiers on feature selection results; (2) the effects of nature of classifiers on feature selection results; and (3) which of the two (i.e., number or nature of classifiers) has more of an effect on feature selection results. Two types of user preference datasets derived from Human-Computer Interaction (HCI) are used with WDT to assist in answering these three research questions. The results from the investigation revealed that the number of classifiers and nature of classifiers greatly affect feature selection results. In terms of number of classifiers, the results showed that few classifiers selected many relevant features whereas many classifiers selected few relevant features. In addition, it was found that using three classifiers resulted in highly accurate feature subsets. In terms of nature of classifiers, it was showed that Decision Tree, Bayesian Network and Nearest Neighbour classifiers caused signficant differences in both the number of features selected and the accuracy levels of the features. A comparison of results regarding number of classifiers and nature of classifiers revealed that the former has more of an effect on feature selection than the latter. The thesis makes contributions to three communities: data mining, feature selection, and HCI. For the data mining community, this thesis proposes a new method called WDT which integrates the use of multiple classifiers for feature selection and decision trees to effectively select and visualise the most relevant features within a dataset. For the feature selection community, the results of this thesis have showed that the number of classifiers and nature of classifiers can truly affect the feature selection process. The results and suggestions based on the results can provide useful insight about classifiers when performing feature selection. For the HCI community, this thesis has showed the usefulness of feature selection for identifying a small number of highly relevant features for determining the preferences of different users

Brunel University Research Archive