Search CORE

496 research outputs found

Algorithms Implemented for Cancer Gene Searching and Classifications

Author: Al-Rajab Murad
Lu Joan
Publication venue
Publication date
Field of study

Understanding the gene expression is an important factor to cancer diagnosis. One target of this understanding is implementing cancer gene search and classification methods. However, cancer gene search and classification is a challenge in that there is no an obvious exact algorithm that can be implemented individually for various cancer cells. In this paper a research is con-ducted through the most common top ranked algorithms implemented for cancer gene search and classification, and how they are implemented to reach a better performance. The paper will distinguish algorithms implemented for Bio image analysis for cancer cells and algorithms implemented based on DNA array data. The main purpose of this paper is to explore a road map towards presenting the most current algorithms implemented for cancer gene search and classification

University of Huddersfield Repository

Computational models and approaches for lung cancer diagnosis

Author: Azzawi Hasseeb
Publication venue: Deakin University, Faculty of Science, Engineering and Built Environment, School of Information Technology
Publication date: 01/10/2019
Field of study

The success of treatment of patients with cancer depends on establishing an accurate diagnosis. To this end, the aim of this study is to developed novel lung cancer diagnostic models. New algorithms are proposed to analyse the biological data and extract knowledge that assists in achieving accurate diagnosis results

Deakin Research Online

An Optimisation-Driven Prediction Method for Automated Diagnosis and Prognosis

Author: Caraffini Fabio
Milani Alfredo
Santucci Valentino
Publication venue: 'MDPI AG'
Publication date: 23/10/2019
Field of study

open access articleThis article presents a novel hybrid classification paradigm for medical diagnoses and prognoses prediction. The core mechanism of the proposed method relies on a centroid classification algorithm whose logic is exploited to formulate the classification task as a real-valued optimisation problem. A novel metaheuristic combining the algorithmic structure of Swarm Intelligence optimisers with the probabilistic search models of Estimation of Distribution Algorithms is designed to optimise such a problem, thus leading to high-accuracy predictions. This method is tested over 11 medical datasets and compared against 14 cherry-picked classification algorithms. Results show that the proposed approach is competitive and superior to the state-of-the-art on several occasions

Multidisciplinary Digital Publishing Institute

De Montfort University Open Research Archive

A Review of Missing Data Handling Techniques for Machine Learning

Author: Babu Sena Paul
Luke Oluwaseye Joel
Wesley Doorsamy
Publication venue: Talent under Liberty in Technology (TULTECH) Registrikood: 80569671
Publication date: 08/09/2022
Field of study

Real-world data are commonly known to contain missing values, and consequently affect the performance of most machine learning algorithms adversely when employed on such datasets. Precisely, missing values are among the various challenges occurring in real-world data. Since the accuracy and efficiency of machine learning models depend on the quality of the data used, there is a need for data analysts and researchers working with data, to seek out some relevant techniques that can be used to handle these inescapable missing values. This paper reviews some state-of-art practices obtained in the literature for handling missing data problems for machine learning. It lists some evaluation metrics used in measuring the performance of these techniques. This study tries to put these techniques and evaluation metrics in clear terms, followed by some mathematical equations. Furthermore, some recommendations to consider when dealing with missing data handling techniques were provided

International Journal of Innovative Technology and Interdisciplinary Sciences (IJITIS)

A particle swarm based hybrid system for imbalanced medical data sampling

Author: Xu Liang
Yang Pengyi
Zhang Zili
Zhou Bing B
Zomaya Albert Y
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

BackgroundMedical and biological data are commonly with small sample size, missing values, and most importantly, imbalanced class distribution. In this study we propose a particle swarm based hybrid system for remedying the class imbalance problem in medical and biological data mining. This hybrid system combines the particle swarm optimization (PSO) algorithm with multiple classifiers and evaluation metrics for evaluation fusion. Samples from the majority class are ranked using multiple objectives according to their merit in compensating the class imbalance, and then combined with the minority class to form a balanced dataset.ResultsOne important finding of this study is that different classifiers and metrics often provide different evaluation results. Nevertheless, the proposed hybrid system demonstrates consistent improvements over several alternative methods with three different metrics. The sampling results also demonstrate good generalization on different types of classification algorithms, indicating the advantage of information fusion applied in the hybrid system.ConclusionThe experimental results demonstrate that unlike many currently available methods which often perform unevenly with different datasets the proposed hybrid system has a better generalization property which alleviates the method-data dependency problem. From the biological perspective, the system provides indication for further investigation of the highly ranked samples, which may result in the discovery of new conditions or disease subtypes.<br /

Deakin Research Online

Crossref

Springer - Publisher Connector

PubMed Central

Systematic Review on Missing Data Imputation Techniques with Machine Learning Algorithms for Healthcare

Author: Abidin Nadzurah Zainal
Ismail Amelia Ritahani
Maen Mhd Khaled
Publication venue: 'Universitas Muhammadiyah Yogyakarta'
Publication date: 05/02/2022
Field of study

Missing data is one of the most common issues encountered in data cleaning process especially when dealing with medical dataset. A real collected dataset is prone to be incomplete, inconsistent, noisy and redundant due to potential reasons such as human errors, instrumental failures, and adverse death. Therefore, to accurately deal with incomplete data, a sophisticated algorithm is proposed to impute those missing values. Many machine learning algorithms have been applied to impute missing data with plausible values. However, among all machine learning imputation algorithms, KNN algorithm has been widely adopted as an imputation for missing data due to its robustness and simplicity and it is also a promising method to outperform other machine learning methods. This paper provides a comprehensive review of different imputation techniques used to replace the missing data. The goal of the review paper is to bring specific attention to potential improvements to existing methods and provide readers with a better grasps of imputation technique trends

The International Islamic University Malaysia Repository

Leading & Enlightening Journal UMY

A Review of Particle Swarm Optimization: Feature Selection, Classification and Hybridizations

Author: Madan Madhaw Shrivas, Amit Saxena, Leeladhar Kumar Gavel
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 30/04/2015
Field of study

Particle swarm optimization (PSO) is a recently grown, popular, evolutionary and conceptually simple but efficient algorithm which belongs to swarm intelligence category. This paper outlines basic concepts and reviews PSO based techniques with their applications to classification and feature selection along with some of the hybridized applications of PSO with similar other techniques. DOI: 10.17762/ijritcc2321-8169.16041

International Journal on Recent and Innovation Trends in Computing and Communication

Ensemble of heterogeneous flexible neural trees using multiobjective genetic programming

Author: Abdelwahab
Ajith Abraham
Alcala-Fdez
Alcalá
Ammar
Aouiti
Basheer
Bouaziz
Bouaziz
Bouaziz
Bouaziz
Burianek
Chen
Chen
Chen
Chen
Chen
Chen
Chen
Chen
Chen
Cho
Cordón
Das
Das
Deb
Deb
Dhahri
Dhahri
Eiben
Fahlman
Ferreira
Foresti
Gacto
Guo
Hastie
Haykin
Holm
Jang
Jia
Jin
Jin
Juang
Juang
Kar
Karaboga
Kasabov
Kasabov
Kennedy
Koppen
Kuncheva
Li
Lichman
L’Ecuyer
Maren
Matsumoto
Micheloni
Miranian
Musilek
Nadal
Novosad
Ojha
Ojha
Oltean
Pan
Peng
Polikar
Potter
Qu
Rajini
Rani
Riolo
Rustagi
Salustowicz
Sethi
Shan
Shou-ning
Stanley
Sánchez
Tkáč
Van den Bergh
Varun Kumar Ojha
Václav Snášel
Wang
Wang
Wang
Weiss
Wolpert
Wongseree
Wu
Yaghini
Yang
Yang
Yang
Yao
Yao
Yilmaz
Zhang
Zhang
Zhou
Zhou
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Machine learning algorithms are inherently multiobjective in nature, where approximation error minimization and model's complexity simplification are two conflicting objectives. We proposed a multiobjective genetic programming (MOGP) for creating a heterogeneous flexible neural tree (HFNT), tree-like flexible feedforward neural network model. The functional heterogeneity in neural tree nodes was introduced to capture a better insight of data during learning because each input in a dataset possess different features. MOGP guided an initial HFNT population towards Pareto-optimal solutions, where the final population was used for making an ensemble system. A diversity index measure along with approximation error and complexity was introduced to maintain diversity among the candidates in the population. Hence, the ensemble was created by using accurate, structurally simple, and diverse candidates from MOGP final population. Differential evolution algorithm was applied to fine-tune the underlying parameters of the selected candidates. A comprehensive test over classification, regression, and time-series datasets proved the efficiency of the proposed algorithm over other available prediction methods. Moreover, the heterogeneous creation of HFNT proved to be efficient in making ensemble system from the final population

arXiv.org e-Print Archive

Central Archive at the University of Reading

Crossref

DSpace at VSB Technical University of Ostrava

One-Class Classification: Taxonomy of Study and Review of Techniques

Author: Khan Shehroz S.
Madden Michael G.
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 29/11/2013
Field of study

One-class classification (OCC) algorithms aim to build classification models when the negative class is either absent, poorly sampled or not well defined. This unique situation constrains the learning of efficient classifiers by defining class boundary just with the knowledge of positive class. The OCC problem has been considered and applied under many research themes, such as outlier/novelty detection and concept learning. In this paper we present a unified view of the general problem of OCC by presenting a taxonomy of study for OCC problems, which is based on the availability of training data, algorithms used and the application domains applied. We further delve into each of the categories of the proposed taxonomy and present a comprehensive literature review of the OCC algorithms, techniques and methodologies with a focus on their significance, limitations and applications. We conclude our paper by discussing some open research problems in the field of OCC and present our vision for future research.Comment: 24 pages + 11 pages of references, 8 figure

arXiv.org e-Print Archive

Access to Research at National University of Ireland, Galway