Enhancing Classification Performance through FeatureBoostThyro: A Comparative Study of Machine Learning Algorithms and Feature Selection

Abstract

Early-stage prediction of a disease is an important and challenging task. The application of machine learning techniques is playing an important role in this era. Thyroid is one of the chronic endocrine diseases, and approximately 42 million people in India are affected by this disease. This paper presents a comprehensive investigation into the enhancement of classification performance through the novel ‘FeatureBoostThyro’ (FBT) model. The study evaluates various machine learning algorithms, including stochastic gradient descent (SGD), K nearest neighbor (KNN), logistic regression (LR), naive bayes (NB), and support vector machine (SVM), in conjunction with diverse feature selection methods. The research systematically explores the impact of feature selection techniques such as information gain, relief F, chi-square, gini index, forward selection, backward selection, recursive feature elimination, and LASSO on model performance across the chosen algorithms. The analysis reveals notable variations in performance metrics, including accuracy, precision, recall, and F1-score, providing valuable insights into the interplay between algorithm and feature selection. One main contribution of this research is the introduction of the FBT model, which consistently outperforms other models across various feature selection methods, making it a promising tool for addressing complex classification tasks. The findings contribute to a broader understanding of model selection and optimization in machine learning applications. The proposed model undergoes evaluation using two distinct datasets: the primary dataset acquired from Lata Mangeshkar Hospital in Nagpur and the secondary dataset obtained from the UCI dataset

    Similar works