2 research outputs found
Machine learning-based diagnosis of breast cancer utilizing feature optimization technique
Breast cancer disease is recognized as one of the leading causes of death in women worldwide after lung cancer. Breast cancer refers to a malignant neoplasm that develops from breast cells. Developed and less developed countries both are suffering from this extensive cancer. This cancer can be recuperated if it is detected at an early stage. Many researchers have proposed several machine learning techniques to predict breast cancer with the highest accuracy in the past years. In this research work, the Wisconsin Breast Cancer Dataset (WBCD) has been used as a training set from the UCI machine learning repository to compare the performance of the various machine learning techniques. Different kinds of machine learning classifiers such as support vector machine (SVM), Random Forest (RF), K-nearest neighbors(K-NN), Decision tree (DT), Naïve Bayes (NB), Logistic Regression (LR), AdaBoost (AB), Gradient Boosting (GB), Multi-layer perceptron (MLP), Nearest Cluster Classifier (NCC), and voting classifier (VC) have been used for comparing and analyzing breast cancer into benign and malignant tumors. Various matrices such as error rate, Accuracy, Precision, F1-score, and recall have been implemented to measure the model's performance. Each Algorithm's accuracy has been ascertained for finding the best suitable one. Based on the analysis, the result shows that the Voting classifier has the highest accuracy, which is 98.77%, with the lowest error rate. Finally, a web page is developed using a flask micro-framework integrating the best model using react
XML-GBM lung: An explainable machine learning-based application for the diagnosis of lung cancer
Lung cancer has been the leading cause of cancer-related deaths worldwide. Early detection and diagnosis of lung cancer can greatly improve the chances of survival for patients. Machine learning has been increasingly used in the medical sector for the detection of lung cancer, but the lack of interpretability of these models remains a significant challenge. Explainable machine learning (XML) is a new approach that aims to provide transparency and interpretability for machine learning models. The entire experiment has been performed in the lung cancer dataset obtained from Kaggle. The outcome of the predictive model with ROS (Random Oversampling) class balancing technique is used to comprehend the most relevant clinical features that contributed to the prediction of lung cancer using a machine learning explainable technique termed SHAP (SHapley Additive exPlanation). The results show the robustness of GBM's capacity to detect lung cancer, with 98.76% accuracy, 98.79% precision, 98.76% recall, 98.76% F-Measure, and 0.16% error rate, respectively. Finally, a mobile app is developed incorporating the best model to show the efficacy of our approach