22 research outputs found

    Meta-Analysis and Systematic Review of the Application of Machine Learning Classifiers in Biomedical Applications of Infrared Thermography

    Get PDF
    Atypical body temperature values can be an indication of abnormal physiological processes associated with several health conditions. Infrared thermal (IRT) imaging is an innocuous imaging modality capable of capturing the natural thermal radiation emitted by the skin surface, which is connected to physiology-related pathological states. The implementation of artificial intelligence (AI) methods for interpretation of thermal data can be an interesting solution to supply a second opinion to physicians in a diagnostic/therapeutic assessment scenario. The aim of this work was to perform a systematic review and meta-analysis concerning different biomedical thermal applications in conjunction with machine learning strategies. The bibliographic search yielded 68 records for a qualitative synthesis and 34 for quantitative analysis. The results show potential for the implementation of IRT imaging with AI, but more work is needed to retrieve significant features and improve classification metrics.info:eu-repo/semantics/publishedVersio

    Cost-sensitive decision tree ensembles for effective imbalanced classification

    Get PDF
    Real-life datasets are often imbalanced, that is, there are significantly more training samples available for some classes than for others, and consequently the conventional aim of reducing overall classification accuracy is not appropriate when dealing with such problems. Various approaches have been introduced in the literature to deal with imbalanced datasets, and are typically based on oversampling, undersampling or cost-sensitive classification. In this paper, we introduce an effective ensemble of cost-sensitive decision trees for imbalanced classification. Base classifiers are constructed according to a given cost matrix, but are trained on random feature subspaces to ensure sufficient diversity of the ensemble members. We employ an evolutionary algorithm for simultaneous classifier selection and assignment of committee member weights for the fusion process. Our proposed algorithm is evaluated on a variety of benchmark datasets, and is confirmed to lead to improved recognition of the minority class, to be capable of outperforming other state-of-the-art algorithms, and hence to represent a useful and effective approach for dealing with imbalanced datasets

    Detection of Breast Thermograms using Ensemble Classifiers

    Get PDF
    Mortality rate of breast cancer can be reduced by detecting breast cancer in its early stage. Breast thermography plays an important role in early detection of breast cancer, as it can detect tumors when the physiological changes start in the breast prior to structural changes. Computer Aided Detection (CAD) systems improve the diagnostic accuracy by providing a detailed analysis of images, which are not visible to the naked eye. The performance of CAD systems depends on many factors. One of the important factors is the classifier used for classification of breast thermograms. In this paper, we made a comparison of classifier performances using two ensemble classifiers namely Ensemble Bagged Trees and AdaBoost. Spatial and spectral features are used for classification. Ensemble Bagged Trees classifier performed better than AdaBoost in terms of accuracy of classification, but training time required is higher than AdaBoost classifier. An accuracy of 87%, sensitivity of 83% and specificity of 90.6% is obtained using Ensemble Bagged Trees classifier

    Medical infrared thermal image based fatty liver classification using machine and deep learning

    Get PDF
    Non-alcoholic fatty liver disease (NAFLD) causes accumulation of excess fat in the liver affecting people who drink little to no alcohol. Non-alcoholic steatohepatitis (NASH) is an aggressive form of fatty liver disease (inflammation in the liver), may progress to cirrhosis and liver failure. Liver function tests, ultrasound (US) and magnetic resonance imaging (MRI) are used to help diagnose and monitor liver disease or damage. In this study, the feasibility of medical infrared thermal imaging (MITI) in automatic detection of NAFLD was investigated, and 167 MITI images (44 positive) from 32 patients (7 positive) were evaluated using image processing and classification methods. Convolutional neural network (CNN) architectures and texture analysis methods were used in the feature selection phase. After feature selection and binary classification, the highest values from different setups for recall, f-score, specificity, accuracy, and area-under-curve (AUC) were 1.00, 1.00, 0.83, 1.0, 0.94, and 0.92, respectively. The highest values were achieved by CNN based methods on different datasets, however, texture analysis method performed lower. Here, it is shown that some of the CNN architectures have high potential on extracting features from thermal images. Finally, machine and deep learning approaches can be combined in detecting NAFLD using infrared thermal images

    Predicción del cáncer de mama utilizando algoritmos de aprendizaje automático en diferentes conjuntos de datos

    Get PDF
    Breast cancer is a disease that is becoming more and more common day by day, causing emotional and behavioral reactions and having fatal consequences if not detected early. At this point, traditional methods are insufficient, especially in early diagnosis. In this context, this study aimed to predict breast cancer by using machine learning (ML) algorithms on different datasets and to demonstrate the applicability of these algorithms. Algorithm performances were compared on balanced and unbalanced datasets, taking into account the performance metrics obtained in applications on different datasets. In addition, a model based on the Borda Voting method was developed by including the results obtained from four different algorithms (NB, KNN, DT, and RF) in the process. The prediction values obtained from each algorithm were written in different columns on the same excel file and the most repetitive value was accepted as the final result value. The developed model was tested on real data consisting of 60 records and the results were analyzed. When the results were examined, it was seen that higher performance was obtained with the proposed RF model compared to similar studies in the literature. Finally, the prediction results obtained with the developed model revealed the applicability of ML algorithms in the diagnosis of breast cancer.Introducción: El trabajo de investigación “Predicción del cáncer de mama utilizando algoritmos de aprendizaje automático en diferentes conjuntos de datos”, se desarrolló en la Universidad Técnica de Karadeniz en el año 2022. Problema: El cáncer de mama es una enfermedad cada vez más común, día a día, provocando reacciones emocionales y conductuales y con consecuencias fatales si no se detecta a tiempo. En este punto, los métodos tradicionales son insuficientes, sobre todo en el diagnóstico precoz. Este estudio tiene como objetivo predecir el cáncer de mama mediante el uso de algoritmos de aprendizaje automático (ML) en diferentes conjuntos de datos y demuestra la aplicabilidad de estos algoritmos. Metodología: se compararon los rendimientos de los algoritmos en conjuntos de datos equilibrados y no equilibrados, teniendo en cuenta las métricas de rendimiento obtenidas en aplicaciones en diferentes conjuntos de datos. Además, se desarrolló un modelo basado en el método Borda Voting al incluir en el proceso los resultados obtenidos de cuatro algoritmos diferentes (NB, KNN, DT y RF). Originalidad y Limitaciones de la Investigación: En el modelo desarrollado en el marco del estudio se combinaron los valores de los resultados obtenidos de diferentes algoritmos como NB, KNN, DT y RF; el objetivo es aumentar el rendimiento del modelo con este proceso, que se basa en el método Borda Voting. Resultados: Los valores de predicción obtenidos de cada algoritmo se escribieron en diferentes columnas en la misma hoja de cálculo y se aceptó el valor más repetitivo como valor final del resultado. El modelo desarrollado se probó en datos reales que constaban de 60 registros y se analizaron los resultados. Conclusión: Cuando se examinaron los resultados, se observó que se obtuvo un mayor rendimiento con el modelo de RF propuesto en comparación con estudios similares en la literatura.

    Learning from class-imbalanced data: overlap-driven resampling for imbalanced data classification.

    Get PDF
    Classification of imbalanced datasets has attracted substantial research interest over the past years. This is because imbalanced datasets are common in several domains such as health, finance and security, but learning algorithms are generally not designed to handle them. Many existing solutions focus mainly on the class distribution problem. However, a number of reports showed that class overlap had a higher negative impact on the learning process than class imbalance. This thesis thoroughly explores the impact of class overlap on the learning algorithm and demonstrates how elimination of class overlap can effectively improve the classification of imbalanced datasets. Novel undersampling approaches were developed with the main objective of enhancing the presence of minority class instances in the overlapping region. This is achieved by identifying and removing majority class instances potentially residing in such a region. Seven methods under the two different approaches were designed for the task. Extensive experiments were carried out to evaluate the methods on simulated and well-known real-world datasets. Results showed that substantial improvement in the classification accuracy of the minority class was obtained with favourable trade-offs with the majority class accuracy. Moreover, successful application of the methods in predictive diagnostics of diseases with imbalanced records is presented. These novel overlap-based approaches have several advantages over other common resampling methods. First, the undersampling amount is independent of class imbalance and proportional to the degree of overlap. This could effectively address the problem of class overlap while reducing the effect of class imbalance. Second, information loss is minimised as instance elimination is contained within the problematic region. Third, adaptive parameters enable the methods to be generalised across different problems. It is also worth pointing out that these methods provide different trade-offs, which offer more alternatives to real-world users in selecting the best fit solution to the problem

    A model for the detection of breast cancer using machine learning and thermal images in a mobile environment

    Get PDF
    Breast cancer is the most common cancer amongst women and one of the deadliest. Various modalities exist which image the breasts, all with a focus on early detection; thermography is one such method. It is a non-invasive test, which is safe and can be used for a wide variety of breast densities. It functions by analysing thermal patterns captured via an infrared camera of the surface of the breast. Advances in infrared and mobile technology enable this modality to be mobile based; allowing a high degree of portability at a lower cost. Furthermore, as technology has improved, machine learning has played a larger role in medical practices by offering unbiased, consistent, and timely second opinions. Machine learning algorithms are able to classify medical images automatically if offered in the correct format. This study aims to provide a model, which integrates breast cancer detection, thermal imaging, machine learning, and mobile technology. The conceptual model is theorised from three literature studies regarding: identifiable aspects of breast cancer through thermal imaging, the mobile ecosystem, and classification using machine learning algorithms. The model is implemented and evaluated using an experiment designed to classify automatically thermal breast images of the same quality that mobile attachable thermal cameras are able to capture. The experiment contrasts various combinations of segmentation methods, extracted features, and classification algorithms. Promising results were shown in the experiment with a high degree of accuracy obtained. The successful results obtained from the experimentation process validates the feasibility of the model

    A model for the detection of breast cancer using machine learning and thermal images in a mobile environment

    Get PDF
    Breast cancer is the most common cancer amongst women and one of the deadliest. Various modalities exist which image the breasts, all with a focus on early detection; thermography is one such method. It is a non-invasive test, which is safe and can be used for a wide variety of breast densities. It functions by analysing thermal patterns captured via an infrared camera of the surface of the breast. Advances in infrared and mobile technology enable this modality to be mobile based; allowing a high degree of portability at a lower cost. Furthermore, as technology has improved, machine learning has played a larger role in medical practices by offering unbiased, consistent, and timely second opinions. Machine learning algorithms are able to classify medical images automatically if offered in the correct format. This study aims to provide a model, which integrates breast cancer detection, thermal imaging, machine learning, and mobile technology. The conceptual model is theorised from three literature studies regarding: identifiable aspects of breast cancer through thermal imaging, the mobile ecosystem, and classification using machine learning algorithms. The model is implemented and evaluated using an experiment designed to classify automatically thermal breast images of the same quality that mobile attachable thermal cameras are able to capture. The experiment contrasts various combinations of segmentation methods, extracted features, and classification algorithms. Promising results were shown in the experiment with a high degree of accuracy obtained. The successful results obtained from the experimentation process validates the feasibility of the model
    corecore