898 research outputs found
Efficient Feature Selection and ML Algorithm for Accurate Diagnostics
Machine learning algorithms have been deployed in numerous optimization, prediction and classification problems. This has endeared them for application in fields such as computer networks and medical diagnosis. Although these machine learning algorithms achieve convincing results in these fields, they face numerous challenges when deployed on imbalanced dataset. Consequently, these algorithms are often biased towards majority class, hence unable to generalize the learning process. In addition, they are unable to effectively deal with high-dimensional datasets. Moreover, the utilization of conventional feature selection techniques from a dataset based on attribute significance render them ineffective for majority of the diagnosis applications. In this paper, feature selection is executed using the more effective Neighbour Components Analysis (NCA). During the classification process, an ensemble classifier comprising of K-Nearest Neighbours (KNN), Naive Bayes (NB), Decision Tree (DT) and Support Vector Machine (SVM) is built, trained and tested. Finally, cross validation is carried out to evaluate the developed ensemble model. The results shows that the proposed classifier has the best performance in terms of precision, recall, F-measure and classification accuracy
A survey on computational intelligence approaches for predictive modeling in prostate cancer
Predictive modeling in medicine involves the development of computational models which are capable of analysing large amounts of data in order to predict healthcare outcomes for individual patients. Computational intelligence approaches are suitable when the data to be modelled are too complex forconventional statistical techniques to process quickly and eciently. These advanced approaches are based on mathematical models that have been especially developed for dealing with the uncertainty and imprecision which is typically found in clinical and biological datasets. This paper provides a survey of recent work on computational intelligence approaches that have been applied to prostate cancer predictive modeling, and considers the challenges which need to be addressed. In particular, the paper considers a broad definition of computational intelligence which includes evolutionary algorithms (also known asmetaheuristic optimisation, nature inspired optimisation algorithms), Artificial Neural Networks, Deep Learning, Fuzzy based approaches, and hybrids of these,as well as Bayesian based approaches, and Markov models. Metaheuristic optimisation approaches, such as the Ant Colony Optimisation, Particle Swarm Optimisation, and Artificial Immune Network have been utilised for optimising the performance of prostate cancer predictive models, and the suitability of these approaches are discussed
CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison
Large, labeled datasets have driven deep learning methods to achieve
expert-level performance on a variety of medical imaging tasks. We present
CheXpert, a large dataset that contains 224,316 chest radiographs of 65,240
patients. We design a labeler to automatically detect the presence of 14
observations in radiology reports, capturing uncertainties inherent in
radiograph interpretation. We investigate different approaches to using the
uncertainty labels for training convolutional neural networks that output the
probability of these observations given the available frontal and lateral
radiographs. On a validation set of 200 chest radiographic studies which were
manually annotated by 3 board-certified radiologists, we find that different
uncertainty approaches are useful for different pathologies. We then evaluate
our best model on a test set composed of 500 chest radiographic studies
annotated by a consensus of 5 board-certified radiologists, and compare the
performance of our model to that of 3 additional radiologists in the detection
of 5 selected pathologies. On Cardiomegaly, Edema, and Pleural Effusion, the
model ROC and PR curves lie above all 3 radiologist operating points. We
release the dataset to the public as a standard benchmark to evaluate
performance of chest radiograph interpretation models.
The dataset is freely available at
https://stanfordmlgroup.github.io/competitions/chexpert .Comment: Published in AAAI 201
Lung Disease Classification using Dense Alex Net Framework with Contrast Normalisation and Five-Fold Geometric Transformation
lung disease is one of the leading causes of death worldwide. Most cases of lung diseases are found when the disease is in an advanced stage. Therefore, the development of systems and methods that begin to diagnose quickly and prematurely plays a vital role in today's world. Currently, in detecting differences in lung cancer, an accurate diagnosis of cancer types is needed. However, improving the accuracy and reducing training time of the diagnosis remains a challenge. In this study, we have developed an automated classification scheme for lung cancer presented in histopathological images using a dense Alex Net framework. The proposed methodology carries out several phases includes pre-processing, contrast normalization, data augmentation and classification. Initially, the pre-processing step is accompanied to diminish the noisy contents present in the image. Contrast normalization has been explored to maintain the same illumination factor among histopathological lung images next to pre-processing. Afterwards, data augmentation phase has been carried out to enhance the dataset further to avoid over-fitting problems. Finally, the Dense Alex Net is utilized for classification that comprises five convolutional layers, one multi-scale convolution layer, and three fully connected layers. In evaluation experiments, the proposed approach was trained using our original database to provide rich and meaningful features. The accuracy attained by the proposed methodology is93%, which is maximum compared with the existing algorithm
Identifying Lung Cancer Using CT Scan Images Based On Artificial Intelligence
Lung cancer appears to be the common reason behind the death of human beings at some stage on the planet. Early detection of lung cancers can growth the possibility of survival amongst human beings. The preferred 5-years survival rate for lung most cancers sufferers will increase from 16% to 50% if the disease is detected in time. Although computerized tomography (CT) is frequently more efficient than X-ray. However, the problem regarded to merge way to time constraints in detecting this lung cancer concerning the numerous diagnosing strategies used. Hence, a lung cancer detection system that usage of image processing is hired to categorize lung cancer in CT images. In image processing procedures, procedures like image pre-processing, segmentation, and have extraction are mentioned intimately. This paper is pointing to set off the extra precise comes approximately through making use of distinctive improve and department procedures. In this proposal paper, the proposed method is built in some filter and segmentation that pre-process the data and classify the trained data. After the classification and trained WONN-MLB method is used to reduce the time complexity of finding result. Therefore, our research goal is to get the maximum result of lung cancer detection
Malaria Parasitic Detection using a New Deep Boosted and Ensemble Learning Framework
Malaria is a potentially fatal plasmodium parasite injected by female
anopheles mosquitoes that infect red blood cells and millions worldwide yearly.
However, specialists' manual screening in clinical practice is laborious and
prone to error. Therefore, a novel Deep Boosted and Ensemble Learning (DBEL)
framework, comprising the stacking of new Boosted-BR-STM convolutional neural
networks (CNN) and the ensemble ML classifiers, is developed to screen malaria
parasite images. The proposed Boosted-BR-STM is based on a new
dilated-convolutional block-based split transform merge (STM) and feature-map
Squeezing-Boosting (SB) ideas. Moreover, the new STM block uses regional and
boundary operations to learn the malaria parasite's homogeneity, heterogeneity,
and boundary with patterns. Furthermore, the diverse boosted channels are
attained by employing Transfer Learning-based new feature-map SB in STM blocks
at the abstract, medium, and conclusion levels to learn minute intensity and
texture variation of the parasitic pattern. The proposed DBEL framework
implicates the stacking of prominent and diverse boosted channels and provides
the generated discriminative features of the developed Boosted-BR-STM to the
ensemble of ML classifiers. The proposed framework improves the discrimination
ability and generalization of ensemble learning. Moreover, the deep feature
spaces of the developed Boosted-BR-STM and customized CNNs are fed into ML
classifiers for comparative analysis. The proposed DBEL framework outperforms
the existing techniques on the NIH malaria dataset that are enhanced using
discrete wavelet transform to enrich feature space. The proposed DBEL framework
achieved Accuracy (98.50%), Sensitivity (0.9920), F-score (0.9850), and AUC
(0.997), which suggest it to be utilized for malaria parasite screening.Comment: 26 pages, 10 figures, 9 Table
Comparison of supervised machine learning classification techniques in prediction of locoregional recurrences in early oral tongue cancer
Background: The proper estimate of the risk of recurrences in early-stage oral tongue squamous cell carcinoma (OTSCC) is mandatory for individual treatment-decision making. However, this remains a challenge even for experienced multidisciplinary centers. Objectives: We compared the performance of four machine learning (ML) algorithms for predicting the risk of locoregional recurrences in patients with OTSCC. These algorithms were Support Vector Machine (SVM), Naive Bayes (NB), Boosted Decision Tree (BDT), and Decision Forest (DF). Materials and methods: The study cohort comprised 311 cases from the five University Hospitals in Finland and A.C. Camargo Cancer Center, Sao Paulo, Brazil. For comparison of the algorithms, we used the harmonic mean of precision and recall called F1 score, specificity, and accuracy values. These algorithms and their corresponding permutation feature importance (PFI) with the input parameters were externally tested on 59 new cases. Furthermore, we compared the performance of the algorithm that showed the highest prediction accuracy with the prognostic significance of depth of invasion (DOI). Results: The results showed that the average specificity of all the algorithms was 71% The SVM showed an accuracy of 68% and F1 score of 0.63, NB an accuracy of 70% and F1 score of 0.64, BDT an accuracy of 81% and F1 score of 0.78, and DF an accuracy of 78% and F1 score of 0.70. Additionally, these algorithms outperformed the DOI-based approach, which gave an accuracy of 63%. With PFI-analysis, there was no significant difference in the overall accuracies of three of the algorithms; PFI-BDT accuracy increased to 83.1%, PFI-DF increased to 80%, PFI-SVM decreased to 64.4%, while PFI-NB accuracy increased significantly to 81.4%. Conclusions: Our findings show that the best classification accuracy was achieved with the boosted decision tree algorithm. Additionally, these algorithms outperformed the DOI-based approach. Furthermore, with few parameters identified in the PFI analysis, ML technique still showed the ability to predict locoregional recurrence. The application of boosted decision tree machine learning algorithm can stratify OTSCC patients and thus aid in their individual treatment planning.Peer reviewe
Self-paced Convolutional Neural Network for Computer Aided Detection in Medical Imaging Analysis
Tissue characterization has long been an important component of Computer
Aided Diagnosis (CAD) systems for automatic lesion detection and further
clinical planning. Motivated by the superior performance of deep learning
methods on various computer vision problems, there has been increasing work
applying deep learning to medical image analysis. However, the development of a
robust and reliable deep learning model for computer-aided diagnosis is still
highly challenging due to the combination of the high heterogeneity in the
medical images and the relative lack of training samples. Specifically,
annotation and labeling of the medical images is much more expensive and
time-consuming than other applications and often involves manual labor from
multiple domain experts. In this work, we propose a multi-stage, self-paced
learning framework utilizing a convolutional neural network (CNN) to classify
Computed Tomography (CT) image patches. The key contribution of this approach
is that we augment the size of training samples by refining the unlabeled
instances with a self-paced learning CNN. By implementing the framework on high
performance computing servers including the NVIDIA DGX1 machine, we obtained
the experimental result, showing that the self-pace boosted network
consistently outperformed the original network even with very scarce manual
labels. The performance gain indicates that applications with limited training
samples such as medical image analysis can benefit from using the proposed
framework.Comment: accepted by 8th International Workshop on Machine Learning in Medical
Imaging (MLMI 2017
- …