2,386 research outputs found

    Conceptual data sampling for breast cancer histology image classification

    Get PDF
    Data analytics have become increasingly complicated as the amount of data has increased. One technique that is used to enable data analytics in large datasets is data sampling, in which a portion of the data is selected to preserve the data characteristics for use in data analytics. In this paper, we introduce a novel data sampling technique that is rooted in formal concept analysis theory. This technique is used to create samples reliant on the data distribution across a set of binary patterns. The proposed sampling technique is applied in classifying the regions of breast cancer histology images as malignant or benign. The performance of our method is compared to other classical sampling methods. The results indicate that our method is efficient and generates an illustrative sample of small size. It is also competing with other sampling methods in terms of sample size and sample quality represented in classification accuracy and F1 measure

    Breast cancer image classification using pattern-based Hyper Conceptual Sampling method

    Get PDF
    The increase in biomedical data has given rise to the need for developing data sampling techniques. With the emergence of big data and the rise of popularity of data science, sampling or reduction techniques have been assistive to significantly hasten the data analytics process. Intuitively, without sampling techniques, it would be difficult to efficiently extract useful patterns from a large dataset. However, by using sampling techniques, data analysis can effectively be performed on huge datasets, to produce a relatively small portion of data, which extracts the most representative objects from the original dataset. However, to reach effective conclusions and predictions, the samples should preserve the data behavior. In this paper, we propose a unique data sampling technique which exploits the notion of formal concept analysis. Machine learning experiments are performed on the resulting sample to evaluate quality, and the performance of our method is compared with another sampling technique proposed in the literature. The results demonstrate the effectiveness and competitiveness of the proposed approach in terms of sample size and quality, as determined by accuracy and the F1-measure. 2018This contribution was made possible by NPRP-07-794-1-145 grant from the Qatar National Research Fund (a member of Qatar foundation). The statements made herein are solely the responsibility of the authors.Scopu

    Breast cancer classification with histopathological image based on machine learning

    Get PDF
    Breast cancer represents one of the most common reasons for death in the worldwide. It has a substantially higher death rate than other types of cancer. Early detection can enhance the chances of receiving proper treatment and survival. In order to address this problem, this work has provided a convolutional neural network (CNN) deep learning (DL) based model on the classification that may be used to differentiate breast cancer histopathology images as benign or malignant. Besides that, five different types of pre-trained CNN architectures have been used to investigate the performance of the model to solve this problem which are the residual neural network-50 (ResNet-50), visual geometry group-19 (VGG-19), Inception-V3, and AlexNet while the ResNet-50 is also functions as a feature extractor to retrieve information from images and passed them to machine learning algorithms, in this case, a random forest (RF) and k-nearest neighbors (KNN) are employed for classification. In this paper, experiments are done using the BreakHis public dataset. As a result, the ResNet-50 network has the highest test accuracy of 97% to classify breast cancer images

    Towards Secure and Intelligent Diagnosis: Deep Learning and Blockchain Technology for Computer-Aided Diagnosis Systems

    Get PDF
    Cancer is the second leading cause of death across the world after cardiovascular disease. The survival rate of patients with cancerous tissue can significantly decrease due to late-stage diagnosis. Nowadays, advancements of whole slide imaging scanners have resulted in a dramatic increase of patient data in the domain of digital pathology. Large-scale histopathology images need to be analyzed promptly for early cancer detection which is critical for improving patient's survival rate and treatment planning. Advances of medical image processing and deep learning methods have facilitated the extraction and analysis of high-level features from histopathological data that could assist in life-critical diagnosis and reduce the considerable healthcare cost associated with cancer. In clinical trials, due to the complexity and large variance of collected image data, developing computer-aided diagnosis systems to support quantitative medical image analysis is an area of active research. The first goal of this research is to automate the classification and segmentation process of cancerous regions in histopathology images of different cancer tissues by developing models using deep learning-based architectures. In this research, a framework with different modules is proposed, including (1) data pre-processing, (2) data augmentation, (3) feature extraction, and (4) deep learning architectures. Four validation studies were designed to conduct this research. (1) differentiating benign and malignant lesions in breast cancer (2) differentiating between immature leukemic blasts and normal cells in leukemia cancer (3) differentiating benign and malignant regions in lung cancer, and (4) differentiating benign and malignant regions in colorectal cancer. Training machine learning models, disease diagnosis, and treatment often requires collecting patients' medical data. Privacy and trusted authenticity concerns make data owners reluctant to share their personal and medical data. Motivated by the advantages of Blockchain technology in healthcare data sharing frameworks, the focus of the second part of this research is to integrate Blockchain technology in computer-aided diagnosis systems to address the problems of managing access control, authentication, provenance, and confidentiality of sensitive medical data. To do so, a hierarchical identity and attribute-based access control mechanism using smart contract and Ethereum Blockchain is proposed to securely process healthcare data without revealing sensitive information to an unauthorized party leveraging the trustworthiness of transactions in a collaborative healthcare environment. The proposed access control mechanism provides a solution to the challenges associated with centralized access control systems and ensures data transparency and traceability for secure data sharing, and data ownership

    Seeing the random forest through the decision trees. Supporting learning health systems from histopathology with machine learning models: Challenges and opportunities

    Full text link
    This paper discusses some overlooked challenges faced when working with machine learning models for histopathology and presents a novel opportunity to support "Learning Health Systems" with them. Initially, the authors elaborate on these challenges after separating them according to their mitigation strategies: those that need innovative approaches, time, or future technological capabilities and those that require a conceptual reappraisal from a critical perspective. Then, a novel opportunity to support "Learning Health Systems" by integrating hidden information extracted by ML models from digitalized histopathology slides with other healthcare big data is presented

    Tumor cell load and heterogeneity estimation from diffusion-weighted MRI calibrated with histological data: an example from lung cancer

    Get PDF
    Producción CientíficaDiffusion-weighted magnetic resonance imaging (DWI) is a key non-invasive imaging technique for cancer diagnosis and tumor treatment assessment, reflecting Brownian movement of water molecules in tissues. Since densely packed cells restrict molecule mobility, tumor tissues produce usually higher signal (a.k.a. less attenuated signal) on isotropic maps compared with normal tissues. However, no general quantitative relation between DWI data and the cell density has been established. In order to link low-resolution clinical cross-sectional data with high-resolution histological information, we developed an image processing and analysis chain, which was used to study the correlation between the diffusion coefficient (D value) estimated from DWI and tumor cellularity from serial histological slides of a resected non-small cell lung cancer tumor. Color deconvolution followed by cell nuclei segmentation was performed on digitized histological images to determine local and cell-type specific 2d (two-dimensional) densities. From these, the 3d cell density was inferred by a model-based sampling technique, which is necessary for the calculation of local and global 3d tumor cell count. Next, DWI sequence information was overlaid with high-resolution CT data and the resected histology using prominent anatomical hallmarks for co-registration of histology tissue blocks and non-invasive imaging modalities' data. The integration of cell numbers information and DWI data derived from different tumor areas revealed a clear negative correlation between cell density and D value. Importantly, spatial tumor cell density can be calculated based on DWI data. In summary, our results demonstrate that tumor cell count and heterogeneity can be predicted from DWI data, which may open new opportunities for personalized diagnosis and therapy optimization

    Minimally invasive complete response assessment of the breast after neoadjuvant systemic therapy for early breast cancer (micra trial) : interim analysis of a multicenter observational cohort study

    Get PDF
    Background The added value of surgery in breast cancer patients with pathological complete response (pCR) after neoadjuvant systemic therapy (NST) is uncertain. The accuracy of imaging identifying pCR for omission of surgery, however, is insufficient. We investigated the accuracy of ultrasound-guided biopsies identifying breast pCR (ypT0) after NST in patients with radiological partial (rPR) or complete response (rCR) on MRI. Methods We performed a multicenter, prospective single-arm study in three Dutch hospitals. Patients with T1-4(N0 or N +) breast cancer with MRI rPR and enhancement <= 2.0 cm or MRI rCR after NST were enrolled. Eight ultrasound-guided 14-G core biopsies were obtained in the operating room before surgery close to the marker placed centrally in the tumor area at diagnosis (no attempt was made to remove the marker), and compared with the surgical specimen of the breast. Primary outcome was the false-negative rate (FNR). Results Between April 2016 and June 2019, 202 patients fulfilled eligibility criteria. Pre-surgical biopsies were obtained in 167 patients, of whom 136 had rCR and 31 had rPR on MRI. Forty-three (26%) tumors were hormone receptor (HR)-positive/HER2-negative, 64 (38%) were HER2-positive, and 60 (36%) were triple-negative. Eighty-nine patients had pCR (53%; 95% CI 45-61) and 78 had residual disease. Biopsies were false-negative in 29 (37%; 95% CI 27-49) of 78 patients. The multivariable associated with false-negative biopsies was rCR (FNR 47%; OR 9.81, 95% CI 1.72-55.89; p = 0.01); a trend was observed for HR-negative tumors (FNR 71% in HER2-positive and 55% in triple-negative tumors; OR 4.55, 95% CI 0.95-21.73; p = 0.058) and smaller pathological lesions (6 mm vs 15 mm; OR 0.93, 95% CI 0.87-1.00; p = 0.051). Conclusion The MICRA trial showed that ultrasound-guided core biopsies are not accurate enough to identify breast pCR in patients with good response on MRI after NST. Therefore, breast surgery cannot safely be omitted relying on the results of core biopsies in these patients

    Review of Methods for Intraoperative Margin Detection for Breast Conserving Surgery

    Get PDF
    Breast conserving surgery (BCS) is an effective treatment for early-stage cancers as long as the margins of the resected tissue are free of disease according to consensus guidelines for patient management. However, 15% to 35% of patients undergo a second surgery since malignant cells are found close to or at the margins of the original resection specimen. This review highlights imaging approaches being investigated to reduce the rate of positive margins, and they are reviewed with the assumption that a new system would need high sensitivity near 95% and specificity near 85%. The problem appears to be twofold. The first is for complete, fast surface scanning for cellular, structural, and/or molecular features of cancer, in a lumpectomy volume, which is variable in size, but can be large, irregular, and amorphous. A second is for full, volumetric imaging of the specimen at high spatial resolution, to better guide internal radiologic decision-making about the spiculations and duct tracks, which may inform that surfaces are involved. These two demands are not easily solved by a single tool. Optical methods that scan large surfaces quickly are needed with cellular/molecular sensitivity to solve the first problem, but volumetric imaging with high spatial resolution for soft tissues is largely outside of the optical realm and requires x-ray, micro-CT, or magnetic resonance imaging if they can be achieved efficiently. In summary, it appears that a combination of systems into hybrid platforms may be the optimal solution for these two very different problems. This concept must be cost-effective, image specimens within minutes and be coupled to decision-making tools that help a surgeon without adding to the procedure. The potential for optical systems to be involved in this problem is emerging and clinical trials are underway in several of these technologies to see if they could reduce positive margin rates in BCS
    • …
    corecore