4,098 research outputs found

    Coding of details in very low bit-rate video systems

    Get PDF
    In this paper, the importance of including small image features at the initial levels of a progressive second generation video coding scheme is presented. It is shown that a number of meaningful small features called details should be coded, even at very low data bit-rates, in order to match their perceptual significance to the human visual system. We propose a method for extracting, perceptually selecting and coding of visual details in a video sequence using morphological techniques. Its application in the framework of a multiresolution segmentation-based coding algorithm yields better results than pure segmentation techniques at higher compression ratios, if the selection step fits some main subjective requirements. Details are extracted and coded separately from the region structure and included in the reconstructed images in a later stage. The bet of considering the local background of a given detail for its perceptual selection breaks the concept ofPeer ReviewedPostprint (published version

    Uncertainty-Aware Organ Classification for Surgical Data Science Applications in Laparoscopy

    Get PDF
    Objective: Surgical data science is evolving into a research field that aims to observe everything occurring within and around the treatment process to provide situation-aware data-driven assistance. In the context of endoscopic video analysis, the accurate classification of organs in the field of view of the camera proffers a technical challenge. Herein, we propose a new approach to anatomical structure classification and image tagging that features an intrinsic measure of confidence to estimate its own performance with high reliability and which can be applied to both RGB and multispectral imaging (MI) data. Methods: Organ recognition is performed using a superpixel classification strategy based on textural and reflectance information. Classification confidence is estimated by analyzing the dispersion of class probabilities. Assessment of the proposed technology is performed through a comprehensive in vivo study with seven pigs. Results: When applied to image tagging, mean accuracy in our experiments increased from 65% (RGB) and 80% (MI) to 90% (RGB) and 96% (MI) with the confidence measure. Conclusion: Results showed that the confidence measure had a significant influence on the classification accuracy, and MI data are better suited for anatomical structure labeling than RGB data. Significance: This work significantly enhances the state of art in automatic labeling of endoscopic videos by introducing the use of the confidence metric, and by being the first study to use MI data for in vivo laparoscopic tissue classification. The data of our experiments will be released as the first in vivo MI dataset upon publication of this paper.Comment: 7 pages, 6 images, 2 table

    Compact color texture descriptor based on rank transform and product ordering in the RGB color space

    Get PDF

    Region-based representations of image and video: segmentation tools for multimedia services

    Get PDF
    This paper discusses region-based representations of image and video that are useful for multimedia services such as those supported by the MPEG-4 and MPEG-7 standards. Classical tools related to the generation of the region-based representations are discussed. After a description of the main processing steps and the corresponding choices in terms of feature spaces, decision spaces, and decision algorithms, the state of the art in segmentation is reviewed. Mainly tools useful in the context of the MPEG-4 and MPEG-7 standards are discussed. The review is structured around the strategies used by the algorithms (transition based or homogeneity based) and the decision spaces (spatial, spatio-temporal, and temporal). The second part of this paper proposes a partition tree representation of images and introduces a processing strategy that involves a similarity estimation step followed by a partition creation step. This strategy tries to find a compromise between what can be done in a systematic and universal way and what has to be application dependent. It is shown in particular how a single partition tree created with an extremely simple similarity feature can support a large number of segmentation applications: spatial segmentation, motion estimation, region-based coding, semantic object extraction, and region-based retrieval.Peer ReviewedPostprint (published version

    Characterization and automatic screening of reactive and abnormal neoplastic B lymphoid cells from peripheral blood

    Get PDF
    The objective was to advance in the automatic, image-based, characterization and recognition of a heterogeneous set of lymphoid cells from peripheral blood, including normal, reactive, and five groups of abnormal lymphocytes: hairy cells, mantle cells, follicular lymphoma, chronic lymphocytic leukemia, and prolymphocytes. Methods: A number of 4389 images from 105 patients were selected by pathologists, based on morphologic visual appearance, from patients whose diagnosis was confirmed by all the remaining complementary tests. Besides geometry, new color and texture features were extracted using six alternative color spaces to obtain rich information to characterize the cell groups. The recognition system was designed using support vector machines trained with the whole image set. Results: In the experimental tests, individual sets of images from 21 new patients were analyzed by the trained recognition system and compared with the true diagnosis. An overall recognition accuracy of 97.67% was achieved when the cell screening was performed into three groups: normal lymphocytes, abnormal lymphoid cells, and reactive lymphocytes. The accuracy of the whole experimental study was 91.23% when considering the further discrimination of the abnormal lymphoid cells into the specific five groups. Conclusion: The excellent automatic screening of the three groups of normal, reactive, and abnormal lymphocytes is useful as it discriminates between malignancy and not malignancy. The discrimination of the five groups of abnormal lymphoid cells is encouraging toward the idea that the system could be an automated image-based screening method to identify blood involvement by a variety of B lymphomas.Preprin

    A comparative study of algorithms for automatic segmentation of dermoscopic images

    Get PDF
    Melanoma is the most common as well as the most dangerous type of skin cancer. Nevertheless, it can be effectively treated if detected early. Dermoscopy is one of the major non-invasive imaging techniques for the diagnosis of skin lesions. The computer-aided diagnosis based on the processing of dermoscopic images aims to reduce the subjectivity and time-consuming analysis related to traditional diagnosis. The first step of automatic diagnosis is image segmentation. In this project, the implementation and evaluation of several methods were proposed for the automatic segmentation of lesion regions in dermoscopic images, along with the corresponding implemented phases for image preprocessing and postprocessing. The developed algorithms include methods based on different state of the art techniques. The main groups of techniques which have been selected to be studied and implemented are thresholding-based methods, region-based methods, segmentation based on deformable models, as well as a new proposed approach based on the bag-of-words model. The implemented methods incorporate modifications for a better adaptation to features associated with dermoscopic images. Each implemented method was applied to a database constituted by 724 dermoscopic images. The output of the automatic segmentation procedure for each image was compared with the corresponding manual segmentation in order to evaluate the performance. The comparison between algorithms was carried out regarding the obtained evaluation metrics. The best results were achieved by the combination of region-based segmentation based on the multi-region adaptation of the k-means algorithm and the subIngeniería de Sistemas Audiovisuale

    Classification of Plasmodium Malariae dan Plasmodium Ovale in Microscopic Thin Blood Smear Digital Images

    Get PDF
    Malaria is one of the global diseases, which mostly found in eastern Indonesia. It is caused by Plasmodium parasite infection, with four type of common species that are Plasmodium ovale (PO), Plasmodium Malaria (PM), Plasmodium falciparum (PF) and Plasmodium vivax (PV). Malaria can be detected by taking a microscopic analysis from a patient blood sample. Although it is a gold standard of malaria identification according to the WHO, this method has a risk of miss diagnosis due to the human factors. This study proposed a classification method with morphological features of PM and PO in order to help the medical expertise in identifying the malaria parasite from a thin blood smear digital microscopic image. The data used are digital images that have been through the Region of Interest (ROI) determination process. Furthermore, the process followed by improving the morphological and feature extraction of shapes and colors. Based on these obtained features, the parasites are classified by using the multilayer perceptron method. From this study, we found that the classification system has the accuracy of 95%, the sensitivity of 93%, and the specificity of 97%
    corecore