526 research outputs found

    Enhancement of Image Resolution by Binarization

    Full text link
    Image segmentation is one of the principal approaches of image processing. The choice of the most appropriate Binarization algorithm for each case proved to be a very interesting procedure itself. In this paper, we have done the comparison study between the various algorithms based on Binarization algorithms and propose a methodologies for the validation of Binarization algorithms. In this work we have developed two novel algorithms to determine threshold values for the pixels value of the gray scale image. The performance estimation of the algorithm utilizes test images with, the evaluation metrics for Binarization of textual and synthetic images. We have achieved better resolution of the image by using the Binarization method of optimum thresholding techniques.Comment: 5 pages, 8 figure

    Composite Correlation Quantization for Efficient Multimodal Retrieval

    Full text link
    Efficient similarity retrieval from large-scale multimodal database is pervasive in modern search engines and social networks. To support queries across content modalities, the system should enable cross-modal correlation and computation-efficient indexing. While hashing methods have shown great potential in achieving this goal, current attempts generally fail to learn isomorphic hash codes in a seamless scheme, that is, they embed multiple modalities in a continuous isomorphic space and separately threshold embeddings into binary codes, which incurs substantial loss of retrieval accuracy. In this paper, we approach seamless multimodal hashing by proposing a novel Composite Correlation Quantization (CCQ) model. Specifically, CCQ jointly finds correlation-maximal mappings that transform different modalities into isomorphic latent space, and learns composite quantizers that convert the isomorphic latent features into compact binary codes. An optimization framework is devised to preserve both intra-modal similarity and inter-modal correlation through minimizing both reconstruction and quantization errors, which can be trained from both paired and partially paired data in linear time. A comprehensive set of experiments clearly show the superior effectiveness and efficiency of CCQ against the state of the art hashing methods for both unimodal and cross-modal retrieval

    Adaptive detection and tracking using multimodal information

    Get PDF
    This thesis describes work on fusing data from multiple sources of information, and focuses on two main areas: adaptive detection and adaptive object tracking in automated vision scenarios. The work on adaptive object detection explores a new paradigm in dynamic parameter selection, by selecting thresholds for object detection to maximise agreement between pairs of sources. Object tracking, a complementary technique to object detection, is also explored in a multi-source context and an efficient framework for robust tracking, termed the Spatiogram Bank tracker, is proposed as a means to overcome the difficulties of traditional histogram tracking. As well as performing theoretical analysis of the proposed methods, specific example applications are given for both the detection and the tracking aspects, using thermal infrared and visible spectrum video data, as well as other multi-modal information sources

    Achieving Information Security by multi-Modal Iris-Retina Biometric Approach Using Improved Mask R-CNN

    Get PDF
    The need for reliable user recognition (identification/authentication) techniques has grown in response to heightened security concerns and accelerated advances in networking, communication, and mobility. Biometrics, defined as the science of recognizing an individual based on his or her physical or behavioral characteristics, is gaining recognition as a method for determining an individual\u27s identity. Various commercial, civilian, and forensic applications now use biometric systems to establish identity. The purpose of this paper is to design an efficient multimodal biometric system based on iris and retinal features to assure accurate human recognition and improve the accuracy of recognition using deep learning techniques. Deep learning models were tested using retinographies and iris images acquired from the MESSIDOR and CASIA-IrisV1 databases for the same person. The Iris region was segmented from the image using the custom Mask R-CNN method, and the unique blood vessels were segmented from retinal images of the same person using principal curvature. Then, in order to aid precise recognition, they optimally extract significant information from the segmented images of the iris and retina. The suggested model attained 98% accuracy, 98.1% recall, and 98.1% precision. It has been discovered that using a custom Mask R-CNN approach on Iris-Retina images improves efficiency and accuracy in person recognition

    Parallel Genetic Algorithm based Thresholding Schemes for Image Segmentation

    Get PDF
    In this thesis, the problem of image segmentation has been addressed using the notion of thresholding.Since the focus of this work is primarily on object/objects background classification and fault detection in a given scene, the segmentation problem is viewed as a classification problem. In this regard, the notion of thresholding has been used to classify the range of gray values and hence classifies the image. The gray level distributions of the original image or the proposed feature image have been used to obtain the optimal threshold. Initially, PGA based class models have been developed to classify different classes of a nonlinear multimodal function. This problem is formulated where the nonlinear multimodal function is viewed as consisting of multiple class distributions.Each class could be represented by the niche or peaks of that class.Hence, the problem has been formulated to detect the peaks of the functions. PGA based clustering algorithm has been proposed to maintain stable sub-populations in the niches and hence the peaks could be detected. A new interconnection model has been proposed for PGA to accelerate the rate of convergence to the optimal solution. Convergence analysis of the proposed PGA based algorithm has been carried out and is shown to converge to the solution. The proposed PGA based clustering algorithm could successfully be tested for different classes and is found to converge much faster than that of GA based clustering algorithm

    Multilevel Thresholding of Brain Tumor MRI Images: Patch-Levy Bees Algorithm versus Harmony Search Algorithm

    Get PDF
    Image segmentation of brain magnetic resonance imaging (MRI) plays a crucial role among radiologists in terms of diagnosing brain disease. Parts of the brain such as white matter, gray matter and cerebrospinal fluids (CFS), have to be clearly determined by the radiologist during the process of brain abnormalities detection. Manual segmentation is grueling and may be prone to error, which can in turn affect the result of the diagnosis. Nature-inspired metaheuristic algorithms such as Harmony Search (HS), which was successfully applied in multilevel thresholding for brain tumor segmentation instead of the Patch-Levy Bees algorithm (PLBA). Even though the PLBA is one powerful multilevel thresholding, it has not been applied to brain tumor segmentation. This paper focuses on a comparative study of the PLBA and HS for brain tumor segmentation. The test dataset consisting of nine images was collected from the Tuanku Muhriz UKM Hospital (HCTM). As for the result, it shows that the PLBA has significantly outperformed HS. The performance of both algorithms is evaluated in terms of solution quality and stability

    Analytical methods fort he study of color in digital images

    Get PDF
    La descripció qualitativa dels colors que composen una imatge digital és una tasca molt senzilla pel sistema visual humà. Per un ordinador aquesta tasca involucra una gran quantitat de qüestions i de dades que la converteixen en una operació de gran complexitat. En aquesta tesi desenvolupam un mètode automàtic per a la construcció d’una paleta de colors d’una imatge digital, intentant respondre a les diferents qüestions que se’ns plantegen quan treballam amb colors a dins el món computacional. El desenvolupament d’aquest mètode suposa l’obtenció d’un algorisme automàtic de segmentació d’histogrames, el qual és construït en detall a la tesi i diferents aplicacions del mateix son donades. Finalment, també s’explica el funcionament de CProcess, un ‘software’ amigable desenvolupat per a la fàcil comprensió del color

    Image similarity in medical images

    Get PDF
    corecore