
    Automatic Document Image Binarization using Bayesian Optimization

    Document image binarization is often a challenging task due to various forms of degradation. Although several binarization techniques exist in the literature, the binarized image is typically sensitive to the control parameter settings of the employed technique. This paper presents an automatic document image binarization algorithm to segment the text from heavily degraded document images. The proposed technique uses a two band-pass filtering approach for background noise removal, and Bayesian optimization for automatic hyperparameter selection for optimal results. The effectiveness of the proposed binarization technique is empirically demonstrated on the Document Image Binarization Competition (DIBCO) and the Handwritten Document Image Binarization Competition (H-DIBCO) datasets.

    Review of Face Detection Systems Based Artificial Neural Networks Algorithms

    Face detection is one of the most relevant applications of image processing and biometric systems. Artificial neural networks (ANN) have been used in the field of image processing and pattern recognition. There is a lack of literature surveys that give an overview of the studies and research related to the use of ANN in face detection. Therefore, this research presents a general review of face detection studies and systems that are based on different ANN approaches and algorithms. The strengths and limitations of these studies and systems are also included. Comment: 16 pages, 12 figures, 1 table, IJMA Journa

    A Learning Framework for Morphological Operators using Counter-Harmonic Mean

    We present a novel framework for learning morphological operators using counter-harmonic mean. It combines concepts from morphology and convolutional neural networks. A thorough experimental validation analyzes the basic morphological operators dilation and erosion, opening and closing, as well as the much more complex top-hat transform, for which we report a real-world application from the steel industry. Using online learning and stochastic gradient descent, our system learns both the structuring element and the composition of operators. It scales well to large datasets and online settings. Comment: Submitted to ISMM'1
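    As a rough illustration of why the counter-harmonic mean is a convenient bridge between convolution and morphology, the sketch below applies a 1D counter-harmonic mean filter of order p, i.e. the ratio of the weighted sums of f^(p+1) and f^p over a window. For large positive p the output approaches a flat dilation (local maximum), for large negative p a flat erosion (local minimum), and for p = 0 it is an ordinary weighted mean. The function name, the edge handling, and the fixed window are assumptions made for this example; they are not taken from the paper, which learns the structuring element rather than fixing it.

```python
def counter_harmonic_mean(signal, weights, p):
    """Counter-harmonic mean filter of order p on a 1D signal.

    Hypothetical helper for illustration only. Assumes strictly positive
    signal values, non-negative weights, and an odd-length window.
    """
    half = len(weights) // 2
    n = len(signal)
    out = []
    for i in range(n):
        num = 0.0
        den = 0.0
        for k, w in enumerate(weights):
            # Replicate the border samples (an assumption of this sketch).
            j = min(max(i + k - half, 0), n - 1)
            num += w * signal[j] ** (p + 1)
            den += w * signal[j] ** p
        out.append(num / den)
    return out


signal = [1.0, 5.0, 2.0, 3.0, 4.0]
window = [1.0, 1.0, 1.0]

pseudo_dilation = counter_harmonic_mean(signal, window, 30)   # close to the local max
pseudo_erosion = counter_harmonic_mean(signal, window, -30)   # close to the local min
local_mean = counter_harmonic_mean(signal, window, 0)         # plain weighted mean
```

    Because the filter is differentiable in both the weights and p, a gradient-based learner can in principle move smoothly between averaging, dilation-like, and erosion-like behaviour; the order 30 used here is an arbitrary choice for the demonstration.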

    Grounding semantics in robots for Visual Question Answering

    In this thesis I describe an operational implementation of an object detection and description system that is incorporated into an end-to-end Visual Question Answering system, and I evaluate it on two visual question answering datasets for compositional language and elementary visual reasoning.

    The ALHAMBRA Project: A large area multi medium-band optical and NIR photometric survey

    (ABRIDGED) We describe the first results of the ALHAMBRA survey which provides cosmic tomography of the evolution of the contents of the Universe over most of Cosmic history. Our approach employs 20 contiguous, equal-width, medium-band filters covering from 3500 to 9700 A, plus the JHKs bands, to observe an area of 4 square degrees on the sky. The optical photometric system has been designed to maximize the number of objects with accurate classification by SED and redshift, and to be sensitive to relatively faint emission lines. The observations are being carried out with the Calar Alto 3.5m telescope using the cameras LAICA and O-2000. The first data confirm that we are reaching the expected magnitude limits of AB<~25 mag in the optical filters from the blue to 8300 A, and from AB=24.7 to 23.4 for the redder ones. The limit in the NIR is (Vega) K_s~20, H~21, J~22. We expect to obtain accurate redshift values, Delta z/(1+z) <~ 0.03 for about 5x10^5 galaxies with I<~25 (60% complete), and z_med=0.74. This accuracy, together with the homogeneity of the selection function, will allow for the study of the redshift evolution of the large scale structure, the galaxy population and its evolution with redshift, the identification of clusters of galaxies, and many other studies, without the need for any further follow-up. It will also provide targets for detailed studies with 10m-class telescopes. Given its area, spectral coverage and its depth, apart from those main goals, the ALHAMBRA-Survey will also produce valuable data for galactic studies. Comment: Accepted to the Astronomical Journal. 43 pages, 18 figures. The images have been reduced in resolution to adapt to standard file sizes. Readers can find the full-resolution version of the paper at the ALHAMBRA web site (http://www.iaa.es/alhambra) under the "Publications" lin
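    The figures quoted in this abstract imply a couple of simple derived quantities, worked out in the sketch below: the width of each optical filter if 20 contiguous, equal-width bands tile 3500 to 9700 A exactly, and the absolute redshift uncertainty that Delta z/(1+z) = 0.03 corresponds to at the median redshift. Idealized, non-overlapping top-hat filters are an assumption of this example, and the function names are hypothetical.

```python
# Values quoted in the abstract above.
LAMBDA_MIN = 3500.0   # Angstrom
LAMBDA_MAX = 9700.0   # Angstrom
N_FILTERS = 20
REL_Z_ERR = 0.03      # upper bound on Delta z / (1 + z)
Z_MED = 0.74          # median redshift


def filter_edges(lo, hi, n):
    """Edges of n contiguous, equal-width bands (idealized top-hat filters)."""
    width = (hi - lo) / n
    return [lo + i * width for i in range(n + 1)]


def redshift_error(z, rel_err):
    """Absolute redshift uncertainty implied by a relative error in (1 + z)."""
    return rel_err * (1.0 + z)


edges = filter_edges(LAMBDA_MIN, LAMBDA_MAX, N_FILTERS)
width = edges[1] - edges[0]                     # 310 Angstrom per filter
dz_at_median = redshift_error(Z_MED, REL_Z_ERR)  # about 0.052
```

    Under these assumptions each band is 310 A wide, and the quoted accuracy corresponds to an uncertainty of roughly 0.05 in z at the median redshift.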