
    Page layout analysis and classification in complex scanned documents

    Page layout analysis has been extensively studied since the 1980s, particularly after computers began to be used for document storage and as database units. For efficient storage and retrieval from a database, a paper document is transformed into its electronic version. Document image analysis algorithms segment a scanned document into different regions such as text, image, or line regions. As a novel contribution to page layout analysis and classification, the algorithm is developed for both RGB and gray-scale scanned documents without requiring any specific document type or scanning technique. In this thesis, a page classification algorithm is proposed which mainly applies the wavelet transform, a Markov random field (MRF), and the Hough transform to segment text, photo, and strong edge/line regions in both color and gray-scale scanned documents. The algorithm is developed to handle both simple and complex page layout structures and contents (text only vs. a book cover that includes text, lines, and/or photos). The methodology consists of five modules. In the first module, pre-processing, image enhancement techniques such as image scaling, filtering, color space conversion, or gamma correction are applied in order to reduce computation time and enhance the scanned document. The classification techniques are applied to the one-fourth-resolution input image in the CIE L*a*b* color space. In the second module, text detection, wavelet analysis generates a text-region candidate map, which is enhanced by applying a Run Length Encoding (RLE) technique for verification purposes. The third module, photo detection, initially uses block-wise segmentation based on a basis vector projection technique. Then, an MRF with a maximum a posteriori (MAP) optimization framework is used to generate the photo map. In the fourth module, the Hough transform is applied to locate lines. Techniques for edge detection, edge linking, and line-segment fitting are also used in this module to detect strong edges. After those three classification maps are obtained, the last module generates a final page layout map: features are extracted to classify the intersection regions, which are merged into one classification map with K-Means clustering. The proposed technique is tested on several hundred images and its performance is validated with a confusion matrix (CM). The technique achieves an average classification accuracy of 85% for text, photo, and background regions on a variety of scanned documents such as articles, magazines, business cards, dictionaries, and newsletters. More importantly, it performs independently of the scanning process and of the input document type (RGB or gray-scale), with comparable classification quality.
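The abstract reports validation with a confusion matrix over text, photo, and background regions. The following is a minimal sketch of how such a matrix and its accuracy figures could be computed; the function names, the class list, and the toy labels are assumptions made for illustration and are not taken from the thesis.

```python
from collections import Counter


def confusion_matrix(true_labels, predicted_labels, classes):
    """Return a nested dict cm[true][predicted] = number of samples."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label sequences must have equal length")
    counts = Counter(zip(true_labels, predicted_labels))
    return {t: {p: counts[(t, p)] for p in classes} for t in classes}


def overall_accuracy(cm):
    """Fraction of all samples that lie on the diagonal."""
    correct = sum(cm[c][c] for c in cm)
    total = sum(sum(row.values()) for row in cm.values())
    return correct / total if total else 0.0


def per_class_accuracy(cm):
    """Fraction of each true class that was labeled correctly."""
    result = {}
    for c, row in cm.items():
        n = sum(row.values())
        result[c] = row[c] / n if n else 0.0
    return result


# Hypothetical region labels, invented purely to exercise the functions.
CLASSES = ["text", "photo", "background"]
truth = ["text", "text", "photo", "background",
         "photo", "background", "text", "background"]
pred = ["text", "photo", "photo", "background",
        "photo", "text", "text", "background"]

cm = confusion_matrix(truth, pred, CLASSES)
print(overall_accuracy(cm))
print(per_class_accuracy(cm))
```

In practice the labels would come from per-pixel or per-block comparison of the final layout map against ground truth; the abstract does not say which granularity was used.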

    Framework for comprehensive enhancement of brain tumor images with single-window operation

    Radiological images are used in grayscale format far more often than in color. Grayscale medical images are prone to improper clinical inference, which leads to error-prone analysis when such images are later used for disease detection or classification. Therefore, we present a framework that offers single-window operation with a set of image enhancement algorithms meant to further optimize the visual quality of medical images. The framework performs preliminary pre-processing followed by the application of linear and non-linear filters and multi-level image enhancement processes. The significant contribution of this study is a comprehensive mechanism that implements the various enhancement schemes in a highly discrete way, giving physicians the flexibility to draw clinical conclusions about the disease being monitored. The proposed system uses brain tumor images as a case study to test the framework.

    Human object annotation for surveillance video forensics

    A system that can automatically annotate surveillance video in a manner useful for locating a person with a given description of clothing is presented. Each human is annotated based on two appearance features: primary colors of clothes and the presence of text/logos on clothes. The annotation occurs after a robust foreground extraction stage employing a modified Gaussian mixture model-based approach. The proposed pipeline consists of a preprocessing stage where color appearance of an image is improved using a color constancy algorithm. In order to annotate color information for human clothes, we use the color histogram feature in HSV space and find local maxima to extract dominant colors for different parts of a segmented human object. To detect text/logos on clothes, we begin with the extraction of connected components of enhanced horizontal, vertical, and diagonal edges in the frames. These candidate regions are classified as text or nontext on the basis of their local energy-based shape histogram features. Further, to detect humans, a novel technique has been proposed that uses contourlet transform-based local binary pattern (CLBP) features. In the proposed method, we extract the uniform direction invariant LBP feature descriptor for contourlet transformed high-pass subimages from vertical and diagonal directional bands. In the final stage, extracted CLBP descriptors are classified by a trained support vector machine. Experimental results illustrate the superiority of our method on large-scale surveillance video data
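The dominant-color step described above (a color histogram in HSV space, followed by a search for local maxima) can be sketched roughly as follows. This is a simplified illustration that histograms only the hue channel. The bin count, the saturation and value thresholds, the minimum peak fraction, and all function names are assumptions, not details from the paper.

```python
import colorsys


def hue_histogram(pixels, bins=12, min_saturation=0.2, min_value=0.2):
    """Histogram of hue over sufficiently colorful RGB pixels (channels 0-255)."""
    hist = [0] * bins
    for r, g, b in pixels:
        h, s, v = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
        if s < min_saturation or v < min_value:
            continue  # hue is unreliable for gray or very dark pixels
        hist[min(int(h * bins), bins - 1)] += 1
    return hist


def dominant_hue_bins(hist, min_fraction=0.1):
    """Indices of circular local maxima holding at least min_fraction of the counts."""
    total = sum(hist)
    if total == 0:
        return []
    n = len(hist)
    peaks = []
    for i, count in enumerate(hist):
        left, right = hist[(i - 1) % n], hist[(i + 1) % n]
        # Hue wraps around, so the first and last bins are neighbors.
        if count >= min_fraction * total and count > left and count >= right:
            peaks.append(i)
    return sorted(peaks, key=lambda i: hist[i], reverse=True)


# Hypothetical pixels from one segmented body part: mostly red, some blue, a little gray.
pixels = [(220, 40, 30)] * 60 + [(30, 60, 200)] * 30 + [(128, 128, 128)] * 10
hist = hue_histogram(pixels)
print(dominant_hue_bins(hist))
```

Running the same two functions separately on the pixels of each body part of a segmented person would give one list of dominant hue bins per part, which could then be mapped to color names for annotation.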

    Stochastic signatures of involuntary head micro-movements can be used to classify females of ABIDE into different subtypes of neurodevelopmental disorders.

    © 2017 Torres, Mistry, Caballero and Whyatt. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). Background: The approximate 5:1 male to female ratio in clinical detection of Autism Spectrum Disorder (ASD) prevents research from characterizing the female phenotype. Current open access repositories [such as those in the Autism Brain Imaging Data Exchange (ABIDE I-II)] contain large numbers of females to help begin providing a new characterization of females on the autistic spectrum. Here we introduce new methods to integrate data in a scale-free manner from continuous biophysical rhythms of the nervous systems and discrete (ordinal) observational scores. Methods: New data-types derived from image-based involuntary head motions and personalized statistical platform were combined with a data-driven approach to unveil sub-groups within the female cohort. Further, to help refine the clinical DSM-based ASD vs. Asperger's Syndrome (AS) criteria, distributional analyses of ordinal score data from Autism Diagnostic Observation Schedule (ADOS)-based criteria were used on both the female and male phenotypes. Results: Separate clusters were automatically uncovered in the female cohort corresponding to differential levels of severity. Specifically, the AS-subgroup emerged as the most severely affected with an excess level of noise and randomness in the involuntary head micro-movements. Extending the methods to characterize males of ABIDE revealed ASD-males to be more affected than AS-males. A thorough study of ADOS-2 and ADOS-G scores provided confounding results regarding the ASD vs. AS male comparison, whereby the ADOS-2 rendered the AS-phenotype worse off than the ASD-phenotype, while ADOS-G flipped the results. Females with AS scored higher on severity than ASD-females in all ADOS test versions and their scores provided evidence for significantly higher severity than males. 
However, the statistical landscapes underlying female and male scores appeared disparate. As such, further interpretation of the ADOS data seems problematic, rather suggesting the critical need to develop an entirely new metric to measure social behavior in females. Conclusions: According to the outcome of objective, data-driven analyses and subjective clinical observation, these results support the proposition that the female phenotype is different. Consequently the “social behavioral male ruler” will continue to mask the female autistic phenotype. It is our proposition that new observational behavioral tests ought to contain normative scales, be statistically sound and combined with objective data-driven approaches to better characterize the females across the human lifespan.

    Doctor of Philosophy

    Confocal microscopy has become a popular imaging technique in biology research in recent years. It is often used to study three-dimensional (3D) structures of biological samples. Confocal data are commonly multichannel, with each channel resulting from a different fluorescent staining. This technique also results in finely detailed structures in 3D, such as neuron fibers. Despite the plethora of volume rendering techniques that have been available for many years, there is a demand from biologists for a flexible tool that allows interactive visualization and analysis of multichannel confocal data. Together with biologists, we have designed and developed FluoRender. It incorporates volume rendering techniques such as a two-dimensional (2D) transfer function and multichannel intermixing. Rendering results can be enhanced through tone-mappings and overlays. To facilitate analyses of confocal data, FluoRender provides interactive operations for extracting complex structures. Furthermore, we developed the Synthetic Brainbow technique, which takes advantage of the asynchronous behavior in Graphics Processing Unit (GPU) framebuffer loops and generates random colorizations for different structures in single-channel confocal data. The results from our Synthetic Brainbows, when applied to a sequence of developing cells, can then be used for tracking the movements of these cells. Finally, we present an application of FluoRender in the workflow of constructing anatomical atlases

    Analysis of Underwater Image Quality Improvement Based on Gamma Correction and Histogram Equalization

    Underwater images are often dark, and their quality depends on the water depth at the time of image acquisition. Poor image quality adversely affects the matching of underwater image pairs with the SIFT algorithm. This research applies Gamma Correction and Histogram Equalization as image preprocessing methods to improve the quality of underwater images. The results showed a 27.76% increase with Gamma Correction and Histogram Equalization preprocessing compared with no image quality enhancement. A paired t-test rejected the null hypothesis, indicating a significant difference between applying Gamma Correction and Histogram Equalization and applying no image enhancement.
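The two preprocessing steps named in this abstract can be illustrated with a minimal pure-Python sketch for 8-bit grayscale images stored as lists of rows. The gamma value of 0.5, the order of the two steps, and the function names are assumptions for illustration only; the abstract does not state the parameters the authors used.

```python
def gamma_correct(image, gamma):
    """Apply out = 255 * (in / 255) ** gamma; gamma < 1 brightens dark images."""
    lut = [round(255 * (v / 255) ** gamma) for v in range(256)]
    return [[lut[p] for p in row] for row in image]


def equalize_histogram(image):
    """CDF-based histogram equalization for an 8-bit grayscale image."""
    hist = [0] * 256
    for row in image:
        for p in row:
            hist[p] += 1
    total = sum(hist)
    if total == 0:
        return []
    cdf, running = [], 0
    for count in hist:
        running += count
        cdf.append(running)
    cdf_min = next(c for c in cdf if c > 0)
    if total == cdf_min:
        return [row[:] for row in image]  # constant image, nothing to spread
    # Map each gray level so that the cumulative distribution becomes roughly linear.
    lut = [max(0, round(255 * (c - cdf_min) / (total - cdf_min))) for c in cdf]
    return [[lut[p] for p in row] for row in image]


# Hypothetical dark 2x2 image, invented to exercise the functions.
dark = [[10, 20], [30, 40]]
brightened = gamma_correct(dark, 0.5)
enhanced = equalize_histogram(brightened)
print(brightened)
print(enhanced)
```

Gamma correction lifts the dark tones first, and equalization then stretches the result across the full 0 to 255 range, which is the kind of contrast gain that would give a feature matcher such as SIFT more usable gradients.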

    Advanced Image Acquisition, Processing Techniques and Applications

    "Advanced Image Acquisition, Processing Techniques and Applications" is the first book of a series that provides image processing principles and practical software implementation on a broad range of applications. The book integrates material from leading researchers on Applied Digital Image Acquisition and Processing. An important feature of the book is its emphasis on software tools and scientific computing in order to enhance results and arrive at problem solutions.

    The Fast and Furious Decay of the Peculiar Type Ic Supernova 2005ek

    We present extensive multi-wavelength observations of the extremely rapidly declining Type Ic supernova, SN 2005ek. Reaching a peak magnitude of M_R = -17.3 and decaying by ~3 mag in the first 15 days post-maximum, SN 2005ek is among the fastest Type I supernovae observed to date. The spectra of SN 2005ek closely resemble those of normal SN Ic, but with an accelerated evolution. There is evidence for the onset of nebular features at only nine days post-maximum. Spectroscopic modeling reveals an ejecta mass of ~0.3 Msun that is dominated by oxygen (~80%), while the pseudo-bolometric light curve is consistent with an explosion powered by ~0.03 Msun of radioactive Ni-56. Although previous rapidly evolving events (e.g., SN 1885A, SN 1939B, SN 2002bj, SN 2010X) were hypothesized to be produced by the detonation of a helium shell on a white dwarf, oxygen-dominated ejecta are difficult to reconcile with this proposed mechanism. We find that the properties of SN 2005ek are consistent with either the edge-lit double detonation of a low-mass white dwarf or the iron-core collapse of a massive star, stripped by binary interaction. However, if we assume that the strong spectroscopic similarity of SN 2005ek to other SN Ic is an indication of a similar progenitor channel, then a white-dwarf progenitor becomes very improbable. SN 2005ek may be one of the lowest mass stripped-envelope core-collapse explosions ever observed. We find that the rate of such rapidly declining Type I events is at least 1-3% of the normal SN Ia rate. Comment: Accepted for publication in ApJ. Please visit http://www.cfa.harvard.edu/~mdrout to hear a sonification of SN2005e
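To put the quoted decline of roughly 3 mag in 15 days into linear terms, the standard magnitude-flux relation (a difference of 5 magnitudes is a factor of 100 in flux) can be applied. The short sketch below is a worked example only; the function names are illustrative, and the inputs are the approximate values quoted in the abstract, not fitted light-curve data.

```python
def flux_ratio(delta_mag):
    """Flux ratio F_later / F_peak after fading by delta_mag magnitudes."""
    return 10 ** (-0.4 * delta_mag)


def mean_decline_rate(delta_mag, days):
    """Average decline in magnitudes per day over the interval."""
    return delta_mag / days


# Approximate values from the abstract: ~3 mag in the first 15 days post-maximum.
ratio = flux_ratio(3.0)
rate = mean_decline_rate(3.0, 15.0)
print(f"flux after 15 days: {ratio:.3f} of peak")
print(f"mean decline rate: {rate:.2f} mag/day")
```

Under these approximate inputs the flux falls to about 6% of its peak value within 15 days, at an average of about 0.2 mag per day.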