1,087 research outputs found

    Human Shape and Clothing Estimation

    Full text link
    Human shape and clothing estimation has gained significant prominence in various domains, including online shopping, fashion retail, augmented reality (AR), virtual reality (VR), and gaming. The visual representation of human shape and clothing has become a focal point for computer vision researchers in recent years. This paper presents a comprehensive survey of the major works in the field, focusing on four key aspects: human shape estimation, fashion generation, landmark detection, and attribute recognition. For each of these tasks, the survey paper examines recent advancements, discusses their strengths and limitations, and qualitative differences in approaches and outcomes. By exploring the latest developments in human shape and clothing estimation, this survey aims to provide a comprehensive understanding of the field and inspire future research in this rapidly evolving domain

    An investigation of techniques in deformable object recognition

    Get PDF
    The human\u27s innate ability to process information garnered from a visual scene has no parallel in the digital realm. This task is taken for granted in human cognition, but has not been met by a complete digital solution even following years of research. This difficulty can be explained by the shear complexity of the physology of the visual pathway. Although a complete solution has not been created, there are a number of examples of solutions that address parts of the problem. The recognition of deformable objects is the area addressed in this work. The specific task researched was the recognition of creatures in structured visual scenes. The focus was on developing a set of features which are able to differentiate between target creature classes. The implications of this research lie in ecoinformatics and field biology with the automated collection and annotation of biological data. The thesis will present a survey of the current literature addressing techniques which have been used to solve similar problems. An algorithm to perform the recognition will be presented and the results discussed. Finally, potential areas for improvement will be described

    Document Image Analysis Techniques for Handwritten Text Segmentation, Document Image Rectification and Digital Collation

    Get PDF
    Document image analysis comprises all the algorithms and techniques that are utilized to convert an image of a document to a computer readable description. In this work we focus on three such techniques, namely (1) Handwritten text segmentation (2) Document image rectification and (3) Digital Collation. Offline handwritten text recognition is a very challenging problem. Aside from the large variation of different handwriting styles, neighboring characters within a word are usually connected, and we may need to segment a word into individual characters for accurate character recognition. Many existing methods achieve text segmentation by evaluating the local stroke geometry and imposing constraints on the size of each resulting character, such as the character width, height and aspect ratio. These constraints are well suited for printed texts, but may not hold for handwritten texts. Other methods apply holistic approach by using a set of lexicons to guide and correct the segmentation and recognition. This approach may fail when the domain lexicon is insufficient. In the first part of this work, we present a new global non-holistic method for handwritten text segmentation, which does not make any limiting assumptions on the character size and the number of characters in a word. We conduct experiments on real images of handwritten texts taken from the IAM handwriting database and compare the performance of the presented method against an existing text segmentation algorithm that uses dynamic programming and achieve significant performance improvement. Digitization of document images using OCR based systems is adversely affected if the image of the document contains distortion (warping). Often, costly and precisely calibrated special hardware such as stereo cameras, laser scanners, etc. are used to infer the 3D model of the distorted image which is used to remove the distortion. Recent methods focus on creating a 3D shape model based on 2D distortion informa- tion obtained from the document image. The performance of these methods is highly dependent on estimating an accurate 2D distortion grid. These methods often affix the 2D distortion grid lines to the text line, and as such, may suffer in the presence of unreliable textual cues due to preprocessing steps such as binarization. In the domain of printed document images, the white space between the text lines carries as much information about the 2D distortion as the text lines themselves. Based on this intuitive idea, in the second part of our work we build a 2D distortion grid from white space lines, which can be used to rectify a printed document image by a dewarping algorithm. We compare our presented method against a state-of-the-art 2D distortion grid construction method and obtain better results. We also present qualitative and quantitative evaluations for the presented method. Collation of texts and images is an indispensable but labor-intensive step in the study of print materials. It is an often used methodology by textual scholars when the manuscript of the text does not exist. Although various methods and machines have been designed to assist in this labor, it still remains an expensive and time- consuming process, often requiring travel to distant repositories for the painstaking visual examination of multiple original copies. Efforts to digitize collation have so far depended on first transcribing the texts to be compared, thus introducing into the process more labor and expense, and also more potential error. Digital collation will instead automate the first stages of collation directly from the document images of the original texts, thereby speeding the process of comparison. We describe such a novel framework for digital collation in the third part of this work and provide qualitative results

    Automated separation of bone joint structures for medical image reconstruction

    Get PDF
    Automated separation of reconstructed bone joints from 3D medical images is a challenging task due to surrounding soft tissue and adjacent bones that can affect the clarity of bone boundaries. Existing approaches typically require human intervention to correct improper results of segmentation before the joint model is reconstructed. This dissertation presents a new methodology for separating bone joint models using a completely automated approach. Rather than trying to offer a solution for segmenting medical images, the proposed method first allows errors in the reconstructed model and later removes these errors without the help of a medical expert or technician. This method utilizes known anatomical information from a generic CAD model, which is a properly generated model of the anatomy of a similar human subject, with regard to age, gender, height, etc. The intent is to aid in the separation of bones in the joint areas by comparing the reconstructed model that might contain errors to the generic model which has individual bones separated properly. The human hip joint is employed as an example of algorithm implementation in this dissertation. The proposed method is a general approach that should be adequately flexible to extend to other type of joints such as knee and elbow

    Segmentation of Brain MRI

    Get PDF
    • …
    corecore