5,072 research outputs found

    DepthCut: Improved Depth Edge Estimation Using Multiple Unreliable Channels

    Get PDF
    In the context of scene understanding, a variety of methods exists to estimate different information channels from mono or stereo images, including disparity, depth, and normals. Although several advances have been reported in the recent years for these tasks, the estimated information is often imprecise particularly near depth discontinuities or creases. Studies have however shown that precisely such depth edges carry critical cues for the perception of shape, and play important roles in tasks like depth-based segmentation or foreground selection. Unfortunately, the currently extracted channels often carry conflicting signals, making it difficult for subsequent applications to effectively use them. In this paper, we focus on the problem of obtaining high-precision depth edges (i.e., depth contours and creases) by jointly analyzing such unreliable information channels. We propose DepthCut, a data-driven fusion of the channels using a convolutional neural network trained on a large dataset with known depth. The resulting depth edges can be used for segmentation, decomposing a scene into depth layers with relatively flat depth, or improving the accuracy of the depth estimate near depth edges by constraining its gradients to agree with these edges. Quantitatively, we compare against 15 variants of baselines and demonstrate that our depth edges result in an improved segmentation performance and an improved depth estimate near depth edges compared to data-agnostic channel fusion. Qualitatively, we demonstrate that the depth edges result in superior segmentation and depth orderings.Comment: 12 page

    A perception and manipulation system for collecting rock samples

    Get PDF
    An important part of a planetary exploration mission is to collect and analyze surface samples. As part of the Carnegie Mellon University Ambler Project, researchers are investigating techniques for collecting samples using a robot arm and a range sensor. The aim of this work is to make the sample collection operation fully autonomous. Described here are the components of the experimental system, including a perception module that extracts objects of interest from range images and produces models of their shapes, and a manipulation module that enables the system to pick up the objects identified by the perception module. The system was tested on a small testbed using natural terrain

    Part Description and Segmentation Using Contour, Surface and Volumetric Primitives

    Get PDF
    The problem of part definition, description, and decomposition is central to the shape recognition systems. The Ultimate goal of segmenting range images into meaningful parts and objects has proved to be very difficult to realize, mainly due to the isolation of the segmentation problem from the issue of representation. We propose a paradigm for part description and segmentation by integration of contour, surface, and volumetric primitives. Unlike previous approaches, we have used geometric properties derived from both boundary-based (surface contours and occluding contours), and primitive-based (quadric patches and superquadric models) representations to define and recover part-whole relationships, without a priori knowledge about the objects or object domain. The object shape is described at three levels of complexity, each contributing to the overall shape. Our approach can be summarized as answering the following question : Given that we have all three different modules for extracting volume, surface and boundary properties, how should they be invoked, evaluated and integrated? Volume and boundary fitting, and surface description are performed in parallel to incorporate the best of the coarse to fine and fine to coarse segmentation strategy. The process involves feedback between the segmentor (the Control Module) and individual shape description modules. The control module evaluates the intermediate descriptions and formulates hypotheses about parts. Hypotheses are further tested by the segmentor and the descriptors. The descriptions thus obtained are independent of position, orientation, scale, domain and domain properties, and are based purely on geometric considerations. They are extremely useful for the high level domain dependent symbolic reasoning processes, which need not deal with tremendous amount of data, but only with a rich description of data in terms of primitives recovered at various levels of complexity

    Range Image Segmentation for 3-D Object Recognition

    Get PDF
    Three dimensional scene analysis in an unconstrained and uncontrolled environment is the ultimate goal of computer vision. Explicit depth information about the scene is of tremendous help in segmentation and recognition of objects. Range image interpretation with a view of obtaining low-level features to guide mid-level and high-level segmentation and recognition processes is described. No assumptions about the scene are made and algorithms are applicable to any general single viewpoint range image. Low-level features like step edges and surface characteristics are extracted from the images and segmentation is performed based on individual features as well as combination of features. A high level recognition process based on superquadric fitting is described to demonstrate the usefulness of initial segmentation based on edges. A classification algorithm based on surface curvatures is used to obtain initial segmentation of the scene. Objects segmented using edge information are then classified using surface curvatures. Various applications of surface curvatures in mid and high level recognition processes are discussed. These include surface reconstruction, segmentation into convex patches and detection of smooth edges. Algorithms are run on real range images and results are discussed in detail

    3-D data handling and registration of multiple modality medical images

    Get PDF
    The many different clinical imaging modalities used in diagnosis and therapy deliver two different types of information: morphological and functional. Clinical interpretation can be assisted and enhanced by combining such information (e.g. superimposition or fusion). The handling of such data needs to be performed in 3-D. Various methods for registration developed by other authors are reviewed and compared. Many of these are based on registering external reference markers, and are cumbersome and present significant problems to both patients and operators. Internal markers have also been used, but these may be very difficult to identify. Alternatively, methods based on the external surface of an object have been developed which eliminate some of the problems associated with the other methods. Thus the methods which have been extended, developed, and described here, are based primarily on the fitting of surfaces, as determined from images obtained from the different modalities to be registered. Annex problems to that of the surface fitting are those of surface detection and display. Some segmentation and surface reconstruction algorithms have been developed to identify the surface to be registered. Surface and volume rendering algorithms have also been implemented to facilitate the display of clinical results. An iterative surface fitting algorithm has been developed based on the minimization of a least squares distance (LSD) function, using the Powell method and alternative minimization algorithms. These algorithms and the qualities of fit so obtained were intercompared. Some modifications were developed to enhance the speed of convergence, to improve the accuracy, and to enhance the display of results during the process of fitting. A common problem with all such methods was found to be the choice of the starting point (the initial transformation parameters) and the avoidance of local minima which often require manual operator intervention. The algorithm was modified to apply a global minimization by using a cumulative distance error in a sequentially terminated process in order to speed up the time of evaluating of each search location. An extension of the algorithm into multi-resolution (scale) space was also implemented. An initial global search is performed at coarse resolution for the 3-D surfaces of both modalities where an appropriate threshold is defined to reject likely mismatch transformations by testing of only a limited subset of surface points. This process is used to define the set of points in the transformation space to be used for the next level of resolution, again with appropriately chosen threshold levels, and continued down to the finest resolution level. All these processes were evaluated using sets of well defined image models. The assessment of this algorithm for 3-D surface registration of data from (3-D) MRI with MRI, MRI with PET, MRI with SPECT, and MRI with CT data is presented, and clinical examples are illustrated and assessed. In the current work, the data from multi-modality imaging of two different types phantom (e.g. Hoffman brain phantom, Jaszczak phantom), thirty routinely imaged patients and volunteer subjects, and ten patients with setting external markers on their head were used to assess and verify 3-D registration. The accuracy of the sequential multi-resolution method obtained by the distance values of 4-10 selected reference points on each data set gave an accuracy of 1.44±0.42 mm for MR-MR, 1.82±0.65 for MR-CT, 2.38±0.88 for MR-PET, and 3.17±1.12 for MR-SPECT registration. The cost of this process was determined to be of the order of 200 seconds (on a Micro-VAX II), although this is highly dependent on some adjustable parameters of the process (e.g. threshold and the size of the geometrical transformation space) by which the accuracy is aimed

    Image processing for plastic surgery planning

    Get PDF
    This thesis presents some image processing tools for plastic surgery planning. In particular, it presents a novel method that combines local and global context in a probabilistic relaxation framework to identify cephalometric landmarks used in Maxillofacial plastic surgery. It also uses a method that utilises global and local symmetry to identify abnormalities in CT frontal images of the human body. The proposed methodologies are evaluated with the help of several clinical data supplied by collaborating plastic surgeons

    Beyond factual to formulated silhouettes

    Get PDF
    When sketching terrain, a view-dependent framework of silhouette-related cues is required. This framework is prominent in manual sketches and is especially important in small-scale depictions viewed obliquely from above. Occluding contours, namely the lines delineating depth discontinuities in the projected surface, are insufficient for forming this framework. The role which the occluding contour, or Factual Silhouette, plays in structuring the sketch becomes increasingly minimal as more of the terrain becomes visible, as the viewpoint is raised.The aim of this research is to extend the set of occluding contours to encompass situations that are perceived as causing an occlusion and would therefore be sketched in a similar manner. These locations, termed Formulated Silhouettes supplement the set of occluding contours and provide a successful structuring framework. The proposed method processes visible areas of terrain, which are turning away from view, to extract a classified, vector-based description for a given view of a Digital Elevation Model. Background approaches to silhouette rendering are reviewed and the specific contributions of this thesis are discussed.The method is tested using case studies composed of terrain of varying scale and character and two application studies demonstrate how silhouettes can be used to enhance existing terrain visualization techniques, both abstract and realistic. In addition, consultation with cartographic designers provides external verification of the research. The thesis concludes by noting how silhouette contours relate to perceived entities rather than actual occlusions
    corecore