20 research outputs found

    3D Object Class Detection in the Wild

    Full text link
    Object class detection has been a synonym for 2D bounding box localization for the longest time, fueled by the success of powerful statistical learning techniques, combined with robust image representations. Only recently, there has been a growing interest in revisiting the promise of computer vision from the early days: to precisely delineate the contents of a visual scene, object by object, in 3D. In this paper, we draw from recent advances in object detection and 2D-3D object lifting in order to design an object class detector that is particularly tailored towards 3D object class detection. Our 3D object class detection method consists of several stages gradually enriching the object detection output with object viewpoint, keypoints and 3D shape estimates. Following careful design, in each stage it constantly improves the performance and achieves state-ofthe-art performance in simultaneous 2D bounding box and viewpoint estimation on the challenging Pascal3D+ dataset

    Detail-preserving and Content-aware Variational Multi-view Stereo Reconstruction

    Full text link
    Accurate recovery of 3D geometrical surfaces from calibrated 2D multi-view images is a fundamental yet active research area in computer vision. Despite the steady progress in multi-view stereo reconstruction, most existing methods are still limited in recovering fine-scale details and sharp features while suppressing noises, and may fail in reconstructing regions with few textures. To address these limitations, this paper presents a Detail-preserving and Content-aware Variational (DCV) multi-view stereo method, which reconstructs the 3D surface by alternating between reprojection error minimization and mesh denoising. In reprojection error minimization, we propose a novel inter-image similarity measure, which is effective to preserve fine-scale details of the reconstructed surface and builds a connection between guided image filtering and image registration. In mesh denoising, we propose a content-aware p\ell_{p}-minimization algorithm by adaptively estimating the pp value and regularization parameters based on the current input. It is much more promising in suppressing noise while preserving sharp features than conventional isotropic mesh smoothing. Experimental results on benchmark datasets demonstrate that our DCV method is capable of recovering more surface details, and obtains cleaner and more accurate reconstructions than state-of-the-art methods. In particular, our method achieves the best results among all published methods on the Middlebury dino ring and dino sparse ring datasets in terms of both completeness and accuracy.Comment: 14 pages,16 figures. Submitted to IEEE Transaction on image processin

    A multi-view probabilistic model for 3D object classes

    Get PDF

    Representing images of a rotating object with cyclic permutation for view-based pose estimation

    Get PDF
    In this paper, we propose a novel approach using a cyclic group to model the appearance change in an image sequence of an object rotated about an arbitrary axis (1DOF out-of-plane rotation). In the sequence, an image xj is followed by an image xj+1. We represent the relationship between images by a cyclic group as xj+1 = Gxj , and obtain the matrix G by real block diagonalization. Then, G to the power of a real number is used to represent the image sequence and also for pose estimation. Two estimation methods are proposed and evaluated with real image sequences from the COIL-20, COIL-100, and ALOI datasets, and also compared to the Parametric Eigenspace method. Additionally, we discuss the relationship of the proposed approach to the pixel-wise Discrete Fourier Transform (DFT) and to linear regression, and also outline several extensions

    Agro-ecological evaluation of sustainable area for citrus crop production in Ramsar District, Iran

    Get PDF
    Citrus growing is regarded as an important cash crop in Ramsar, Iran. Ramsar District has a temperate climate zone, while citrus is a sub-tropical fruit. Few studies on citrus crop in terms of negative environmental factors have been carried out by researchers around the world. This study aims to integrate Geographical Information System (GIS) and Analytical Network Process (ANP) model for determination of citrus suitability zones. This study evaluates the agro-ecological suitability, determine potentials and constraints of the region based on effective criteria using ANP model. ANP model was used to determine suitable, moderate and unsuitable areas based on (i) socio-economic, morphometry and hydro-climate factors using 15 layers based on experts’ opinion; (ii) Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) satellite image of the year 2003 with 98.45% overall accuracy, and (iii) developed Multiple Linear Regression (MLR) model for citrus prediction. Thereby, weighted overlay of 15 factors was obtained using GIS. In this study, the citrus orchards map of 2003 and the new map of the citrus areas of 2014 namely Citrus State Development Program (CSDP) of the study area were compared. The results of this study demonstrated: (i) suitable areas (free risk areas) based on negative environmental factors and areas which are susceptible to citrus plantation; (ii) high-risk areas which are unsuitable for citrus plantation, and (iii) the high weights derived by ANP model were assigned to altitude, frost and minimum temperature. The MLR model was successfully developed to predict citrus yield in Ramsar District by 10% error. The MLR model would propose optimum citrus crop production areas. As conclusion, the main outcome of this study could help growers and decision makers to enhance the current citrus management activities for current and future citrus planning

    Joint Target Tracking and Recognition Using Shape-based Generative Model

    Get PDF
    Recently a generative model that combines both of identity and view manifolds was proposed for multi-view shape modeling that was originally used for pose estimation and recognition of civilian vehicles from image sequences. In this thesis, we extend this model to both civilian and military vehicles, and examine its effectiveness for real-world automated target tracking and recognition (ATR) applications in both infrared and visible image sequences. A particle filter-based ATR algorithm is introduced where the generative model is used for shape interpolation along both the view and identity manifolds. The ATR algorithm is tested on the newly released SENSIAC (Military Sensing Information Analysis Center) infrared database along with some visible-band image sequences. Overall tracking and recognition performance is evaluated in terms of the accuracy of 3D position/pose estimation and target classification.</School of Electrical & Computer Engineerin

    Інтерактивна система розпізнавання 3D об’єктів

    Get PDF
    Пояснювальна записка дипломного проекту складається з чотирьох розділів, містить 108 сторінок, 22 рисунки, 6 таблиць, 159 джерел, 9 ілюстративних матеріалів – загалом 108 сторінок. Об`єкт дослідження: 3D-об’єкти. Мета магістерської дисертації: підвищення ефективності системи розпізнавання тривимірних об’єктів, на основі аналізу двовимірних зображень. У вступі викладена мета, об’єкт і предмет дослідження, актуальність та новизна дисертації. Перший розділ містить аналіз сучасного стану систем розпізнавання просторових об’єктів. У другому розділі розглянуто та проаналізовано модифікований метод розпізнавання просторових об’єктів. У третьому розділі описано розробку структури та елементів інтерактивної системи розпізнавання тривимірних об’єктів. Четвертий розділ містить дослідження інтерактивної системи розпізнавання 3d-об’єктів.The explanatory note of the diploma project consists of four sections, contains 108 pages, 22 figures, 6 tables, 159 sources, 9 illustrative materials - a total of 108 pages. Object of research: 3D objects. The purpose of the master's dissertation: to increase the efficiency of the system of recognition of three-dimensional objects, based on the analysis of two-dimensional images. The introduction outlines the purpose, object and subject of research, relevance and novelty of the dissertation. The first section provides an analysis of the current state of spatial object recognition systems. The second section discusses and analyzes a modified method of spatial object recognition. The third section describes the development of the structure and elements of an interactive system for recognizing three-dimensional objects. The fourth section contains research on an interactive 3D object recognition system

    Machine Learning Approaches to Human Body Shape Analysis

    Get PDF
    Soft biometrics, biomedical sciences, and many other fields of study pay particular attention to the study of the geometric description of the human body, and its variations. Although multiple contributions, the interest is particularly high given the non-rigid nature of the human body, capable of assuming different poses, and numerous shapes due to variable body composition. Unfortunately, a well-known costly requirement in data-driven machine learning, and particularly in the human-based analysis, is the availability of data, in the form of geometric information (body measurements) with related vision information (natural images, 3D mesh, etc.). We introduce a computer graphics framework able to generate thousands of synthetic human body meshes, representing a population of individuals with stratified information: gender, Body Fat Percentage (BFP), anthropometric measurements, and pose. This contribution permits an extensive analysis of different bodies in different poses, avoiding the demanding, and expensive acquisition process. We design a virtual environment able to take advantage of the generated bodies, to infer the body surface area (BSA) from a single view. The framework permits to simulate the acquisition process of newly introduced RGB-D devices disentangling different noise components (sensor noise, optical distortion, body part occlusions). Common geometric descriptors in soft biometric, as well as in biomedical sciences, are based on body measurements. Unfortunately, as we prove, these descriptors are not pose invariant, constraining the usability in controlled scenarios. We introduce a differential geometry approach assuming body pose variations as isometric transformations of the body surface, and body composition changes covariant to the body surface area. This setting permits the use of the Laplace-Beltrami operator on the 2D body manifold, describing the body with a compact, efficient, and pose invariant representation. We design a neural network architecture able to infer important body semantics from spectral descriptors, closing the gap between abstract spectral features, and traditional measurement-based indices. Studying the manifold of body shapes, we propose an innovative generative adversarial model able to learn the body shapes. The method permits to generate new bodies with unseen geometries as a walk on the latent space, constituting a significant advantage over traditional generative methods
    corecore