Search CORE

5,529 research outputs found

Variational shape matching for shape classification and retrieval

Author: Benzinou Abdesslam
Fablet Ronan
Nasreddine Kamal
Publication venue: 'Elsevier BV'
Publication date: 01/01/2010
Field of study

International audienceIn this paper we define a multi-scale distance between shapes based on geodesics in the shape space. The proposed distance, robust to outliers, uses shape matching to compare shapes locally. The multi-scale analysis is introduced in order to address local and global variabilities. The resulting similarity measure is invariant to translation, rotation and scaling independently of constraints or landmarks, but constraints can be added to the approach formulation when needed. An evaluation of the proposed approach is reported for shape classification and shape retrieval on the part B of the MPEG-7 shape database. The proposed approach is shown to significantly outperform previous works and reaches 89.05% of retrieval accuracy and 98.86% of correct classification rate

HAL-Université de Bretagne Occidentale

HAL Descartes

Hal-Diderot

Learning a Hierarchical Latent-Variable Model of 3D Shapes

Author: Giles C. Lee
Liu Shikun
Ororbia II Alexander G.
Publication venue
Publication date: 04/08/2018
Field of study

We propose the Variational Shape Learner (VSL), a generative model that learns the underlying structure of voxelized 3D shapes in an unsupervised fashion. Through the use of skip-connections, our model can successfully learn and infer a latent, hierarchical representation of objects. Furthermore, realistic 3D objects can be easily generated by sampling the VSL's latent probabilistic manifold. We show that our generative model can be trained end-to-end from 2D images to perform single image 3D model retrieval. Experiments show, both quantitatively and qualitatively, the improved generalization of our proposed model over a range of tasks, performing better or comparable to various state-of-the-art alternatives.Comment: Accepted as oral presentation at International Conference on 3D Vision (3DV), 201

arXiv.org e-Print Archive

Crossref

LiveSketch: Query Perturbations for Guided Sketch-based Visual Search

Author: Bui Tu
Collomosse John
Jin Hailin
Publication venue
Publication date: 01/01/2019
Field of study

LiveSketch is a novel algorithm for searching large image collections using hand-sketched queries. LiveSketch tackles the inherent ambiguity of sketch search by creating visual suggestions that augment the query as it is drawn, making query specification an iterative rather than one-shot process that helps disambiguate users' search intent. Our technical contributions are: a triplet convnet architecture that incorporates an RNN based variational autoencoder to search for images using vector (stroke-based) queries; real-time clustering to identify likely search intents (and so, targets within the search embedding); and the use of backpropagation from those targets to perturb the input stroke sequence, so suggesting alterations to the query in order to guide the search. We show improvements in accuracy and time-to-task over contemporary baselines using a 67M image corpus.Comment: Accepted to CVPR 201

arXiv.org e-Print Archive

Crossref

University of Surrey

Surrey Research Insight

GRASS: Generative Recursive Autoencoders for Shape Structures

Author: Chaudhuri Siddhartha
Guibas Leonidas
Li Jun
Xu Kai
Yumer Ersin
Zhang Hao
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2017
Field of study

We introduce a novel neural network architecture for encoding and synthesis of 3D shapes, particularly their structures. Our key insight is that 3D shapes are effectively characterized by their hierarchical organization of parts, which reflects fundamental intra-shape relationships such as adjacency and symmetry. We develop a recursive neural net (RvNN) based autoencoder to map a flat, unlabeled, arbitrary part layout to a compact code. The code effectively captures hierarchical structures of man-made 3D objects of varying structural complexities despite being fixed-dimensional: an associated decoder maps a code back to a full hierarchy. The learned bidirectional mapping is further tuned using an adversarial setup to yield a generative model of plausible structures, from which novel structures can be sampled. Finally, our structure synthesis framework is augmented by a second trained module that produces fine-grained part geometry, conditioned on global and local structural context, leading to a full generative pipeline for 3D shapes. We demonstrate that without supervision, our network learns meaningful structural hierarchies adhering to perceptual grouping principles, produces compact codes which enable applications such as shape classification and partial matching, and supports shape synthesis and interpolation with significant variations in topology and geometry.Comment: Corresponding author: Kai Xu ([email protected]

arXiv.org e-Print Archive

Dspace at IIT Bombay

Hybrid Bayesian Eigenobjects: Combining Linear Subspace and Deep Network Methods for 3D Robot Vision

Author: Burchfiel Benjamin
Konidaris George
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 27/07/2018
Field of study

We introduce Hybrid Bayesian Eigenobjects (HBEOs), a novel representation for 3D objects designed to allow a robot to jointly estimate the pose, class, and full 3D geometry of a novel object observed from a single viewpoint in a single practical framework. By combining both linear subspace methods and deep convolutional prediction, HBEOs efficiently learn nonlinear object representations without directly regressing into high-dimensional space. HBEOs also remove the onerous and generally impractical necessity of input data voxelization prior to inference. We experimentally evaluate the suitability of HBEOs to the challenging task of joint pose, class, and shape inference on novel objects and show that, compared to preceding work, HBEOs offer dramatically improved performance in all three tasks along with several orders of magnitude faster runtime performance.Comment: To appear in the International Conference on Intelligent Robots (IROS) - Madrid, 201

arXiv.org e-Print Archive

Crossref