Search CORE

11,221 research outputs found

Generative Model with Coordinate Metric Learning for Object Recognition Based on 3D Models

Author: Deng Weihong
Wang Yida
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 24/04/2018
Field of study

Given large amount of real photos for training, Convolutional neural network shows excellent performance on object recognition tasks. However, the process of collecting data is so tedious and the background are also limited which makes it hard to establish a perfect database. In this paper, our generative model trained with synthetic images rendered from 3D models reduces the workload of data collection and limitation of conditions. Our structure is composed of two sub-networks: semantic foreground object reconstruction network based on Bayesian inference and classification network based on multi-triplet cost function for avoiding over-fitting problem on monotone surface and fully utilizing pose information by establishing sphere-like distribution of descriptors in each category which is helpful for recognition on regular photos according to poses, lighting condition, background and category information of rendered images. Firstly, our conjugate structure called generative model with metric learning utilizing additional foreground object channels generated from Bayesian rendering as the joint of two sub-networks. Multi-triplet cost function based on poses for object recognition are used for metric learning which makes it possible training a category classifier purely based on synthetic data. Secondly, we design a coordinate training strategy with the help of adaptive noises acting as corruption on input images to help both sub-networks benefit from each other and avoid inharmonious parameter tuning due to different convergence speed of two sub-networks. Our structure achieves the state of the art accuracy of over 50\% on ShapeNet database with data migration obstacle from synthetic images to real photos. This pipeline makes it applicable to do recognition on real images only based on 3D models.Comment: 14 page

arXiv.org e-Print Archive

Recommended from our members

A Goal-Directed Bayesian Framework for Categorization

Author: Acuna
Anderson
Anthony
Ashby
Ashby
Barsalou
Barsalou
Barsalou
Barto
Bishop
Botvinick
Botvinick
Bouton
Caramazza
Caramazza
Chater
Clark
Collins
Collins
Courville
Dayan
Dickinson
Dolan
Estes
FitzGerald
Friston
Friston
Friston
Friston
Friston
Friston
Friston
Gershman
Griffiths
Hinton
Hobson
Hoeting
Hohwy
Homa
Kauffman
Knill
Lamberts
Maddox
McClelland
Miller
Mirza
Nosofsky
Oaksford
Pezzulo
Pezzulo
Pezzulo
Pezzulo
Pezzulo
Rigoli
Rigoli
Rigoli
Roach
Rosch
Rosch
Rosch
Rosenblueth
Rosenman
Shenhav
Sjöberg
Smith
Solway
Squire
Stoianov
Tononi
Traulsen
Tulving
Tulving
Warrington
Warrington
Warrington
Warrington
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2017
Field of study

Categorization is a fundamental ability for efficient behavioral control. It allows organisms to remember the correct responses to categorical cues and not for every stimulus encountered (hence eluding computational cost or complexity), and to generalize appropriate responses to novel stimuli dependant on category assignment. Assuming the brain performs Bayesian inference, based on a generative model of the external world and future goals, we propose a computational model of categorization in which important properties emerge. These properties comprise the ability to infer latent causes of sensory experience, a hierarchical organization of latent causes, and an explicit inclusion of context and action representations. Crucially, these aspects derive from considering the environmental statistics that are relevant to achieve goals, and from the fundamental Bayesian principle that any generative model should be preferred over alternative models based on an accuracy-complexity trade-off. Our account is a step toward elucidating computational principles of categorization and its role within the Bayesian brain hypothesis

City Research Online

Crossref

Frontiers - Publisher Connector

PubMed Central

UCL Discovery

MPG.PuRe

Latent Fisher Discriminant Analysis

Author: Chen Gang
Publication venue
Publication date: 20/09/2013
Field of study

Linear Discriminant Analysis (LDA) is a well-known method for dimensionality reduction and classification. Previous studies have also extended the binary-class case into multi-classes. However, many applications, such as object detection and keyframe extraction cannot provide consistent instance-label pairs, while LDA requires labels on instance level for training. Thus it cannot be directly applied for semi-supervised classification problem. In this paper, we overcome this limitation and propose a latent variable Fisher discriminant analysis model. We relax the instance-level labeling into bag-level, is a kind of semi-supervised (video-level labels of event type are required for semantic frame extraction) and incorporates a data-driven prior over the latent variables. Hence, our method combines the latent variable inference and dimension reduction in an unified bayesian framework. We test our method on MUSK and Corel data sets and yield competitive results compared to the baseline approach. We also demonstrate its capacity on the challenging TRECVID MED11 dataset for semantic keyframe extraction and conduct a human-factors ranking-based experimental evaluation, which clearly demonstrates our proposed method consistently extracts more semantically meaningful keyframes than challenging baselines.Comment: 12 page

arXiv.org e-Print Archive

CiteSeerX

Breaking Sticks and Ambiguities with Adaptive Skip-gram

Author: Bartunov Sergey
Kondrashkin Dmitry
Osokin Anton
Vetrov Dmitry
Publication venue
Publication date: 15/11/2015
Field of study

Recently proposed Skip-gram model is a powerful method for learning high-dimensional word representations that capture rich semantic relationships between words. However, Skip-gram as well as most prior work on learning word representations does not take into account word ambiguity and maintain only single representation per word. Although a number of Skip-gram modifications were proposed to overcome this limitation and learn multi-prototype word representations, they either require a known number of word meanings or learn them using greedy heuristic approaches. In this paper we propose the Adaptive Skip-gram model which is a nonparametric Bayesian extension of Skip-gram capable to automatically learn the required number of representations for all words at desired semantic resolution. We derive efficient online variational learning algorithm for the model and empirically demonstrate its efficiency on word-sense induction task

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

Feature discovery and visualization of robot mission data using convolutional autoencoders and Bayesian nonparametric topic models

Author: Flaspohler Genevieve
Girdhar Yogesh
Roy Nicholas
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 30/11/2017
Field of study

The gap between our ability to collect interesting data and our ability to analyze these data is growing at an unprecedented rate. Recent algorithmic attempts to fill this gap have employed unsupervised tools to discover structure in data. Some of the most successful approaches have used probabilistic models to uncover latent thematic structure in discrete data. Despite the success of these models on textual data, they have not generalized as well to image data, in part because of the spatial and temporal structure that may exist in an image stream. We introduce a novel unsupervised machine learning framework that incorporates the ability of convolutional autoencoders to discover features from images that directly encode spatial information, within a Bayesian nonparametric topic model that discovers meaningful latent patterns within discrete data. By using this hybrid framework, we overcome the fundamental dependency of traditional topic models on rigidly hand-coded data representations, while simultaneously encoding spatial dependency in our topics without adding model complexity. We apply this model to the motivating application of high-level scene understanding and mission summarization for exploratory marine robots. Our experiments on a seafloor dataset collected by a marine robot show that the proposed hybrid framework outperforms current state-of-the-art approaches on the task of unsupervised seafloor terrain characterization.Comment: 8 page

arXiv.org e-Print Archive

Crossref

The supervised IBP: neighbourhood preserving infinite latent feature models

Author: Ghahramani Z
Knowles DA
Quadrianto N
Sharmanska V
Publication venue: Association for Uncertainty in Artificial Intelligence
Publication date: 01/01/2013
Field of study

We propose a probabilistic model to infer supervised latent variables in the Hamming space from observed data. Our model allows simultaneous inference of the number of binary latent variables, and their values. The latent variables preserve neighbourhood structure of the data in a sense that objects in the same semantic concept have similar latent values, and objects in different concepts have dissimilar latent values. We formulate the supervised infinite latent variable problem based on an intuitive principle of pulling objects together if they are of the same type, and pushing them apart if they are not. We then combine this principle with a flexible Indian Buffet Process prior on the latent variables. We show that the inferred supervised latent variables can be directly used to perform a nearest neighbour search for the purpose of retrieval. We introduce a new application of dynamically extending hash codes, and show how to effectively couple the structure of the hash codes with continuously growing structure of the neighbourhood preserving infinite latent feature space

arXiv.org e-Print Archive

CiteSeerX

IST PubRep

IST Austria: PubRep (Institute of Science and Technology)

Sussex Research Online

CUED - Cambridge University Engineering Department