Search CORE

3 research outputs found

Embedding based on function approximation for large scale image search

Author: Cheung Ngai-Man
Do Thanh-Toan
Publication venue
Publication date: 03/04/2017
Field of study

The objective of this paper is to design an embedding method that maps local features describing an image (e.g. SIFT) to a higher dimensional representation useful for the image retrieval problem. First, motivated by the relationship between the linear approximation of a nonlinear function in high dimensional space and the stateof-the-art feature representation used in image retrieval, i.e., VLAD, we propose a new approach for the approximation. The embedded vectors resulted by the function approximation process are then aggregated to form a single representation for image retrieval. Second, in order to make the proposed embedding method applicable to large scale problem, we further derive its fast version in which the embedded vectors can be efficiently computed, i.e., in the closed-form. We compare the proposed embedding methods with the state of the art in the context of image search under various settings: when the images are represented by medium length vectors, short vectors, or binary vectors. The experimental results show that the proposed embedding methods outperform existing the state of the art on the standard public image retrieval benchmarks.Comment: Accepted to TPAMI 2017. The implementation and precomputed features of the proposed F-FAemb are released at the following link: http://tinyurl.com/F-FAem

arXiv.org e-Print Archive

Adelaide Research & Scholarship

Active Object Classification from 3D Range Data with Mobile Robots

Author: Patten Timothy
Publication venue: Faculty of Engineering and Information Technologies, School of Aerospace, Mechanical and Mechatronic Engineering
Publication date: 01/01/2017
Field of study

This thesis addresses the problem of how to improve the acquisition of 3D range data with a mobile robot for the task of object classification. Establishing the identities of objects in unknown environments is fundamental for robotic systems and helps enable many abilities such as grasping, manipulation, or semantic mapping. Objects are recognised by data obtained from sensor observations, however, data is highly dependent on viewpoint; the variation in position and orientation of the sensor relative to an object can result in large variation in the perception quality. Additionally, cluttered environments present a further challenge because key data may be missing. These issues are not always solved by traditional passive systems where data are collected from a fixed navigation process then fed into a perception pipeline. This thesis considers an active approach to data collection by deciding where is most appropriate to make observations for the perception task. The core contributions of this thesis are a non-myopic planning strategy to collect data efficiently under resource constraints, and supporting viewpoint prediction and evaluation methods for object classification. Our approach to planning uses Monte Carlo methods coupled with a classifier based on non-parametric Bayesian regression. We present a novel anytime and non-myopic planning algorithm, Monte Carlo active perception, that extends Monte Carlo tree search to partially observable environments and the active perception problem. This is combined with a particle-based estimation process and a learned observation likelihood model that uses Gaussian process regression. To support planning, we present 3D point cloud prediction algorithms and utility functions that measure the quality of viewpoints by their discriminatory ability and effectiveness under occlusion. The utility of viewpoints is quantified by information-theoretic metrics, such as mutual information, and an alternative utility function that exploits learned data is developed for special cases. The algorithms in this thesis are demonstrated in a variety of scenarios. We extensively test our online planning and classification methods in simulation as well as with indoor and outdoor datasets. Furthermore, we perform hardware experiments with different mobile platforms equipped with different types of sensors. Most significantly, our hardware experiments with an outdoor robot are to our knowledge the first demonstrations of online active perception in a real outdoor environment. Active perception has broad significance in many applications. This thesis emphasises the advantages of an active approach to object classification and presents its assimilation with a wide range of robotic systems, sensors, and perception algorithms. By demonstration of performance enhancements and diversity, our hope is that the concept of considering perception and planning in an integrated manner will be of benefit in improving current systems that rely on passive data collection

Sydney eScholarship

Identificación de la fuente de adquisición de ficheros multimedia de dispositivos móviles mediante Deep Learning

Author: Outeda Rodríguez Almudena
Publication venue
Publication date: 01/01/2019
Field of study

Actualmente, la sociedad vive rodeada de contenido multimedia como son las imágenes y los vídeos. La presencia de dispositivos electrónicos capaces de realizar fotografías o grabar videos es una realidad en nuestra vida cotidiana, y su número aumenta con el paso del tiempo. La gran mayoría de la sociedad lleva un móvil en el bolsillo y hace uso de él para realizar fotos o vídeos. Ligado a ello, con los años han ido apareciendo técnicas de falsificación y manipulación de contenidos multimedia que dificultan saber si ese contenido es auténtico o no y de dónde procede, lo que hace que las técnicas de análisis forense sean una necesidad actual. En este trabajo se propone una red neuronal convolucional capaz de identificar la fuente de adquisición de vídeos grabados con un dispositivo móvil. Los resultados obtenidos de los experimentos realizados en este trabajo demuestran la eficiencia de métodos propuestos. Para la evaluación de los métodos propuestos se realizaron experimentos con un dataset público ampliamente utilizado en la literatura y un dataset generado

Docta Complutense