Search CORE

1,497 research outputs found

Particular object retrieval with integral max-pooling of CNN activations

Author: Jégou Hervé
Sicre Ronan
Tolias Giorgos
Publication venue
Publication date: 24/02/2016
Field of study

Recently, image representation built upon Convolutional Neural Network (CNN) has been shown to provide effective descriptors for image search, outperforming pre-CNN features as short-vector representations. Yet such models are not compatible with geometry-aware re-ranking methods and still outperformed, on some particular object retrieval benchmarks, by traditional image search systems relying on precise descriptor matching, geometric re-ranking, or query expansion. This work revisits both retrieval stages, namely initial search and re-ranking, by employing the same primitive information derived from the CNN. We build compact feature vectors that encode several image regions without the need to feed multiple inputs to the network. Furthermore, we extend integral images to handle max-pooling on convolutional layer activations, allowing us to efficiently localize matching objects. The resulting bounding box is finally used for image re-ranking. As a result, this paper significantly improves existing CNN-based recognition pipeline: We report for the first time results competing with traditional methods on the challenging Oxford5k and Paris6k datasets

arXiv.org e-Print Archive

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

HAL-Rennes 1

Real-Time Chromakey Matting Using Image Statistics

Author: Beato Nicholas
Colbert Mark
Hughes Charles
Yamazawa Kazumasa
Zhang Yunjun
Publication venue: 'Information Bulletin on Variable Stars (IBVS)'
Publication date: 02/07/2013
Field of study

Given a video signal, we generate an alpha matte based on the chromakey information. The computation is done in interactive-time using pixel shaders. To accomplish this, we use Principle Components Analysis to generate a linear transformation matrix where the resulting color triplets Euclidean distance is directly related to the probability that the color exists in the chromakey spectrum. The result of this process is a trimap of the video signals opacity. To solve the alpha matte from the trimap, we minimize an energy function constrained by the trimap with gradient descent. This energy function is based on the least-squared error of overlapping neighborhoods around each pixel and is independent of the background or foreground color

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Modeling Camera Effects to Improve Visual Learning from Synthetic Data

Author: A Andreopoulos
A Foi
C Kanan
H Cheong
HC Karaimer
MD Grossberg
O Ronneberger
RS Hunter
S Annadurai
SA Bhukhanwala
SR Richter
Publication venue
Publication date: 01/10/2018
Field of study

Recent work has focused on generating synthetic imagery to increase the size and variability of training data for learning visual tasks in urban scenes. This includes increasing the occurrence of occlusions or varying environmental and weather effects. However, few have addressed modeling variation in the sensor domain. Sensor effects can degrade real images, limiting generalizability of network performance on visual tasks trained on synthetic data and tested in real environments. This paper proposes an efficient, automatic, physically-based augmentation pipeline to vary sensor effects --chromatic aberration, blur, exposure, noise, and color cast-- for synthetic imagery. In particular, this paper illustrates that augmenting synthetic training datasets with the proposed pipeline reduces the domain gap between synthetic and real domains for the task of object detection in urban driving scenes

arXiv.org e-Print Archive

Crossref