QuickCSG: Fast Arbitrary Boolean Combinations of N Solids
QuickCSG computes the result of general N-polyhedron Boolean expressions
without building an intermediate tree of solids. We propose a vertex-centric
view of the problem, which simplifies the identification of final geometric
contributions and facilitates its spatial decomposition. The problem is then
cast as a single KD-tree exploration, geared toward the result by early pruning
of any region of space that does not contribute to the final surface. We assume
strong regularity properties on the input meshes and that they are in general
position. This simplifying assumption, combined with the vertex-centric view,
improves the method's speed. Complemented with task-stealing parallelization,
the algorithm achieves breakthrough performance: one to two orders of magnitude
speedup over state-of-the-art CPU algorithms on Boolean operations over two to
dozens of polyhedra. The algorithm also outperforms GPU implementations that
rely on approximate discretizations, while producing an output free of
redundant facets. Despite the restrictive assumptions on the input, we show the
usefulness of QuickCSG for applications with large CSG problems and strong
temporal constraints, e.g., modeling for 3D printers, reconstruction from
visual hulls, and collision detection.
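The core of the vertex-centric view can be illustrated with a toy sketch: whether a point belongs to the final solid depends only on its inside/outside status with respect to each of the N input solids, evaluated through the Boolean expression. The names below are illustrative, not from the QuickCSG implementation, and the geometric inside/outside tests themselves are assumed given.

```python
# Minimal sketch of per-point CSG classification (illustrative only).
# inside_flags[i] is True if the point lies inside input solid i;
# boolean_fn maps these flags to membership in the final solid.

def csg_membership(inside_flags, boolean_fn):
    return boolean_fn(inside_flags)

# Example expression over three solids: (A ∪ B) \ C.
expr = lambda f: (f[0] or f[1]) and not f[2]

print(csg_membership([True, False, False], expr))  # inside A only -> True
print(csg_membership([True, False, True], expr))   # inside C removes it -> False
```

In QuickCSG this classification is not applied point by point: candidate vertices are generated and classified during a single KD-tree traversal, so whole regions whose flags cannot affect the expression are pruned early.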
Low-shot learning with large-scale diffusion
This paper considers the problem of inferring image labels from images when
only a few annotated examples are available at training time. This setup is
often referred to as low-shot learning, where a standard approach is to
re-train the last few layers of a convolutional neural network learned on
separate classes for which training examples are abundant. We consider a
semi-supervised setting that leverages a large collection of images to support
label propagation, made possible by recent advances in large-scale
similarity-graph construction.
We show that despite its conceptual simplicity, scaling label propagation up
to hundreds of millions of images yields state-of-the-art accuracy in the
low-shot learning regime.
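The label-propagation idea itself is simple and can be sketched on a toy graph. This is a minimal, assumed setup (dense matrices, a hand-built affinity graph), not the paper's billion-scale implementation: labels diffuse along graph edges while the few annotated nodes are clamped to their known labels.

```python
import numpy as np

# Toy label propagation on a similarity graph. W is a row-normalized
# affinity matrix; Y holds one-hot labels for annotated nodes and zeros
# for the rest. Labeled rows are clamped after every diffusion step.

def propagate(W, Y, labeled, alpha=0.9, iters=50):
    F = Y.copy()
    for _ in range(iters):
        F = alpha * W @ F + (1 - alpha) * Y
        F[labeled] = Y[labeled]          # clamp known labels
    return F.argmax(axis=1)

# A 4-node chain, 2 classes; node 0 labeled class 0, node 3 class 1.
W = np.array([[0, 1, 0, 0],
              [.5, 0, .5, 0],
              [0, .5, 0, .5],
              [0, 0, 1, 0]], dtype=float)
Y = np.zeros((4, 2))
Y[0, 0] = 1
Y[3, 1] = 1
labels = propagate(W, Y, labeled=[0, 3])
print(labels)  # unlabeled nodes inherit the class of their nearer seed
```

The scaling challenge addressed in the paper lies not in this recursion but in building and diffusing over a kNN graph with hundreds of millions of nodes.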
Inter-image homography estimation: the cases of mosaicing and real-time tracking, with applications in augmented reality
We approach homography estimation from several angles. On the one hand, we use it to generate panoramic image mosaics. We improve the well-known point-matching technique with a region-based approximation and a robust estimation scheme (LTTS). On the other hand, real-time homography estimation makes it possible to visually track planar targets. We organize classical algorithms into a hierarchy from most robust to most precise, and add occlusion handling based on matrix downdating. We extend this to tracking backgrounds represented by a panoramic image. By differencing, moving objects and people in the scene can then be segmented. We have applied this technique to augmented reality, since the objects become independently manipulable. It can also serve video coding: moving objects and the background can be coded separately.
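The basic estimation step underlying both mosaicing and planar tracking is fitting a homography to matched point pairs. A minimal sketch using the standard normalized-free DLT (direct linear transform) is shown below; this is illustrative only, and omits the robust estimation and real-time refinement the thesis builds on top of it.

```python
import numpy as np

# DLT homography estimation from N >= 4 point matches (sketch).
# Each match contributes two linear constraints on the 9 entries of H.

def homography_dlt(src, dst):
    """src, dst: (N, 2) arrays of matched points."""
    A = []
    for (x, y), (u, v) in zip(src, dst):
        A.append([-x, -y, -1, 0, 0, 0, u * x, u * y, u])
        A.append([0, 0, 0, -x, -y, -1, v * x, v * y, v])
    _, _, Vt = np.linalg.svd(np.asarray(A, dtype=float))
    H = Vt[-1].reshape(3, 3)   # null-space vector = homography up to scale
    return H / H[2, 2]

# Sanity check: points related by a pure translation (2, 3).
src = np.array([[0, 0], [1, 0], [1, 1], [0, 1]], dtype=float)
dst = src + [2, 3]
H = homography_dlt(src, dst)
print(np.round(H, 3))
```

In practice this least-squares fit is wrapped in a robust scheme (as in the thesis) so that mismatched or occluded points do not corrupt the estimate.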
Circulant temporal encoding for video retrieval and temporal alignment
We address the problem of specific video event retrieval. Given a query video
of a specific event, e.g., a concert of Madonna, the goal is to retrieve other
videos of the same event that temporally overlap with the query. Our approach
encodes the frame descriptors of a video to jointly represent their appearance
and temporal order. It exploits the properties of circulant matrices to
efficiently compare the videos in the frequency domain. This offers a
significant gain in complexity and accurately localizes the matching parts of
videos. The descriptors can be compressed in the frequency domain with a
product quantizer adapted to complex numbers. In this case, video retrieval is
performed without decompressing the descriptors. We also consider the temporal
alignment of a set of videos. We exploit the matching confidence and an
estimate of the temporal offset computed for all pairs of videos by our
retrieval approach. Our robust algorithm aligns the videos on a global timeline
by maximizing the set of temporally consistent matches. The global temporal
alignment enables synchronous playback of the videos of a given scene.
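The frequency-domain trick can be illustrated with a toy sketch: the correlation of two descriptor sequences over all circular temporal shifts is computed at once with FFTs, instead of sliding one sequence over the other. This is illustrative only; the paper additionally regularizes the comparison and compresses the Fourier-domain representation with a complex product quantizer.

```python
import numpy as np

# Toy circulant comparison: find the temporal shift that best aligns two
# frame-descriptor sequences via FFT-based circular cross-correlation.

def best_offset(a, b):
    """a, b: (T, d) descriptor sequences; returns the shift of b vs. a."""
    A = np.fft.fft(a, axis=0)
    B = np.fft.fft(b, axis=0)
    # Correlation score for every circular shift, in O(T d log T).
    scores = np.fft.ifft((np.conj(A) * B).sum(axis=1)).real
    return int(scores.argmax())

rng = np.random.default_rng(0)
x = rng.standard_normal((64, 8))
y = np.roll(x, 5, axis=0)          # same "video", delayed by 5 frames
print(best_offset(x, y))           # recovers the 5-frame offset
```

The score vector also provides the matching confidence used, in the paper, to align whole sets of videos on a global timeline.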
Link and code: Fast indexing with graphs and compact regression codes
Similarity search approaches based on graph walks have recently attained
outstanding speed-accuracy trade-offs, setting aside their memory
requirements. In
this paper, we revisit these approaches by considering, additionally, the
memory constraint required to index billions of images on a single server. This
leads us to propose a method based both on graph traversal and compact
representations. We encode the indexed vectors using quantization and exploit
the graph structure to refine the similarity estimation.
In essence, our method takes the best of these two worlds: the search
strategy is based on nested graphs, thereby providing high precision with a
relatively small set of comparisons. At the same time it offers a significant
memory compression. As a result, our approach outperforms the state of the art
at operating points of 64-128 bytes per vector, as demonstrated by our results
on two billion-scale public benchmarks.
Packing bag-of-features
One of the main limitations of image search based on bag-of-features is the memory usage per image: only a few million images can be handled on a single machine in reasonable response time. In this paper, we first evaluate how memory usage is reduced by lossless index compression. We then propose an approximate representation of bag-of-features obtained by projecting the corresponding histogram onto a set of pre-defined sparse projection functions, producing several image descriptors. Coupled with a proper indexing structure, an image is represented by a few hundred bytes. A distance-expectation criterion is then used to rank the images. Our method is at least one order of magnitude faster than standard bag-of-features while providing excellent search quality.
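The projection step can be sketched in a few lines: each pre-defined sparse projection aggregates a fixed random subset of histogram bins, so the full visual-word histogram is replaced by a much shorter vector. All names and sizes below are illustrative assumptions; the paper couples such projections with a dedicated index and the distance-expectation ranking, which are not shown here.

```python
import numpy as np

# Toy sparse projection of a bag-of-features histogram (sketch).
rng = np.random.default_rng(0)
vocab, n_proj, group = 1024, 64, 16

# Each projection sums a fixed, randomly chosen group of histogram bins.
groups = rng.choice(vocab, size=(n_proj, group), replace=True)

def pack(hist):
    """Replace the full histogram by n_proj aggregated components."""
    return hist[groups].sum(axis=1)

h = rng.poisson(0.2, vocab).astype(float)
desc = pack(h)
print(desc.shape)   # (64,) components instead of (1024,) bins
```

Because the projections are fixed in advance, two images can be compared directly in the compressed domain, which is what makes the few-hundred-byte representation searchable.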