Search CORE

1,912 research outputs found

Fast Color Quantization Using Weighted Sort-Means Clustering

Author: Balasubramanian
Bing
Chang
Cheng
Dekker
Deng
Deng
Drineas
Equitz
Forgy
Gentile
Heckbert
Hu
Hu
Huang
Joy
Kanjanawanishkul
Kanungo
Kasuga
Kolen
Kuo
Linde
Lloyd
M. Emre Celebi
Orchard
Ozdemir
Papamarkos
Schaefer
Scheunders
Sirisathitkul
Wan
Xiang
Xiang
Yang
Yang
Publication venue: 'The Optical Society'
Publication date: 01/01/2009
Field of study

Color quantization is an important operation with numerous applications in graphics and image processing. Most quantization methods are essentially based on data clustering algorithms. However, despite its popularity as a general purpose clustering algorithm, k-means has not received much respect in the color quantization literature because of its high computational requirements and sensitivity to initialization. In this paper, a fast color quantization method based on k-means is presented. The method involves several modifications to the conventional (batch) k-means algorithm including data reduction, sample weighting, and the use of triangle inequality to speed up the nearest neighbor search. Experiments on a diverse set of images demonstrate that, with the proposed modifications, k-means becomes very competitive with state-of-the-art color quantization methods in terms of both effectiveness and efficiency.Comment: 30 pages, 2 figures, 4 table

arXiv.org e-Print Archive

CiteSeerX

Crossref

Superpixel Convolutional Networks using Bilateral Inceptions

Author: A Adams
ES Gastal
J Domke
JB Tenenbaum
K He
M Kiefel
R Achanta
S Gould
S He
S Nowozin
S Paris
T-Y Lin
Publication venue
Publication date: 08/08/2016
Field of study

In this paper we propose a CNN architecture for semantic image segmentation. We introduce a new 'bilateral inception' module that can be inserted in existing CNN architectures and performs bilateral filtering, at multiple feature-scales, between superpixels in an image. The feature spaces for bilateral filtering and other parameters of the module are learned end-to-end using standard backpropagation techniques. The bilateral inception module addresses two issues that arise with general CNN segmentation architectures. First, this module propagates information between (super) pixels while respecting image edges, thus using the structured information of the problem for improved results. Second, the layer recovers a full resolution segmentation result from the lower resolution solution of a CNN. In the experiments, we modify several existing CNN architectures by inserting our inception module between the last CNN (1x1 convolution) layers. Empirical results on three different datasets show reliable improvements not only in comparison to the baseline networks, but also in comparison to several dense-pixel prediction techniques such as CRFs, while being competitive in time.Comment: European Conference on Computer Vision (ECCV), 201

arXiv.org e-Print Archive

Crossref

MPG.PuRe

HAL-Ecole des Ponts ParisTech

HAL - UPEC / UPEM

Pooling-Invariant Image Feature Learning

Author: Darrell Trevor
Jia Yangqing
Vinyals Oriol
Publication venue
Publication date: 15/01/2013
Field of study

Unsupervised dictionary learning has been a key component in state-of-the-art computer vision recognition architectures. While highly effective methods exist for patch-based dictionary learning, these methods may learn redundant features after the pooling stage in a given early vision architecture. In this paper, we offer a novel dictionary learning scheme to efficiently take into account the invariance of learned features after the spatial pooling stage. The algorithm is built on simple clustering, and thus enjoys efficiency and scalability. We discuss the underlying mechanism that justifies the use of clustering algorithms, and empirically show that the algorithm finds better dictionaries than patch-based methods with the same dictionary size

arXiv.org e-Print Archive

CiteSeerX

Batch and median neural gas

Author: Alexander Hasenfuß
Barbara Hammer
Belkin
Blake
Borg
Bottou
Bunke
Cheng
Cottrell
Cottrell
Duda
Fort
Graepel
Guenter
Hammer
Heskes
Kaski
Kohonen
Kohonen
Lundsteen
Marie Cottrell
Martinetz
Martinetz
Mevissen
Murty
Ripley
Seo
Somervuo
Thomas Villmann
Villmann
Zhong
Publication venue: 'Elsevier BV'
Publication date: 01/01/2006
Field of study

Neural Gas (NG) constitutes a very robust clustering algorithm given euclidian data which does not suffer from the problem of local minima like simple vector quantization, or topological restrictions like the self-organizing map. Based on the cost function of NG, we introduce a batch variant of NG which shows much faster convergence and which can be interpreted as an optimization of the cost function by the Newton method. This formulation has the additional benefit that, based on the notion of the generalized median in analogy to Median SOM, a variant for non-vectorial proximity data can be introduced. We prove convergence of batch and median versions of NG, SOM, and k-means in a unified formulation, and we investigate the behavior of the algorithms in several experiments.Comment: In Special Issue after WSOM 05 Conference, 5-8 september, 2005, Pari

arXiv.org e-Print Archive

CiteSeerX

Crossref

Publications at Bielefeld University

HAL-Paris1

Deep Metric Learning via Lifted Structured Feature Embedding

Author: Jegelka Stefanie
Savarese Silvio
Song Hyun Oh
Xiang Yu
Publication venue
Publication date: 19/11/2015
Field of study

Learning the distance metric between pairs of examples is of great importance for learning and visual recognition. With the remarkable success from the state of the art convolutional neural networks, recent works have shown promising results on discriminatively training the networks to learn semantic feature embeddings where similar examples are mapped close to each other and dissimilar examples are mapped farther apart. In this paper, we describe an algorithm for taking full advantage of the training batches in the neural network training by lifting the vector of pairwise distances within the batch to the matrix of pairwise distances. This step enables the algorithm to learn the state of the art feature embedding by optimizing a novel structured prediction objective on the lifted problem. Additionally, we collected Online Products dataset: 120k images of 23k classes of online products for metric learning. Our experiments on the CUB-200-2011, CARS196, and Online Products datasets demonstrate significant improvement over existing deep feature embedding methods on all experimented embedding sizes with the GoogLeNet network.Comment: 11 page

arXiv.org e-Print Archive

DSpace@MIT

Crossref

Semantic Cross-View Matching

Author: Angst Roland
Castaldo Francesco
Palmieri Francesco
Savarese Silvio
Zamir Amir
Publication venue
Publication date: 01/01/2015
Field of study

Matching cross-view images is challenging because the appearance and viewpoints are significantly different. While low-level features based on gradient orientations or filter responses can drastically vary with such changes in viewpoint, semantic information of images however shows an invariant characteristic in this respect. Consequently, semantically labeled regions can be used for performing cross-view matching. In this paper, we therefore explore this idea and propose an automatic method for detecting and representing the semantic information of an RGB image with the goal of performing cross-view matching with a (non-RGB) geographic information system (GIS). A segmented image forms the input to our system with segments assigned to semantic concepts such as traffic signs, lakes, roads, foliage, etc. We design a descriptor to robustly capture both, the presence of semantic concepts and the spatial layout of those segments. Pairwise distances between the descriptors extracted from the GIS map and the query image are then used to generate a shortlist of the most promising locations with similar semantic concepts in a consistent spatial layout. An experimental evaluation with challenging query images and a large urban area shows promising results

arXiv.org e-Print Archive

MPG.PuRe

Archivio Istituzionale della Ricerca - Università degli Studi della Campania "Luigi Vanvitelli"