Search CORE

45,414 research outputs found

Unsupervised Feature Learning by Deep Sparse Coding

Author: He Yunlong
Kavukcuoglu Koray
Qi Yanjun
Szlam Arthur
Wang Yun
Publication venue
Publication date: 19/12/2013
Field of study

In this paper, we propose a new unsupervised feature learning framework, namely Deep Sparse Coding (DeepSC), that extends sparse coding to a multi-layer architecture for visual object recognition tasks. The main innovation of the framework is that it connects the sparse-encoders from different layers by a sparse-to-dense module. The sparse-to-dense module is a composition of a local spatial pooling step and a low-dimensional embedding process, which takes advantage of the spatial smoothness information in the image. As a result, the new method is able to learn several levels of sparse representation of the image which capture features at a variety of abstraction levels and simultaneously preserve the spatial smoothness between the neighboring image patches. Combining the feature representations from multiple layers, DeepSC achieves the state-of-the-art performance on multiple object recognition tasks.Comment: 9 pages, submitted to ICL

arXiv.org e-Print Archive

CiteSeerX

Provably scale-covariant networks from oriented quasi quadrature measures in cascade

Author: DH Hubel
DJ Heeger
DLK Yamins
E Adelson
EN Johnson
H Bay
J Bruna
J Touryan
J Westö
JJ Koenderink
JJ Koenderink
K Fukushima
L Liu
L Liu
L Liu
M Carandini
M Varma
RL Valois De
RLT Goris
T Lindeberg
T Lindeberg
T Lindeberg
T Lindeberg
T Lindeberg
T Ojala
T Serre
TH Chan
Z Cai
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

This article presents a continuous model for hierarchical networks based on a combination of mathematically derived models of receptive fields and biologically inspired computations. Based on a functional model of complex cells in terms of an oriented quasi quadrature combination of first- and second-order directional Gaussian derivatives, we couple such primitive computations in cascade over combinatorial expansions over image orientations. Scale-space properties of the computational primitives are analysed and it is shown that the resulting representation allows for provable scale and rotation covariance. A prototype application to texture analysis is developed and it is demonstrated that a simplified mean-reduced representation of the resulting QuasiQuadNet leads to promising experimental results on three texture datasets.Comment: 12 pages, 3 figures, 1 tabl

arXiv.org e-Print Archive

Digitala Vetenskapliga Arkivet - Academic Archive On-line

A fine-grained approach to scene text script identification

Author: Gomez Lluis
Karatzas Dimosthenis
Publication venue
Publication date: 24/02/2016
Field of study

This paper focuses on the problem of script identification in unconstrained scenarios. Script identification is an important prerequisite to recognition, and an indispensable condition for automatic text understanding systems designed for multi-language environments. Although widely studied for document images and handwritten documents, it remains an almost unexplored territory for scene text images. We detail a novel method for script identification in natural images that combines convolutional features and the Naive-Bayes Nearest Neighbor classifier. The proposed framework efficiently exploits the discriminative power of small stroke-parts, in a fine-grained classification framework. In addition, we propose a new public benchmark dataset for the evaluation of joint text detection and script identification in natural scenes. Experiments done in this new dataset demonstrate that the proposed method yields state of the art results, while it generalizes well to different datasets and variable number of scripts. The evidence provided shows that multi-lingual scene text recognition in the wild is a viable proposition. Source code of the proposed method is made available online

arXiv.org e-Print Archive