Search CORE

9,941 research outputs found

Improving Unsupervised Defect Segmentation by Applying Structural Similarity to Autoencoders

Author: Bergmann Paul
Fauser Michael
Löwe Sindy
Sattlegger David
Steger Carsten
Publication venue: 'Scitepress'
Publication date: 01/01/2019
Field of study

Convolutional autoencoders have emerged as popular methods for unsupervised defect segmentation on image data. Most commonly, this task is performed by thresholding a pixel-wise reconstruction error based on an

\ell^p

distance. This procedure, however, leads to large residuals whenever the reconstruction encompasses slight localization inaccuracies around edges. It also fails to reveal defective regions that have been visually altered when intensity values stay roughly consistent. We show that these problems prevent these approaches from being applied to complex real-world scenarios and that it cannot be easily avoided by employing more elaborate architectures such as variational or feature matching autoencoders. We propose to use a perceptual loss function based on structural similarity which examines inter-dependencies between local image regions, taking into account luminance, contrast and structural information, instead of simply comparing single pixel values. It achieves significant performance gains on a challenging real-world dataset of nanofibrous materials and a novel dataset of two woven fabrics over the state of the art approaches for unsupervised defect segmentation that use pixel-wise reconstruction error metrics

arXiv.org e-Print Archive

Crossref

Photon counting compressive depth mapping

Author: Howell John C.
Howland Gregory A.
Lum Daniel J.
Ware Matthew R.
Publication venue: 'The Optical Society'
Publication date: 17/09/2013
Field of study

We demonstrate a compressed sensing, photon counting lidar system based on the single-pixel camera. Our technique recovers both depth and intensity maps from a single under-sampled set of incoherent, linear projections of a scene of interest at ultra-low light levels around 0.5 picowatts. Only two-dimensional reconstructions are required to image a three-dimensional scene. We demonstrate intensity imaging and depth mapping at 256 x 256 pixel transverse resolution with acquisition times as short as 3 seconds. We also show novelty filtering, reconstructing only the difference between two instances of a scene. Finally, we acquire 32 x 32 pixel real-time video for three-dimensional object tracking at 14 frames-per-second.Comment: 16 pages, 8 figure

arXiv.org e-Print Archive

Chapman University Digital Commons

Positive Definite Kernels in Machine Learning

Author: Cuturi Marco
Publication venue
Publication date: 01/01/2009
Field of study

This survey is an introduction to positive definite kernels and the set of methods they have inspired in the machine learning literature, namely kernel methods. We first discuss some properties of positive definite kernels as well as reproducing kernel Hibert spaces, the natural extension of the set of functions

\{k(x,\cdot),x\in\mathcal{X}\}

associated with a kernel

k

defined on a space

\mathcal{X}

. We discuss at length the construction of kernel functions that take advantage of well-known statistical models. We provide an overview of numerous data-analysis methods which take advantage of reproducing kernel Hilbert spaces and discuss the idea of combining several kernels to improve the performance on certain tasks. We also provide a short cookbook of different kernels which are particularly useful for certain data-types such as images, graphs or speech segments.Comment: draft. corrected a typo in figure

arXiv.org e-Print Archive

CiteSeerX

Linear Spatial Pyramid Matching Using Non-convex and non-negative Sparse Coding for Image Classification

Author: Bao Chengqiang
He Liangtian
Wang Yilun
Publication venue
Publication date: 26/04/2015
Field of study

Recently sparse coding have been highly successful in image classification mainly due to its capability of incorporating the sparsity of image representation. In this paper, we propose an improved sparse coding model based on linear spatial pyramid matching(SPM) and Scale Invariant Feature Transform (SIFT ) descriptors. The novelty is the simultaneous non-convex and non-negative characters added to the sparse coding model. Our numerical experiments show that the improved approach using non-convex and non-negative sparse coding is superior than the original ScSPM[1] on several typical databases

arXiv.org e-Print Archive

Crossref