25 research outputs found

    Graph Spectral Image Processing

    Full text link
    Recent advent of graph signal processing (GSP) has spurred intensive studies of signals that live naturally on irregular data kernels described by graphs (e.g., social networks, wireless sensor networks). Though a digital image contains pixels that reside on a regularly sampled 2D grid, if one can design an appropriate underlying graph connecting pixels with weights that reflect the image structure, then one can interpret the image (or image patch) as a signal on a graph, and apply GSP tools for processing and analysis of the signal in graph spectral domain. In this article, we overview recent graph spectral techniques in GSP specifically for image / video processing. The topics covered include image compression, image restoration, image filtering and image segmentation

    Fast, Exact and Multi-Scale Inference for Semantic Image Segmentation with Deep Gaussian CRFs

    Get PDF
    In this work we propose a structured prediction technique that combines the virtues of Gaussian Conditional Random Fields (G-CRF) with Deep Learning: (a) our structured prediction task has a unique global optimum that is obtained exactly from the solution of a linear system (b) the gradients of our model parameters are analytically computed using closed form expressions, in contrast to the memory-demanding contemporary deep structured prediction approaches that rely on back-propagation-through-time, (c) our pairwise terms do not have to be simple hand-crafted expressions, as in the line of works building on the DenseCRF, but can rather be `discovered' from data through deep architectures, and (d) out system can trained in an end-to-end manner. Building on standard tools from numerical analysis we develop very efficient algorithms for inference and learning, as well as a customized technique adapted to the semantic segmentation task. This efficiency allows us to explore more sophisticated architectures for structured prediction in deep learning: we introduce multi-resolution architectures to couple information across scales in a joint optimization framework, yielding systematic improvements. We demonstrate the utility of our approach on the challenging VOC PASCAL 2012 image segmentation benchmark, showing substantial improvements over strong baselines. We make all of our code and experiments available at {https://github.com/siddharthachandra/gcrf}Comment: Our code is available at https://github.com/siddharthachandra/gcr

    Nonparametric multiscale energy-based model and its application in some imagery problems

    Full text link

    확률론적 순차적 그래프 근사를 이용한 MRF 최적화

    Get PDF
    학위논문 (석사)-- 서울대학교 대학원 : 전기·컴퓨터공학부, 2015. 2. 이경무.Markov random elds have been powerful models in computer vision but tractable algorithms to obtain exact solution for the corresponding energy functions are lim- itedapproximate solutions, in most cases are provided for efficiency. In this work graduated optimization technique is applied in a novel way to develop an efficient al- gorithm for solving general multi-label MRF optimization problem called Stochastic Graduated graph approximation (SGGA) algorithm. The algorithm initially min- imizes a simplied function and progressively transforms that function until it is equivalent to the original function. However, it is hard to nd how to generate the sequence of intermediate functions and what parameter to use for making transition from one problem to another. For this we propose a new iterative method of build- ing the sequence of approximations for the original energy function. We exploit a stochastic method to generate intermediate functions, which guides the intermedi- ate solutions to the near-optimal solution for the original problem. The transition from one intermediate problem to another is controlled by the schedule of gradual addition of edges. In each iteration, a deterministic algorithm such as block ICM is applied to minimize intermediate functions and to generate initial solution for the next problem. The proposed algorithm guarantees the convergence of local mini- mum. We test on a synthetic image deconvolution problem and also on the set of experiments with the OpenGM2 benchmark.Abstract i Contents iii List of Figures iv List of Tables viii 1 Introduction 2 1.1 Background of research . . . . . . . . . . . . . . . . . . . . . . . . . 2 1.2 Objective . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 1.3 Outline of thesis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2 Related works 8 2.1 Graduated optimization . . . . . . . . . . . . . . . . . . . . . . . . . 8 2.2 Sequential Monte Carlo . . . . . . . . . . . . . . . . . . . . . . . . . 11 3 Stochastic graduated graph approximation 13 3.1 Graph approximation by scanlines . . . . . . . . . . . . . . . . . . . 13 3.2 Graph approximation by trees . . . . . . . . . . . . . . . . . . . . . . 18 4 Minimization of intermediate energy functions 20 4.1 Block Iterated conditional modes . . . . . . . . . . . . . . . . . . . . 20 4.1.1 Block Iterated conditional modes: general idea . . . . . . . . 20 4.1.2 Block ICM for graduated graph approximation . . . . . . . . 21 4.2 Dynamic programming . . . . . . . . . . . . . . . . . . . . . . . . . . 23 4.2.1 Dynamic programming: general idea . . . . . . . . . . . . . . 23 4.2.2 The DP algorithm on scanlines for graduated graph approxi- mation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 4.2.3 The DP algorithm on trees . . . . . . . . . . . . . . . . . . . 27 5 Experiments 29 5.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 5.2 Image deconvolution . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 5.3 OpenGM2 benchmark . . . . . . . . . . . . . . . . . . . . . . . . . . 43 6 Conclusion 51 Bibliography 52 59 60Maste

    Graph Signal Processing: Overview, Challenges and Applications

    Full text link
    Research in Graph Signal Processing (GSP) aims to develop tools for processing data defined on irregular graph domains. In this paper we first provide an overview of core ideas in GSP and their connection to conventional digital signal processing. We then summarize recent developments in developing basic GSP tools, including methods for sampling, filtering or graph learning. Next, we review progress in several application areas using GSP, including processing and analysis of sensor network data, biological data, and applications to image processing and machine learning. We finish by providing a brief historical perspective to highlight how concepts recently developed in GSP build on top of prior research in other areas.Comment: To appear, Proceedings of the IEE

    Statistical models for natural scene data

    Get PDF
    This thesis considers statistical modelling of natural image data. Obtaining advances in this field can have significant impact for both engineering applications, and for the understanding of the human visual system. Several recent advances in natural image modelling have been obtained with the use of unsupervised feature learning. We consider a class of such models, restricted Boltzmann machines (RBMs), used in many recent state-of-the-art image models. We develop extensions of these stochastic artificial neural networks, and use them as a basis for building more effective image models, and tools for computational vision. We first develop a novel framework for obtaining Boltzmann machines, in which the hidden unit activations co-transform with transformed input stimuli in a stable and predictable way throughout the network. We define such models to be transformation equivariant. Such properties have been shown useful for computer vision systems, and have been motivational for example in the development of steerable filters, a widely used classical feature extraction technique. Translation equivariant feature sharing has been the standard method for scaling image models beyond patch-sized data to large images. In our framework we extend shallow and deep models to account for other kinds of transformations as well, focusing on in-plane rotations. Motivated by the unsatisfactory results of current generative natural image models, we take a step back, and evaluate whether they are able to model a subclass of the data, natural image textures. This is a necessary subcomponent of any credible model for visual scenes. We assess the performance of a state- of-the-art model of natural images for texture generation, using a dataset and evaluation techniques from in prior work. We also perform a dissection of the model architecture, uncovering the properties important for good performance. Building on this, we develop structured extensions for more complicated data comprised of textures from multiple classes, using the single-texture model architecture as a basis. These models are shown to be able to produce state-of-the-art texture synthesis results quantitatively, and are also effective qualitatively. It is demonstrated empirically that the developed multiple-texture framework provides a means to generate images of differently textured regions, more generic globally varying textures, and can also be used for texture interpolation, where the approach is radically dfferent from the others in the area. Finally we consider visual boundary prediction from natural images. The work aims to improve understanding of Boltzmann machines in the generation of image segment boundaries, and to investigate deep neural network architectures for learning the boundary detection problem. The developed networks (which avoid several hand-crafted model and feature designs commonly used for the problem), produce the fastest reported inference times in the literature, combined with state-of-the-art performance

    Bayesian image reconstruction and adaptive scene sampling in single-photon LiDAR imaging

    Get PDF
    Three-Dimensional multispectral Light Detection And Ranging (LiDAR) used with time-correlated Single-Photon (SP) detection has emerged as a key imaging modality for high-resolution depth imaging due to its high sensitivity and excellent surface-to-surface resolution. This allowed depth imaging through adversarial conditions with a prime role in numerous applications. However, several practical challenges currently limit the use of LiDAR in real-world conditions. Large data volume constitutes a major challenge for multispectral SP-LiDAR imaging due to the acquisition of millions of events per second that are usually gathered in large histogram cubes. This challenge is more evident when the useful signal photons are attenuated and the background noise is amplified as a result of imaging through a scattering environment such as underwater or fog. Another limitation includes the detection of multiple-surfaces-per pixel which usually occurs when imaging through semi-transparent materials (e.g., windows, camouflage), or in long-range profiling. This thesis proposes robust and fast computational solutions to improve the acquisition and processing of LiDAR data while measuring uncertainty on high-dimensional data. A smart task-based sampling framework is proposed to improve the acquisition process and reduce data volume. In addition, the processing was improved using a Bayesian approach to different types of inverse problems (e.g. spectral classification, and scene reconstruction). The contributions of this thesis enables fast and robust 3D reconstruction of complex scenes, paving the way for the extensive use of single-photon imaging in real-world applications
    corecore