Search CORE

141 research outputs found

CLEAR: Covariant LEAst-square Re-fitting with applications to image restoration

Author: Deledalle C-A.
Papadakis N.
Salmon J.
Vaiter S.
Publication venue
Publication date: 16/06/2016
Field of study

In this paper, we propose a new framework to remove parts of the systematic errors affecting popular restoration algorithms, with a special focus for image processing tasks. Generalizing ideas that emerged for

\ell_1

regularization, we develop an approach re-fitting the results of standard methods towards the input data. Total variation regularizations and non-local means are special cases of interest. We identify important covariant information that should be preserved by the re-fitting method, and emphasize the importance of preserving the Jacobian (w.r.t. the observed signal) of the original estimator. Then, we provide an approach that has a "twicing" flavor and allows re-fitting the restored signal by adding back a local affine transformation of the residual term. We illustrate the benefits of our method on numerical simulations for image restoration tasks

arXiv.org e-Print Archive

HAL-uB

HAL - Université de Franche-Comté

Oskar Bordeaux

Structured learning of sum-of-submodular higher order energy functions

Author: Fix Alexander
Joachims Thorsten
Park Sam
Zabih Ramin
Publication venue
Publication date: 30/09/2013
Field of study

Submodular functions can be exactly minimized in polynomial time, and the special case that graph cuts solve with max flow \cite{KZ:PAMI04} has had significant impact in computer vision \cite{BVZ:PAMI01,Kwatra:SIGGRAPH03,Rother:GrabCut04}. In this paper we address the important class of sum-of-submodular (SoS) functions \cite{Arora:ECCV12,Kolmogorov:DAM12}, which can be efficiently minimized via a variant of max flow called submodular flow \cite{Edmonds:ADM77}. SoS functions can naturally express higher order priors involving, e.g., local image patches; however, it is difficult to fully exploit their expressive power because they have so many parameters. Rather than trying to formulate existing higher order priors as an SoS function, we take a discriminative learning approach, effectively searching the space of SoS functions for a higher order prior that performs well on our training set. We adopt a structural SVM approach \cite{Joachims/etal/09a,Tsochantaridis/etal/04} and formulate the training problem in terms of quadratic programming; as a result we can efficiently search the space of SoS priors via an extended cutting-plane algorithm. We also show how the state-of-the-art max flow method for vision problems \cite{Goldberg:ESA11} can be modified to efficiently solve the submodular flow problem. Experimental comparisons are made against the OpenCV implementation of the GrabCut interactive segmentation technique \cite{Rother:GrabCut04}, which uses hand-tuned parameters instead of machine learning. On a standard dataset \cite{Gulshan:CVPR10} our method learns higher order priors with hundreds of parameter values, and produces significantly better segmentations. While our focus is on binary labeling problems, we show that our techniques can be naturally generalized to handle more than two labels

arXiv.org e-Print Archive

CiteSeerX

Crossref

Coupling BM3D with directional wavelet packets for image denoising

Author: Averbuch Amir
Hauser Jonathan
Neittaanmaki Pekka
Salhov Moshe
Zheludev Valery
Publication venue
Publication date: 25/08/2020
Field of study

The paper presents an image denoising algorithm by combining a method that is based on directional quasi-analytic wavelet packets (qWPs) with the popular BM3D algorithm. The qWPs and its corresponding transforms are designed in [1]. The denoising algorithm qWP (qWPdn) applies an adaptive localized soft thresholding to the transform coefficients using the Bivariate Shrinkage methodology. The combined method consists of several iterations of qWPdn and BM3D algorithms, where the output from one algorithm updates the input to the other (cross-boosting).The qWPdn and BM3D methods complement each other. The qWPdn capabilities to capture edges and fine texture patterns are coupled with utilizing the sparsity in real images and self-similarity of patches in the image that is inherent in the BM3D. The obtained results are quite competitive with the best state-of-the-art algorithms. We compare the performance of the combined methodology with the performances of cptTP-CTF6, DAS-2 algorithms, which use directional frames, and the BM3D algorithm. In the overwhelming majority of the experiments, the combined algorithm outperformed the above methods.Comment: 26 pages. arXiv admin note: substantial text overlap with arXiv:2001.04899, arXiv:1907.01479; text overlap with arXiv:2008.0536

arXiv.org e-Print Archive

Structured learning of sum-of-submodular higher order energy functions

Author: Alexander Fix
Ramin Zabih
Sam Park
Thorsten Joachims
Publication venue
Publication date
Field of study

Submodular functions can be exactly minimized in polynomial time, and the special case that graph cuts solve with max flow [18] has had significant impact in computer vision [5, 20, 27]. In this paper we address the important class of sum-of-submodular (SoS) functions [2, 17], which can be efficiently minimized via a variant of max flow called submodular flow [6]. SoS functions can naturally express higher order priors involving, e.g., local image patches; however, it is difficult to fully exploit their expressive power because they have so many parameters. Rather than trying to formulate existing higher order priors as an SoS function, we take a discriminative learning approach, effectively searching the space of SoS functions for a higher order prior that performs well on our training set. We adopt a structural SVM approach [14, 33] and formulate the training problem in terms of quadratic programming; as a result we can efficiently search the space of SoS priors via an extended cutting-plane algorithm. We also show how the state-of-the-art max flow method for vision problems [10] can be modified to efficiently solve the submodular flow problem. Experimental comparisons are made against the OpenCV implementation of the GrabCut interactive segmentation technique [27], which uses hand-tuned parameters instead of machine learning. On a standard dataset [11] our method learns higher order priors with hundreds of parameter values, and produces significantly better segmentations. While our focus is on binary labeling problems, we show that our techniques can be naturally generalized to handle more than two labels. 1

CiteSeerX