947 research outputs found

    Learning Moore-Penrose based residuals for robust non-blind image deconvolution

    Get PDF
    This work was supported by grants P20_00286 and B-TIC-324-UGR20 funded by Consejerรญa de Universidad, Investigaciรณn e Innovaciรณn ( Junta de Andalucรญa ) and by โ€œ ERDF A way of making Europeโ€. Funding for open access charge: Universidad de Granada / CBUA.This paper proposes a deep learning-based method for image restoration given an inaccurate knowledge of the degradation. We first show how the impulse response of a Wiener filter can approximate the Moore-Penrose pseudo-inverse of the blur convolution operator. The deconvolution problem is then cast as the learning of a residual in the null space of the blur kernel, which, when added to the Wiener restoration, will satisfy the image formation model. This approach is expected to make the network capable of dealing with different blurs since only residuals associated with the Wiener filter have to be learned. Artifacts caused by inaccuracies in the blur estimation and other image formation model inconsistencies are removed by a Dynamic Filter Network. The extensive experiments carried out on several synthetic and real image datasets assert the proposed method's performance and robustness and demonstrate the advantage of the proposed method over existing ones.Junta de Andalucรญa P20_00286, B-TIC-324-UGR20ERDF A way of making EuropeUniversidad de Granada / CBU

    ์‹ฌ์ธต ์‹ ๊ฒฝ๋ง์„ ํ™œ์šฉํ•œ ์˜์ƒ ํ’ˆ์งˆ ๊ฐ•ํ™” ๊ธฐ๋ฒ•

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ(๋ฐ•์‚ฌ) -- ์„œ์šธ๋Œ€ํ•™๊ต๋Œ€ํ•™์› : ์ž์—ฐ๊ณผํ•™๋Œ€ํ•™ ํ˜‘๋™๊ณผ์ • ๊ณ„์‚ฐ๊ณผํ•™์ „๊ณต, 2021.8. ๋…ธํ˜•๋ฏผ.In this thesis, we focus on deep learning methods to enhance the quality of a single image. We first categorize the image quality enhancement problem into three tasks: denoising, deblurring, and super-resolution, then introduce deep learning techniques optimized for each problem. To solve these problems, we introduce a novel deep neural network suitable for multi-scale analysis and propose efficient model-agnostic methods that help the network extract information from high-frequency domains to reconstruct clearer images. Experiments on SIDD, Flickr2K, DIV2K, and REDS datasets show that our method achieves state-of-the-art performance on each task. Furthermore, we show that our model can overcome the over-smoothing problem commonly observed in existing PSNR-oriented methods and generate more natural high-resolution images by applying adversarial training.๋ณธ ํ•™์œ„ ๋…ผ๋ฌธ์€ ๋‹จ์ผ ์˜์ƒ์˜ ํ’ˆ์งˆ ๊ฐ•ํ™”๋ฅผ ์œ„ํ•œ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฒ•์— ๋Œ€ํ•œ ์—ฐ๊ตฌ๋ฅผ ๋‹ค๋ฃฌ๋‹ค. ์˜์ƒ ํ’ˆ์งˆ ๊ฐ•ํ™”๋ฅผ ์†์ƒ๋œ ์ด๋ฏธ์ง€์˜ ์žก์Œ ์ œ๊ฑฐ ๋ฐ ๋””๋ธ”๋Ÿฌ๋ง๊ณผ ์ €ํ•ด์ƒ๋„ ์ด๋ฏธ์ง€๋ฅผ ๊ณ ํ•ด์ƒ๋„๋กœ ๋ณ€ํ™˜ํ•˜๋Š” ์ดˆํ•ด์ƒ๋„ ๋ฌธ์ œ๋กœ ์„ธ๋ถ„ํ™”ํ•œ ๋’ค, ๊ฐ๊ฐ์˜ ๋ฌธ์ œ ํ•ด๊ฒฐ์— ์ตœ์ ํ™”๋œ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฒ•์„ ๋‹จ๊ณ„๋ณ„๋กœ ์†Œ๊ฐœํ•œ๋‹ค. ํŠนํžˆ, ์†์ƒ๋œ ์˜์ƒ์˜ ํŠน์„ฑ์„ ํšจ๊ณผ์ ์œผ๋กœ ๋ถ„์„ํ•˜๊ณ  ๋ณด๋‹ค ๊น”๋”ํ•œ ๊ณ ํ•ด์ƒ๋„ ์˜์ƒ์„ ์ƒ์„ฑํ•˜๊ธฐ ์œ„ํ•˜์—ฌ ์ฃผ์–ด์ง„ ์˜์ƒ์„ ๋‹ค์ค‘ ์Šค์ผ€์ผ๋กœ ๋ถ„์„ํ•˜๋Š” ์‹ฌ์ธต ์‹ ๊ฒฝ๋ง ๊ตฌ์กฐ๋ฅผ ์ œ์•ˆํ•˜์˜€์œผ๋ฉฐ, ์ด์™ธ์—๋„ ๋”ฅ๋Ÿฌ๋‹ ๋ชจ๋ธ์ด ์˜์ƒ ๋‚ด ๋ณต์žกํ•œ ๊ณ ์ฃผํŒŒ์ˆ˜ ์˜์—ญ์— ๋Œ€ํ•œ ์ •๋ณด๋ฅผ ํšจ๊ณผ์ ์œผ๋กœ ์ถ”์ถœํ•˜๊ณ  ์žฌ๊ฑดํ•  ์ˆ˜ ์žˆ๋„๋ก ๋•๋Š” ๊ธฐ๋ฒ•๋“ค์„ ์†Œ๊ฐœํ•œ๋‹ค. ์šฐ๋ฆฌ๋Š” ์ œ์•ˆ๋œ ๊ธฐ๋ฒ•๋“ค์„ SIDD, Flickr2K, DIV2K, REDS ๋“ฑ ๋ฐ์ดํ„ฐ์…‹์— ์ ์šฉํ•˜์—ฌ ๊ธฐ์กด์˜ ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฐ˜ ๊ธฐ๋ฒ•๋ณด๋‹ค ํ–ฅ์ƒ๋œ ์„ฑ๋Šฅ์„ ์‹คํ—˜์ ์œผ๋กœ ์ฆ๋ช…ํ•˜์˜€๋‹ค. ๋˜ํ•œ ์ดˆํ•ด์ƒ๋„ ๋ฌธ์ œ ํ•ด๊ฒฐ์„ ์œ„ํ•ด ํ•™์Šต๋œ ์‹ฌ์ธต ์‹ ๊ฒฝ๋ง์— ์ถ”๊ฐ€์ ์ธ ์ ๋Œ€์  ํ•™์Šต์„ ์ ์šฉํ•จ์œผ๋กœ์จ ๊ธฐ์กด ๋”ฅ๋Ÿฌ๋‹ ๊ธฐ๋ฒ•๋“ค์˜ ํ•œ๊ณ„๋กœ ์ง€์ ๋˜์—ˆ๋˜ ๋ถ€๋ถ„ ํ‰๊ท ํ™” ๋ฌธ์ œ๋ฅผ ๊ทน๋ณตํ•˜๊ณ  ๋ณด๋‹ค ์ž์—ฐ์Šค๋Ÿฌ์šด ๊ณ ํ•ด์ƒ๋„ ์˜์ƒ์„ ์ƒ์„ฑํ•  ์ˆ˜ ์žˆ์Œ์„ ๋ณด์˜€๋‹ค.1. Introduction 1 2. Preliminaries 4 2.1 Image Denoising 4 2.1.1 Problem Formulation: AWGN 4 2.1.2 Existing Methods 6 2.2 Image Deblurring 7 2.2.1 Problem Formulation: Blind Deblur 7 2.2.2 Existing Methods 7 2.3 Single Image Super-Resolution 9 2.3.1 Problem Formulation: SISR 9 2.3.2 Existing Methods 12 3. Image Denoising 15 3.1 Proposed Methods 15 3.1.1 Multi-scale Edge Filtering 15 3.1.2 Feature Attention Module 17 3.1.3 Network Architecture 19 3.2 Experiments 21 3.2.1 Training Details 21 3.2.2 Experimental Results on DIV2K+AWGN dataset 21 3.2.3 Experimental Results on SIDD dataset 26 4. Image Deblurring 28 4.1 Proposed Methods 28 4.1.1 Multi-Scale Feature Analysis 29 4.1.2 Network Architecture 29 4.2 Experiments 31 4.2.1 Training Details 31 4.2.2 Experimental Results on Flickr2K dataset 31 4.2.3 Experimental Results on REDS dataset 34 5. Single Image Super-Resolution 38 5.1 Proposed Methods 38 5.1.1 High-Pass Filtering Loss 39 5.1.2 Gradient Magnitude Similarity Map Masking 41 5.1.3 Soft Gradient Magnitude Similarity Map Masking 43 5.1.4 Network Architecture 44 5.1.5 Adversarial Training for Perceptual Generative Model 45 5.2 Experiments 47 5.2.1 Training Details 47 5.2.2 Experimental Results on DIV2K dataset 48 5.2.3 Experimental Results on Set5/Set14 dataset 55 5.2.4 Experimental Results on REDS dataset 60 6. Conclusion and Future Works 63๋ฐ•

    Selected Topics in Bayesian Image/Video Processing

    Get PDF
    In this dissertation, three problems in image deblurring, inpainting and virtual content insertion are solved in a Bayesian framework.;Camera shake, motion or defocus during exposure leads to image blur. Single image deblurring has achieved remarkable results by solving a MAP problem, but there is no perfect solution due to inaccurate image prior and estimator. In the first part, a new non-blind deconvolution algorithm is proposed. The image prior is represented by a Gaussian Scale Mixture(GSM) model, which is estimated from non-blurry images as training data. Our experimental results on a total twelve natural images have shown that more details are restored than previous deblurring algorithms.;In augmented reality, it is a challenging problem to insert virtual content in video streams by blending it with spatial and temporal information. A generic virtual content insertion (VCI) system is introduced in the second part. To the best of my knowledge, it is the first successful system to insert content on the building facades from street view video streams. Without knowing camera positions, the geometry model of a building facade is established by using a detection and tracking combined strategy. Moreover, motion stabilization, dynamic registration and color harmonization contribute to the excellent augmented performance in this automatic VCI system.;Coding efficiency is an important objective in video coding. In recent years, video coding standards have been developing by adding new tools. However, it costs numerous modifications in the complex coding systems. Therefore, it is desirable to consider alternative standard-compliant approaches without modifying the codec structures. In the third part, an exemplar-based data pruning video compression scheme for intra frame is introduced. Data pruning is used as a pre-processing tool to remove part of video data before they are encoded. At the decoder, missing data is reconstructed by a sparse linear combination of similar patches. The novelty is to create a patch library to exploit similarity of patches. The scheme achieves an average 4% bit rate reduction on some high definition videos
    • โ€ฆ
    corecore