We present DiffBIR, which leverages pretrained text-to-image diffusion models
for the blind image restoration problem. Our framework adopts a two-stage pipeline.
In the first stage, we pretrain a restoration module across diversified
degradations to improve generalization capability in real-world scenarios. The
second stage leverages the generative ability of latent diffusion models to
achieve realistic image restoration. Specifically, we introduce an injective
modulation sub-network -- LAControlNet -- for finetuning, while the pre-trained
Stable Diffusion remains frozen to maintain its generative ability. Finally, we
introduce a controllable module that allows users to balance quality and
fidelity by applying latent image guidance during the denoising process at
inference. Extensive experiments demonstrate the superiority of DiffBIR over
state-of-the-art approaches for both blind image super-resolution and blind
face restoration on synthetic and real-world datasets. The code is
available at https://github.com/XPixelGroup/DiffBIR
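The latent image guidance mentioned above can be understood as a gradient-based correction applied at each denoising step, pulling the model's clean-latent estimate toward a reference latent (e.g., from the stage-1 restoration output). The following is a minimal, illustrative NumPy sketch, not DiffBIR's actual implementation; the names `denoiser`, `z_ref`, and `guidance_scale` are assumptions for illustration:

```python
import numpy as np

def guided_denoise_step(z_t, z_ref, denoiser, t, guidance_scale=1.0):
    """One denoising step with latent image guidance (illustrative sketch).

    z_t:   current noisy latent
    z_ref: reference latent (here assumed to come from the stage-1 output)
    denoiser: callable returning the model's estimate of the clean latent
    guidance_scale: 0 favors generative quality (no guidance);
                    larger values favor fidelity to the reference
    """
    z0_hat = denoiser(z_t, t)  # model's clean-latent estimate at step t
    # Gradient of the guidance loss 0.5 * ||z0_hat - z_ref||^2 w.r.t. z0_hat
    grad = z0_hat - z_ref
    # Nudge the estimate toward the reference latent
    return z0_hat - guidance_scale * grad

# Toy usage with an identity "denoiser" on a random latent
rng = np.random.default_rng(0)
z_t = rng.standard_normal((4, 4))
z_ref = np.zeros((4, 4))
guided = guided_denoise_step(z_t, z_ref, lambda z, t: z, t=10, guidance_scale=0.5)
# With scale 0.5 and a zero reference, the latent is pulled halfway toward zero
assert np.allclose(guided, 0.5 * z_t)
```

In this sketch the scale acts as the quality-fidelity knob: at 0 the step is purely generative, while larger values trade diversity for agreement with the reference.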