4 research outputs found

    Recovering Sign Bits of DCT Coefficients in Digital Images as an Optimization Problem

    Full text link
    Recovering unknown, missing, damaged, distorted or lost information in DCT coefficients is a common task in multiple applications of digital image processing, including image compression, selective image encryption, and image communications. This paper investigates recovery of a special type of information in DCT coefficients of digital images: sign bits. This problem can be modelled as a mixed integer linear programming (MILP) problem, which is NP-hard in general. To efficiently solve the problem, we propose two approximation methods: 1) a relaxation-based method that convert the MILP problem to a linear programming (LP) problem; 2) a divide-and-conquer method which splits the target image into sufficiently small regions, each of which can be more efficiently solved as an MILP problem, and then conducts a global optimization phase as a smaller MILP problem or an LP problem to maximize smoothness across different regions. To the best of our knowledge, we are the first who considered how to use global optimization to recover sign bits of DCT coefficients. We considered how the proposed methods can be applied to JPEG-encoded images and conducted extensive experiments to validate the performances of our proposed methods. The experimental results showed that the proposed methods worked well, especially when the number of unknown sign bits per DCT block is not too large. Compared with other existing methods, which are all based on simple error-concealment strategies, our proposed methods outperformed them with a substantial margin, both according to objective quality metrics (PSNR and SSIM) and also our subjective evaluation. Our work has a number of profound implications, e.g., more sign bits can be discarded to develop more efficient image compression methods, and image encryption methods based on sign bit encryption can be less secure than we previously understood.Comment: 13 pages, 8 figure

    Distributed Video Coding: Iterative Improvements

    Get PDF

    Exploiting spatial smoothness to recover undecoded coefficients for transform domain distributed video coding

    No full text
    In a transform domain distributed video coding scheme, the correlation between the current encoding unit, e.g. block and slice, and the corresponding side-information is modeled using a virtual channel. This correlation model is then used for rate allocation, quantization, and Wyner-Ziv coding. Since the encoder can only have an estimate of the correlation instead of the exact knowledge of the side-information, the decoder will fail to recover the quantized transformed coeffi- cients with a nonzero probability. In this paper, we propose to integrate a scheme at the decoder to recover the undecoded coefficients using the spatial smoothness property of individual video frames. Simulation results demonstrated that, at different decoding failure probabilities, a transformed coeffi- cient recovery scheme can significantly improve the quality of videos in terms of both PSNR and SSIM

    A maximum likelihood approach to video error correction applied to H.264 decoding

    Get PDF
    Video error concealment has long been identified as the last line of defense against transmission errors. This is especially true for real time video communication systems where retransmissions are rarely used because of timing constraints. Since error handling is outside the scope of video coding standards, decoders may choose to simply ignore the corrupted packets, or attempt to decode their content. Video error correction is a viable alternative to deal with transmission errors when corrupted packets reach their destination. Until now, these approaches have received little considerations. This is mainly because the proposed methods either rely on specific coding tools or constraints, or require far too many computations compared to video error concealment techniques. In this thesis, we propose a novel video error correction method based on maximum likelihood decoding. The method estimates the likeliest syntactically valid video slice content based on the erroneous video packets rather than discarding the content, and concealing the missing information. Such content is obtained by combining the likelihood of the candidate codewords with the bit modification likelihood associated to each candidate. We propose two solutions centered around our maximum likelihood decoding approach. First, we introduce a slice-level video error correction method. Furthermore, we show how to integrate the soft-output information shared by the channel decoder to evaluate the bit modification likelihood. We also show that it is possible to use our maximum likelihood decoding approach when soft-output information is not available. Then, we refine the solution at the syntax-element-level. The final solution we obtain can be used in real-time communication systems as it is computationally inexpensive compared to the slice-level solution, or the solutions proposed in the literature. Our final solution is then applied to the correction of videos conforming to the H.264 Baseline profile. We selected three 720x480 sequences, five 704x576 sequences, and one 720x576 sequence to run simulations. Each sequence was coded at a target bitrate of 1 Mbps, 1.2 Mbps, and 1.5 Mbps. All 27 sequences were then submitted to a noisy channel with a bit error rate ranging from 10−5 to 10−3. Our 5400 observations show a PSNR improvement of 1.69 dB over the video error concealment method implemented in the H.264 reference software. Furthermore, our results also indicate a 0.42 dB PSNR improvement over state-of-the-art error concealment STBMA+PDE
    corecore