2,461 research outputs found
Wavelet-based encoding for HD applications
In the past decades, most of the research on image and video compression has focused on addressing high bandwidth- constrained environments. However, for high resolution and high quality image and video compression, as in the case of High Definition Television (HDTV) or Digital Cinema (DC), the primary constraints are related to quality and flexibility. This paper presents a comparison between scalable wavelet-based video codecs and the state of the art in single point encoding and it investigates the obtainable compression efficiency when using temporal correlation with respect to pure intra coding
Entropy Encoding, Hilbert Space and Karhunen-Loeve Transforms
By introducing Hilbert space and operators, we show how probabilities,
approximations and entropy encoding from signal and image processing allow
precise formulas and quantitative estimates. Our main results yield orthogonal
bases which optimize distinct measures of data encoding.Comment: 25 pages, 1 figur
An efficient rate control algorithm for a wavelet video codec
Rate control plays an essential role in video coding and transmission to provide the best video quality at the receiver's end given the constraint of certain network conditions. In this paper, a rate control algorithm using the Quality Factor (QF) optimization method is proposed for the wavelet-based video codec and implemented on an open source Dirac video encoder. A mathematical model which we call Rate-QF (R - QF) model is derived to generate the optimum QF for the current coding frame according to the target bitrate. The proposed algorithm is a complete one pass process and does not require complex mathematical calculation. The process of calculating the QF is quite simple and further calculation is not required for each coded frame. The experimental results show that the proposed algorithm can control the bitrate precisely (within 1% of target bitrate in average). Moreover, the variation of bitrate over each Group of Pictures (GOPs) is lower than that of H.264. This is an advantage in preventing the buffer overflow and underflow for real-time multimedia data streaming
Distributed Representation of Geometrically Correlated Images with Compressed Linear Measurements
This paper addresses the problem of distributed coding of images whose
correlation is driven by the motion of objects or positioning of the vision
sensors. It concentrates on the problem where images are encoded with
compressed linear measurements. We propose a geometry-based correlation model
in order to describe the common information in pairs of images. We assume that
the constitutive components of natural images can be captured by visual
features that undergo local transformations (e.g., translation) in different
images. We first identify prominent visual features by computing a sparse
approximation of a reference image with a dictionary of geometric basis
functions. We then pose a regularized optimization problem to estimate the
corresponding features in correlated images given by quantized linear
measurements. The estimated features have to comply with the compressed
information and to represent consistent transformation between images. The
correlation model is given by the relative geometric transformations between
corresponding features. We then propose an efficient joint decoding algorithm
that estimates the compressed images such that they stay consistent with both
the quantized measurements and the correlation model. Experimental results show
that the proposed algorithm effectively estimates the correlation between
images in multi-view datasets. In addition, the proposed algorithm provides
effective decoding performance that compares advantageously to independent
coding solutions as well as state-of-the-art distributed coding schemes based
on disparity learning
Enabling error-resilient internet broadcasting using motion compensated spatial partitioning and packet FEC for the dirac video codec
Video transmission over the wireless or wired
network require protection from channel errors since compressed video bitstreams are very sensitive to transmission errors because of the use of predictive coding and variable length coding. In this paper, a simple, low complexity and patent free error-resilient coding is proposed. It is based upon the idea of using spatial partitioning on the motion compensated residual frame without employing the transform coefficient coding. The proposed scheme is intended for open source Dirac video codec in order to enable the codec to be used for Internet
broadcasting. By partitioning the wavelet transform coefficients of the motion compensated residual frame into groups and independently processing each group using arithmetic coding and Forward Error Correction (FEC), robustness to transmission errors over the packet erasure
wired network could be achieved. Using the Rate
Compatibles Punctured Code (RCPC) and Turbo Code
(TC) as the FEC, the proposed technique provides
gracefully decreasing perceptual quality over packet loss rates up to 30%. The PSNR performance is much better when compared with the conventional data partitioning only methods. Simulation results show that the use of multiple
partitioning of wavelet coefficient in Dirac can achieve up to 8 dB PSNR gain over its existing un-partitioned method
One Transform To Compute Them All: Efficient Fusion-Based Full-Reference Video Quality Assessment
The Visual Multimethod Assessment Fusion (VMAF) algorithm has recently
emerged as a state-of-the-art approach to video quality prediction, that now
pervades the streaming and social media industry. However, since VMAF requires
the evaluation of a heterogeneous set of quality models, it is computationally
expensive. Given other advances in hardware-accelerated encoding, quality
assessment is emerging as a significant bottleneck in video compression
pipelines. Towards alleviating this burden, we propose a novel Fusion of
Unified Quality Evaluators (FUNQUE) framework, by enabling computation sharing
and by using a transform that is sensitive to visual perception to boost
accuracy. Further, we expand the FUNQUE framework to define a collection of
improved low-complexity fused-feature models that advance the state-of-the-art
of video quality performance with respect to both accuracy, by 4.2\% to 5.3\%,
and computational efficiency, by factors of 3.8 to 11 times!Comment: Version
- …