26 research outputs found

    Overview of Image Processing and Various Compression Schemes

    Get PDF
    Image processing is key research among researchers. Compression of images are required when need of transmission or storage of images. Demand of multimedia growth, contributes to insufficient bandwidth of network and memory storage device. Advance imaging requires capacity of extensive amounts of digitized information. Therefore data compression is more required for reducing data redundancy to save more hardware space and transmission bandwidth. Various techniques are given for image compression. Some of which are discussed in this paper

    Enhancement layer inter frame coding for 3D dynamic point clouds

    Get PDF
    In recent years, Virtual Reality (VR) and Augmented Reality (AR) applications have seen a drastic increase in commercial popularity. Different representations have been used to create 3D reconstructions for AR and VR. Point clouds are one such representation that are characterized by their simplicity and versatil

    Volumetric 3D Point Cloud Attribute Compression: Learned polynomial bilateral filter for prediction

    Full text link
    We extend a previous study on 3D point cloud attribute compression scheme that uses a volumetric approach: given a target volumetric attribute function f:R3↦Rf : \mathbb{R}^3 \mapsto \mathbb{R}, we quantize and encode parameters ΞΈ\theta that characterize ff at the encoder, for reconstruction fΞΈ^((x))f_{\hat{\theta}}(\mathbf(x)) at known 3D points (x)\mathbf(x) at the decoder. Specifically, parameters ΞΈ\theta are quantized coefficients of B-spline basis vectors Ξ¦l\mathbf{\Phi}_l (for order pβ‰₯2p \geq 2) that span the function space Fl(p)\mathcal{F}_l^{(p)} at a particular resolution ll, which are coded from coarse to fine resolutions for scalability. In this work, we focus on the prediction of finer-grained coefficients given coarser-grained ones by learning parameters of a polynomial bilateral filter (PBF) from data. PBF is a pseudo-linear filter that is signal-dependent with a graph spectral interpretation common in the graph signal processing (GSP) field. We demonstrate PBF's predictive performance over a linear predictor inspired by MPEG standardization over a wide range of point cloud datasets

    Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression

    Full text link
    We study 3D point cloud attribute compression via a volumetric approach: assuming point cloud geometry is known at both encoder and decoder, parameters ΞΈ\theta of a continuous attribute function f:R3↦Rf: \mathbb{R}^3 \mapsto \mathbb{R} are quantized to ΞΈ^\hat{\theta} and encoded, so that discrete samples fΞΈ^(xi)f_{\hat{\theta}}(\mathbf{x}_i) can be recovered at known 3D points xi∈R3\mathbf{x}_i \in \mathbb{R}^3 at the decoder. Specifically, we consider a nested sequences of function subspaces Fl0(p)βŠ†β‹―βŠ†FL(p)\mathcal{F}^{(p)}_{l_0} \subseteq \cdots \subseteq \mathcal{F}^{(p)}_L, where Fl(p)\mathcal{F}_l^{(p)} is a family of functions spanned by B-spline basis functions of order pp, flβˆ—f_l^* is the projection of ff on Fl(p)\mathcal{F}_l^{(p)} and encoded as low-pass coefficients Flβˆ—F_l^*, and glβˆ—g_l^* is the residual function in orthogonal subspace Gl(p)\mathcal{G}_l^{(p)} (where Gl(p)βŠ•Fl(p)=Fl+1(p)\mathcal{G}_l^{(p)} \oplus \mathcal{F}_l^{(p)} = \mathcal{F}_{l+1}^{(p)}) and encoded as high-pass coefficients Glβˆ—G_l^*. In this paper, to improve coding performance over [1], we study predicting fl+1βˆ—f_{l+1}^* at level l+1l+1 given flβˆ—f_l^* at level ll and encoding of Glβˆ—G_l^* for the p=1p=1 case (RAHT(11)). For the prediction, we formalize RAHT(1) linear prediction in MPEG-PCC in a theoretical framework, and propose a new nonlinear predictor using a polynomial of bilateral filter. We derive equations to efficiently compute the critically sampled high-pass coefficients Glβˆ—G_l^* amenable to encoding. We optimize parameters in our resulting feed-forward network on a large training set of point clouds by minimizing a rate-distortion Lagrangian. Experimental results show that our improved framework outperformed the MPEG G-PCC predictor by 1111 to 12%12\% in bit rate reduction
    corecore