5 research outputs found

Super-resolution assessment and detection

Author: López Cuena Enrique
Publication venue: Universitat Politècnica de Catalunya
Publication date: 28/06/2023

Super Resolution (SR) techniques are powerful digital manipulation tools that have significantly impacted various industries due to their ability to enhance the resolution of lower-quality images and videos. Yet the real-world adaptation of SR models poses numerous challenges, which blind SR models aim to overcome by emulating complex real-world degradations. In this thesis, we investigate these SR techniques, with a particular focus on comparing the performance of blind models to their non-blind counterparts under various conditions. Despite recent progress, the proliferation of SR techniques raises concerns about their potential misuse. These methods can easily manipulate real digital content and create misrepresentations, which highlights the need for robust SR detection mechanisms. In our study, we analyze the limitations of current SR detection techniques and propose a new detection system that exhibits higher performance in discerning real and upscaled videos. Moreover, we conduct several experiments to gain insights into the strengths and weaknesses of the detection models, providing a better understanding of their behavior and limitations. In particular, we target 4K videos, which are rapidly becoming the standard resolution in various fields such as streaming services, gaming, and content creation. As part of our research, we have created and utilized a unique dataset in 4K resolution, specifically designed to facilitate the investigation of SR techniques and their detection.

UPCommons. Portal del coneixement obert de la UPC

Generalized Rate-Distortion Functions of Videos

Author: Liu Wentao
Publication venue: University of Waterloo
Publication date: 09/07/2019

Customers consume an enormous amount of digital video every day via various kinds of video services through terrestrial, cable, and satellite communication systems or over-the-top Internet connections. To offer the best possible services using the limited capacity of video distribution systems, these video services desire a precise understanding of the relationship between the perceptual quality of a video and its media attributes, which we term the generalized rate-distortion (GRD) function. In this thesis, we focus on accurately estimating the GRD function with a minimal number of measurement queries. We first explore the GRD behavior of compressed digital videos in a two-dimensional space of bitrate and resolution. Our analysis of real-world GRD data reveals that all GRD functions share similar regularities, but also exhibit considerable variations across different combinations of content and encoder types. Based on the analysis, we define the theoretical space of the GRD function, which not only lays the groundwork for the form a GRD model should take, but also determines the constraints these functions must satisfy. We propose two computational GRD models. In the first model, we assume that the quality scores are precise, and develop a robust axial-monotonic Clough-Tocher (RAMCT) interpolation method to approximate the GRD function from a moderate number of measurements. In the second model, we show that the GRD function space is a convex set residing in a Hilbert space, and that a GRD function can be estimated by solving a projection problem onto the convex set. By analyzing GRD functions that arise in practice, we approximate the infinite-dimensional theoretical space by a low-dimensional one, based on which an empirical GRD model of few parameters is proposed. To further reduce the number of queries, we present a novel sampling scheme based on a probabilistic model and an information measure. The proposed sampling method generates a sequence of queries by minimizing the overall informativeness of the remaining samples. To evaluate the performance of the GRD estimation methods, we collect a large-scale database consisting of more than 4,000 real-world GRD functions, namely the Waterloo generalized rate-distortion (Waterloo GRD) database. Extensive comparison experiments are carried out on the database. The superiority of the two proposed GRD models over state-of-the-art approaches is attested both quantitatively and visually. Meanwhile, it is also validated that the proposed sampling algorithm consistently reduces the number of queries needed by various GRD estimation algorithms. Finally, we show the broad application scope of the proposed GRD models by exemplifying three applications: rate-distortion curve prediction, per-title encoding profile generation, and video encoder comparison.
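
The idea of estimating a function by projecting measurements onto a convex set can be illustrated in one dimension. The sketch below is not the thesis's GRD model. It assumes, purely for illustration, that quality should be nondecreasing in bitrate at a fixed resolution, and it projects noisy quality scores onto the convex set of nondecreasing sequences using the pool-adjacent-violators algorithm. The function name `project_monotone` and the sample scores are hypothetical.

```python
def project_monotone(scores):
    """Euclidean projection onto nondecreasing sequences (pool adjacent violators)."""
    blocks = []  # each block is [mean, count]
    for s in scores:
        blocks.append([float(s), 1])
        # Merge backwards while the last two blocks violate monotonicity.
        while len(blocks) > 1 and blocks[-2][0] > blocks[-1][0]:
            m2, c2 = blocks.pop()
            m1, c1 = blocks.pop()
            blocks.append([(m1 * c1 + m2 * c2) / (c1 + c2), c1 + c2])
    # Expand the merged blocks back to one value per input sample.
    fitted = []
    for mean, count in blocks:
        fitted.extend([mean] * count)
    return fitted


# Hypothetical noisy quality scores measured at increasing bitrates.
noisy = [30.0, 45.0, 42.0, 60.0, 58.0, 59.0, 80.0]
fitted = project_monotone(noisy)
print(fitted)
```

Each violating pair is replaced by its average, which is the closest nondecreasing sequence in the least-squares sense. A full GRD model would impose its constraints jointly over bitrate and resolution rather than along a single axis.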

University of Waterloo's Institutional Repository

Compressive Sensing Applied to MIMO Radar and Sparse Disjoint Scenes

Author: Minner Michael Francis
Publication venue: Drexel University
Publication date: 01/03/2016

The purpose of remote sensing is to acquire information about an object through the propagation of electromagnetic waves, specifically radio waves for radar systems. However, these systems are constrained by the costly Nyquist sampling rate required to guarantee efficient recovery of the signal. Recent advances in compressive sensing offer a means of efficiently recovering such signals with fewer measurements. This thesis investigates the feasibility of employing techniques from compressive sensing in on-grid MIMO radar in order to identify targets and estimate their locations and velocities. We develop a mathematical framework to model this problem and then devise numerical simulations to assess how various parameters, such as the choice of recovery algorithm, antenna positioning, and signal-to-noise ratio, impact performance. The experimental formulation of this project leads to further theoretical questions concerning the benefits of incorporating an underlying signal structure within the compressive sensing framework. We pursue these concerns for the case of sparse and disjoint vectors. Our computational and analytical treatments illustrate that knowledge of the simultaneity of these structures within a signal provides no benefit in reducing the minimal number of measurements needed to robustly recover such vectors from noninflating measurements, regardless of the reconstruction algorithm.
Ph.D., Mathematics -- Drexel University, 201
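
The core claim of compressive sensing, that a sparse signal can be recovered from fewer measurements than its length, can be sketched in a few lines. The abstract does not say which recovery algorithms the thesis uses. The example below picks orthogonal matching pursuit (OMP) as one common choice, with a random Gaussian measurement matrix rather than a MIMO radar model. All names and sizes here are illustrative assumptions.

```python
import numpy as np


def omp(A, y, k):
    """Orthogonal matching pursuit: greedily pick k columns of A to explain y."""
    residual = y.copy()
    support = []
    for _ in range(k):
        # Column most correlated with the current residual.
        j = int(np.argmax(np.abs(A.T @ residual)))
        support.append(j)
        # Least-squares fit on the selected columns, then update the residual.
        coef, *_ = np.linalg.lstsq(A[:, support], y, rcond=None)
        residual = y - A[:, support] @ coef
    x = np.zeros(A.shape[1])
    x[support] = coef
    return x


rng = np.random.default_rng(0)
n, m = 200, 80                      # signal length, number of measurements (m < n)
A = rng.standard_normal((m, n)) / np.sqrt(m)   # assumed Gaussian sensing matrix
x_true = np.zeros(n)
support_true = rng.choice(n, size=3, replace=False)
x_true[support_true] = [3.0, -2.0, 1.5]        # 3-sparse test signal
y = A @ x_true                      # noiseless measurements
x_hat = omp(A, y, k=3)
print("max abs error:", np.max(np.abs(x_hat - x_true)))
```

In this noiseless setting the sparsity level `k` is assumed known. With noise or an unknown sparsity level, a stopping rule based on the residual norm would typically replace the fixed iteration count.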

Drexel Libraries E-Repository and Archives

On Automatic Music Genre Recognition by Sparse Representation Classification using Auditory Temporal Modulations

Author: Noorzad Pardis, Sturm Bob L.
Publication date: 01/01/2012

VBN

Performance analysis of AVS2 for remote sensing image compression

Publication venue: Institute of Electrical and Electronics Engineers (IEEE)

Crossref