Search CORE

13 research outputs found

Directional graph weight prediction for image compression

Author: Grangetto Marco
Verdoja Francesco
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

Crossref

Institutional Research Information System University of Turin

Steerable Discrete Cosine Transform

Author: Fosson Sophie Marie
Fracastoro Giulia
Magli Enrico
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

In image compression, classical block-based separable transforms tend to be inefficient when image blocks contain arbitrarily shaped discontinuities. For this reason, transforms incorporating directional information are an appealing alternative. In this paper, we propose a new approach to this problem, namely a discrete cosine transform (DCT) that can be steered in any chosen direction. Such transform, called steerable DCT (SDCT), allows to rotate in a flexible way pairs of basis vectors, and enables precise matching of directionality in each image block, achieving improved coding efficiency. The optimal rotation angles for SDCT can be represented as solution of a suitable rate-distortion (RD) problem. We propose iterative methods to search such solution, and we develop a fully fledged image encoder to practically compare our techniques with other competing transforms. Analytical and numerical results prove that SDCT outperforms both DCT and state-of-the-art directional transforms

arXiv.org e-Print Archive

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

Spatial intra-prediction based on mixtures of sparse representations

Author: Angelique Dremeau
Cedric Herzet
Christine Guillemot
Jean-Jacques Fuchs
Mehmet Turkan
Publication venue
Publication date: 03/04/2020
Field of study

Abstract-In this paper, we consider the problem of spatial prediction based on sparse representations. Several algorithms dealing with this problem can be found in the literature. We propose a novel method involving a mixture of sparse representations. We first place this approach into a probabilistic framework and then derive a practical procedure to solve it. Comparisons of the rate-distortion performance show the superiority of the proposed algorithm with regard to other stateof-the-art algorithms

CiteSeerX

Transformadas direccionales basadas en grafos para codificación de vídeo: hacia un nuevo paradigma en codificación

Author: Martínez Enríquez Eduardo
Publication venue
Publication date: 01/01/2018
Field of study

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Universidad Carlos III de Madrid e-Archivo

Direction-adaptive transforms for coding prediction residuals

Author
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Anisotropic multiscale sparse learned bases for image compression

Author
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date
Field of study

Crossref

Graph Spectral Image Processing

Author: Cheung Gene
Magli Enrico
Ng Michael
Tanaka Yuichi
Publication venue
Publication date: 16/01/2018
Field of study

Recent advent of graph signal processing (GSP) has spurred intensive studies of signals that live naturally on irregular data kernels described by graphs (e.g., social networks, wireless sensor networks). Though a digital image contains pixels that reside on a regularly sampled 2D grid, if one can design an appropriate underlying graph connecting pixels with weights that reflect the image structure, then one can interpret the image (or image patch) as a signal on a graph, and apply GSP tools for processing and analysis of the signal in graph spectral domain. In this article, we overview recent graph spectral techniques in GSP specifically for image / video processing. The topics covered include image compression, image restoration, image filtering and image segmentation

arXiv.org e-Print Archive

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Sparse/DCT (S/DCT) Two-Layered Representation of Prediction Residuals for Video Coding

Author: C-C J. Kuo
Je-Won Kang
M. Gabbouj
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

Sub-pixel gradient 를 활용한 compound 영상 압축

Author: 김규동
Publication venue: 서울대학교 대학원
Publication date: 01/02/2014
Field of study

학위논문 (박사)-- 서울대학교 대학원 : 전기·컴퓨터공학부, 2014. 2. 김수환.컴퓨터 성능과 네트워크 속도가 발전함에 따라 컴퓨터 화면에 표시되는 compound image 의 기술은 다양한 전송 환경에서 비디오 및 양방향 서비스가 가능해졌다. 그러나 compound image는 다양한 종류의 영상이 복합적으로 나타나기 때문에 영상의 종류를 명확히 구분하고 각 종류에 맞는 영상 데이터 처리 방식이 필요하게 된다. 영상의 데이터 처리 방식이 복잡해 질수록 서버와 클라이언트의 성능 불균형은 데이터를 원활히 생성/재현 하지 못하는 문제를 가질 수 있다. Compound image 의 분류는 텍스트로 구성된 부분에 대하여 다른 종류의 영상으로 분류하지 않아야 한다. 이는 블록 단위로 구분하여 분류하는 방법에서 인접한 블록간에 서로 다른 코딩 방법을 적용하게 되면 사람이 느끼는 영상의 화질은 낮아지게 된다. 본 연구에서는 이러한 문제점을 해결하기 위해서 텍스트의 생성과정을 역이용한 sub-pixel gradient 블록 분류 방법을 제시한다. 평판 디스플레이에서는 텍스트의 부드러움을 표현하기 위하여 sub-pixel 단위로 컬러의 변화량을 조절하게 된다. 이를 whole-pixel의 단위로 영상을 구분하게 되면, 텍스트의 영역을 명확하게 구분하지 못한다. 본 연구에는 sub-pixel gradient 블록 분류 방법을 통하여 텍스트로 구성된 영역과 텍스트가 아닌 영역에 대한 판단이 정확히 이루어짐을 실험을 통하여 확인하였다. 텍스트의 코딩방법 중 손실 압축방법은 텍스트로 구성된 영상이 높은 주파수를 가지는 영상이기 때문에 양자화나 변환과정을 거치게 되면 영상의 손실이 커지게 된다. 하지만 무 손실 압축 방법은 높은 데이터 량을 가지게 되고, 영상 전송 속도가 높아져야 하는 문제가 발생한다. 본 연구에서는 sub-pixel gradient 방법을 이용한 텍스트 영역에 대한 코딩 방법을 제시한다. 텍스트 영상이 가지는 특성을 이용하여 영상에서 발생하는 기울기에 대하여 코딩을 진행한다. 이를 통하여 영상의 손실을 줄이고 텍스트의 가독성을 높일 수 있다. 동일한 압축률에서 다른 압축 알고리즘에 비하여 텍스트의 화질과 가독성이 뛰어남을 확인하였다. Compound image는 자연 영상과는 다르게 움직임이 단순하고 노이즈가 없다는 특성을 가진다. 이는 기존의 움직임 추정방법에 비하여 복잡도가 낮은 방법을 가능하게 한다. 본 연구에서는 이러한 compound image의 영상 특성을 이용한 그룹 움직임 추정 방법을 제시한다. 픽셀의 움직임을 확인하기 전에 영상의 분류에 따라 분류된 영역의 움직임을 먼저 파악하고 이를 통하여 최종적인 움직임을 추정하게 된다. 그룹 움직임 추정 방법을 사용하면 기존의 탐색영역 방법과 비교하여 탐색 영역을 최소화 할 수 있으며, 복잡도를 낮출 수 있음을 실험을 통하여 확인하였다.초 록 i 차 례 iii 그림 목차 vi 표 목 차 ix 제1장 서 론 1 1.1 연구 배경 1 1.2 연구 내용 4 1.3 논문 구성 6 제2장 텍스트 생성과정 및 기존압축방법 7 2.1 텍스트 생성과정 7 2.2 표준 영상 압축 방법 14 2.3 H.264 inter prediction 16 2.4 Compound image 의 압축 알고리즘 19 제3장 Sub-pixel gradient 블록 분류 방법 23 3.1 Background & Text color extraction 28 3.2 Text De-colorization 32 3.3 블록 분류 실험 결과 38 제4장 Sub-pixel Gradient text 블록 코딩 방법 46 4.1 Gradient fitting process 51 4.2 Text Coding 56 4.2.1 Gradient로 구성된 부분의 코딩방법 56 4.2.2 Gradient가 없는 부분의 코딩방법 57 4.2.3 local min/max 값 예측 57 4.2.4 Whole-pixel 코딩 59 4.2.5 화질 enhancement 60 4.3 텍스트 코딩 동작 64 4.3.1 텍스트 코딩 입력 65 4.3.2 Whole-pixel 코딩 1 66 4.3.3 역방향 Sub-pixel gradient 코딩 1 67 4.3.4 Local minimum 코딩 1 69 4.3.5 순방향 gradient 코딩 1 70 4.3.6 Local maximum 코딩 1 71 4.3.7 역방향 gradient 코딩 2 72 4.3.8 Local minimum 코딩 2 73 4.3.9 순방향 gradient 코딩 2 74 4.3.10 Whole-pixel 코딩 2 75 4.4 텍스트 블록 코딩 실험 결과 77 제5장 그룹 움직임 추정 방법 88 5.1 Block Grouping 94 5.2 Group Matching 97 5.3 Group motion vector calculation 101 5.4 그룹 움직임 추정 방법 실험 결과 104 제6장 결 론 109 참 고 문 헌 112 Abstract 119Docto

SNU Open Repository and Archive