13 research outputs found

    Directional graph weight prediction for image compression

    Get PDF

    Steerable Discrete Cosine Transform

    Get PDF
    In image compression, classical block-based separable transforms tend to be inefficient when image blocks contain arbitrarily shaped discontinuities. For this reason, transforms incorporating directional information are an appealing alternative. In this paper, we propose a new approach to this problem, namely a discrete cosine transform (DCT) that can be steered in any chosen direction. Such transform, called steerable DCT (SDCT), allows to rotate in a flexible way pairs of basis vectors, and enables precise matching of directionality in each image block, achieving improved coding efficiency. The optimal rotation angles for SDCT can be represented as solution of a suitable rate-distortion (RD) problem. We propose iterative methods to search such solution, and we develop a fully fledged image encoder to practically compare our techniques with other competing transforms. Analytical and numerical results prove that SDCT outperforms both DCT and state-of-the-art directional transforms

    Spatial intra-prediction based on mixtures of sparse representations

    Get PDF
    Abstract-In this paper, we consider the problem of spatial prediction based on sparse representations. Several algorithms dealing with this problem can be found in the literature. We propose a novel method involving a mixture of sparse representations. We first place this approach into a probabilistic framework and then derive a practical procedure to solve it. Comparisons of the rate-distortion performance show the superiority of the proposed algorithm with regard to other stateof-the-art algorithms

    Direction-adaptive transforms for coding prediction residuals

    Full text link

    Anisotropic multiscale sparse learned bases for image compression

    Full text link

    Graph Spectral Image Processing

    Full text link
    Recent advent of graph signal processing (GSP) has spurred intensive studies of signals that live naturally on irregular data kernels described by graphs (e.g., social networks, wireless sensor networks). Though a digital image contains pixels that reside on a regularly sampled 2D grid, if one can design an appropriate underlying graph connecting pixels with weights that reflect the image structure, then one can interpret the image (or image patch) as a signal on a graph, and apply GSP tools for processing and analysis of the signal in graph spectral domain. In this article, we overview recent graph spectral techniques in GSP specifically for image / video processing. The topics covered include image compression, image restoration, image filtering and image segmentation

    Sparse/DCT (S/DCT) Two-Layered Representation of Prediction Residuals for Video Coding

    Full text link

    Sub-pixel gradient ๋ฅผ ํ™œ์šฉํ•œ compound ์˜์ƒ ์••์ถ•

    Get PDF
    ํ•™์œ„๋…ผ๋ฌธ (๋ฐ•์‚ฌ)-- ์„œ์šธ๋Œ€ํ•™๊ต ๋Œ€ํ•™์› : ์ „๊ธฐยท์ปดํ“จํ„ฐ๊ณตํ•™๋ถ€, 2014. 2. ๊น€์ˆ˜ํ™˜.์ปดํ“จํ„ฐ ์„ฑ๋Šฅ๊ณผ ๋„คํŠธ์›Œํฌ ์†๋„๊ฐ€ ๋ฐœ์ „ํ•จ์— ๋”ฐ๋ผ ์ปดํ“จํ„ฐ ํ™”๋ฉด์— ํ‘œ์‹œ๋˜๋Š” compound image ์˜ ๊ธฐ์ˆ ์€ ๋‹ค์–‘ํ•œ ์ „์†ก ํ™˜๊ฒฝ์—์„œ ๋น„๋””์˜ค ๋ฐ ์–‘๋ฐฉํ–ฅ ์„œ๋น„์Šค๊ฐ€ ๊ฐ€๋Šฅํ•ด์กŒ๋‹ค. ๊ทธ๋Ÿฌ๋‚˜ compound image๋Š” ๋‹ค์–‘ํ•œ ์ข…๋ฅ˜์˜ ์˜์ƒ์ด ๋ณตํ•ฉ์ ์œผ๋กœ ๋‚˜ํƒ€๋‚˜๊ธฐ ๋•Œ๋ฌธ์— ์˜์ƒ์˜ ์ข…๋ฅ˜๋ฅผ ๋ช…ํ™•ํžˆ ๊ตฌ๋ถ„ํ•˜๊ณ  ๊ฐ ์ข…๋ฅ˜์— ๋งž๋Š” ์˜์ƒ ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ๋ฐฉ์‹์ด ํ•„์š”ํ•˜๊ฒŒ ๋œ๋‹ค. ์˜์ƒ์˜ ๋ฐ์ดํ„ฐ ์ฒ˜๋ฆฌ ๋ฐฉ์‹์ด ๋ณต์žกํ•ด ์งˆ์ˆ˜๋ก ์„œ๋ฒ„์™€ ํด๋ผ์ด์–ธํŠธ์˜ ์„ฑ๋Šฅ ๋ถˆ๊ท ํ˜•์€ ๋ฐ์ดํ„ฐ๋ฅผ ์›ํ™œํžˆ ์ƒ์„ฑ/์žฌํ˜„ ํ•˜์ง€ ๋ชปํ•˜๋Š” ๋ฌธ์ œ๋ฅผ ๊ฐ€์งˆ ์ˆ˜ ์žˆ๋‹ค. Compound image ์˜ ๋ถ„๋ฅ˜๋Š” ํ…์ŠคํŠธ๋กœ ๊ตฌ์„ฑ๋œ ๋ถ€๋ถ„์— ๋Œ€ํ•˜์—ฌ ๋‹ค๋ฅธ ์ข…๋ฅ˜์˜ ์˜์ƒ์œผ๋กœ ๋ถ„๋ฅ˜ํ•˜์ง€ ์•Š์•„์•ผ ํ•œ๋‹ค. ์ด๋Š” ๋ธ”๋ก ๋‹จ์œ„๋กœ ๊ตฌ๋ถ„ํ•˜์—ฌ ๋ถ„๋ฅ˜ํ•˜๋Š” ๋ฐฉ๋ฒ•์—์„œ ์ธ์ ‘ํ•œ ๋ธ”๋ก๊ฐ„์— ์„œ๋กœ ๋‹ค๋ฅธ ์ฝ”๋”ฉ ๋ฐฉ๋ฒ•์„ ์ ์šฉํ•˜๊ฒŒ ๋˜๋ฉด ์‚ฌ๋žŒ์ด ๋Š๋ผ๋Š” ์˜์ƒ์˜ ํ™”์งˆ์€ ๋‚ฎ์•„์ง€๊ฒŒ ๋œ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ด๋Ÿฌํ•œ ๋ฌธ์ œ์ ์„ ํ•ด๊ฒฐํ•˜๊ธฐ ์œ„ํ•ด์„œ ํ…์ŠคํŠธ์˜ ์ƒ์„ฑ๊ณผ์ •์„ ์—ญ์ด์šฉํ•œ sub-pixel gradient ๋ธ”๋ก ๋ถ„๋ฅ˜ ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค. ํ‰ํŒ ๋””์Šคํ”Œ๋ ˆ์ด์—์„œ๋Š” ํ…์ŠคํŠธ์˜ ๋ถ€๋“œ๋Ÿฌ์›€์„ ํ‘œํ˜„ํ•˜๊ธฐ ์œ„ํ•˜์—ฌ sub-pixel ๋‹จ์œ„๋กœ ์ปฌ๋Ÿฌ์˜ ๋ณ€ํ™”๋Ÿ‰์„ ์กฐ์ ˆํ•˜๊ฒŒ ๋œ๋‹ค. ์ด๋ฅผ whole-pixel์˜ ๋‹จ์œ„๋กœ ์˜์ƒ์„ ๊ตฌ๋ถ„ํ•˜๊ฒŒ ๋˜๋ฉด, ํ…์ŠคํŠธ์˜ ์˜์—ญ์„ ๋ช…ํ™•ํ•˜๊ฒŒ ๊ตฌ๋ถ„ํ•˜์ง€ ๋ชปํ•œ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—๋Š” sub-pixel gradient ๋ธ”๋ก ๋ถ„๋ฅ˜ ๋ฐฉ๋ฒ•์„ ํ†ตํ•˜์—ฌ ํ…์ŠคํŠธ๋กœ ๊ตฌ์„ฑ๋œ ์˜์—ญ๊ณผ ํ…์ŠคํŠธ๊ฐ€ ์•„๋‹Œ ์˜์—ญ์— ๋Œ€ํ•œ ํŒ๋‹จ์ด ์ •ํ™•ํžˆ ์ด๋ฃจ์–ด์ง์„ ์‹คํ—˜์„ ํ†ตํ•˜์—ฌ ํ™•์ธํ•˜์˜€๋‹ค. ํ…์ŠคํŠธ์˜ ์ฝ”๋”ฉ๋ฐฉ๋ฒ• ์ค‘ ์†์‹ค ์••์ถ•๋ฐฉ๋ฒ•์€ ํ…์ŠคํŠธ๋กœ ๊ตฌ์„ฑ๋œ ์˜์ƒ์ด ๋†’์€ ์ฃผํŒŒ์ˆ˜๋ฅผ ๊ฐ€์ง€๋Š” ์˜์ƒ์ด๊ธฐ ๋•Œ๋ฌธ์— ์–‘์žํ™”๋‚˜ ๋ณ€ํ™˜๊ณผ์ •์„ ๊ฑฐ์น˜๊ฒŒ ๋˜๋ฉด ์˜์ƒ์˜ ์†์‹ค์ด ์ปค์ง€๊ฒŒ ๋œ๋‹ค. ํ•˜์ง€๋งŒ ๋ฌด ์†์‹ค ์••์ถ• ๋ฐฉ๋ฒ•์€ ๋†’์€ ๋ฐ์ดํ„ฐ ๋Ÿ‰์„ ๊ฐ€์ง€๊ฒŒ ๋˜๊ณ , ์˜์ƒ ์ „์†ก ์†๋„๊ฐ€ ๋†’์•„์ ธ์•ผ ํ•˜๋Š” ๋ฌธ์ œ๊ฐ€ ๋ฐœ์ƒํ•œ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” sub-pixel gradient ๋ฐฉ๋ฒ•์„ ์ด์šฉํ•œ ํ…์ŠคํŠธ ์˜์—ญ์— ๋Œ€ํ•œ ์ฝ”๋”ฉ ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค. ํ…์ŠคํŠธ ์˜์ƒ์ด ๊ฐ€์ง€๋Š” ํŠน์„ฑ์„ ์ด์šฉํ•˜์—ฌ ์˜์ƒ์—์„œ ๋ฐœ์ƒํ•˜๋Š” ๊ธฐ์šธ๊ธฐ์— ๋Œ€ํ•˜์—ฌ ์ฝ”๋”ฉ์„ ์ง„ํ–‰ํ•œ๋‹ค. ์ด๋ฅผ ํ†ตํ•˜์—ฌ ์˜์ƒ์˜ ์†์‹ค์„ ์ค„์ด๊ณ  ํ…์ŠคํŠธ์˜ ๊ฐ€๋…์„ฑ์„ ๋†’์ผ ์ˆ˜ ์žˆ๋‹ค. ๋™์ผํ•œ ์••์ถ•๋ฅ ์—์„œ ๋‹ค๋ฅธ ์••์ถ• ์•Œ๊ณ ๋ฆฌ์ฆ˜์— ๋น„ํ•˜์—ฌ ํ…์ŠคํŠธ์˜ ํ™”์งˆ๊ณผ ๊ฐ€๋…์„ฑ์ด ๋›ฐ์–ด๋‚จ์„ ํ™•์ธํ•˜์˜€๋‹ค. Compound image๋Š” ์ž์—ฐ ์˜์ƒ๊ณผ๋Š” ๋‹ค๋ฅด๊ฒŒ ์›€์ง์ž„์ด ๋‹จ์ˆœํ•˜๊ณ  ๋…ธ์ด์ฆˆ๊ฐ€ ์—†๋‹ค๋Š” ํŠน์„ฑ์„ ๊ฐ€์ง„๋‹ค. ์ด๋Š” ๊ธฐ์กด์˜ ์›€์ง์ž„ ์ถ”์ •๋ฐฉ๋ฒ•์— ๋น„ํ•˜์—ฌ ๋ณต์žก๋„๊ฐ€ ๋‚ฎ์€ ๋ฐฉ๋ฒ•์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•œ๋‹ค. ๋ณธ ์—ฐ๊ตฌ์—์„œ๋Š” ์ด๋Ÿฌํ•œ compound image์˜ ์˜์ƒ ํŠน์„ฑ์„ ์ด์šฉํ•œ ๊ทธ๋ฃน ์›€์ง์ž„ ์ถ”์ • ๋ฐฉ๋ฒ•์„ ์ œ์‹œํ•œ๋‹ค. ํ”ฝ์…€์˜ ์›€์ง์ž„์„ ํ™•์ธํ•˜๊ธฐ ์ „์— ์˜์ƒ์˜ ๋ถ„๋ฅ˜์— ๋”ฐ๋ผ ๋ถ„๋ฅ˜๋œ ์˜์—ญ์˜ ์›€์ง์ž„์„ ๋จผ์ € ํŒŒ์•…ํ•˜๊ณ  ์ด๋ฅผ ํ†ตํ•˜์—ฌ ์ตœ์ข…์ ์ธ ์›€์ง์ž„์„ ์ถ”์ •ํ•˜๊ฒŒ ๋œ๋‹ค. ๊ทธ๋ฃน ์›€์ง์ž„ ์ถ”์ • ๋ฐฉ๋ฒ•์„ ์‚ฌ์šฉํ•˜๋ฉด ๊ธฐ์กด์˜ ํƒ์ƒ‰์˜์—ญ ๋ฐฉ๋ฒ•๊ณผ ๋น„๊ตํ•˜์—ฌ ํƒ์ƒ‰ ์˜์—ญ์„ ์ตœ์†Œํ™” ํ•  ์ˆ˜ ์žˆ์œผ๋ฉฐ, ๋ณต์žก๋„๋ฅผ ๋‚ฎ์ถœ ์ˆ˜ ์žˆ์Œ์„ ์‹คํ—˜์„ ํ†ตํ•˜์—ฌ ํ™•์ธํ•˜์˜€๋‹ค.์ดˆ ๋ก i ์ฐจ ๋ก€ iii ๊ทธ๋ฆผ ๋ชฉ์ฐจ vi ํ‘œ ๋ชฉ ์ฐจ ix ์ œ1์žฅ ์„œ ๋ก  1 1.1 ์—ฐ๊ตฌ ๋ฐฐ๊ฒฝ 1 1.2 ์—ฐ๊ตฌ ๋‚ด์šฉ 4 1.3 ๋…ผ๋ฌธ ๊ตฌ์„ฑ 6 ์ œ2์žฅ ํ…์ŠคํŠธ ์ƒ์„ฑ๊ณผ์ • ๋ฐ ๊ธฐ์กด์••์ถ•๋ฐฉ๋ฒ• 7 2.1 ํ…์ŠคํŠธ ์ƒ์„ฑ๊ณผ์ • 7 2.2 ํ‘œ์ค€ ์˜์ƒ ์••์ถ• ๋ฐฉ๋ฒ• 14 2.3 H.264 inter prediction 16 2.4 Compound image ์˜ ์••์ถ• ์•Œ๊ณ ๋ฆฌ์ฆ˜ 19 ์ œ3์žฅ Sub-pixel gradient ๋ธ”๋ก ๋ถ„๋ฅ˜ ๋ฐฉ๋ฒ• 23 3.1 Background & Text color extraction 28 3.2 Text De-colorization 32 3.3 ๋ธ”๋ก ๋ถ„๋ฅ˜ ์‹คํ—˜ ๊ฒฐ๊ณผ 38 ์ œ4์žฅ Sub-pixel Gradient text ๋ธ”๋ก ์ฝ”๋”ฉ ๋ฐฉ๋ฒ• 46 4.1 Gradient fitting process 51 4.2 Text Coding 56 4.2.1 Gradient๋กœ ๊ตฌ์„ฑ๋œ ๋ถ€๋ถ„์˜ ์ฝ”๋”ฉ๋ฐฉ๋ฒ• 56 4.2.2 Gradient๊ฐ€ ์—†๋Š” ๋ถ€๋ถ„์˜ ์ฝ”๋”ฉ๋ฐฉ๋ฒ• 57 4.2.3 local min/max ๊ฐ’ ์˜ˆ์ธก 57 4.2.4 Whole-pixel ์ฝ”๋”ฉ 59 4.2.5 ํ™”์งˆ enhancement 60 4.3 ํ…์ŠคํŠธ ์ฝ”๋”ฉ ๋™์ž‘ 64 4.3.1 ํ…์ŠคํŠธ ์ฝ”๋”ฉ ์ž…๋ ฅ 65 4.3.2 Whole-pixel ์ฝ”๋”ฉ 1 66 4.3.3 ์—ญ๋ฐฉํ–ฅ Sub-pixel gradient ์ฝ”๋”ฉ 1 67 4.3.4 Local minimum ์ฝ”๋”ฉ 1 69 4.3.5 ์ˆœ๋ฐฉํ–ฅ gradient ์ฝ”๋”ฉ 1 70 4.3.6 Local maximum ์ฝ”๋”ฉ 1 71 4.3.7 ์—ญ๋ฐฉํ–ฅ gradient ์ฝ”๋”ฉ 2 72 4.3.8 Local minimum ์ฝ”๋”ฉ 2 73 4.3.9 ์ˆœ๋ฐฉํ–ฅ gradient ์ฝ”๋”ฉ 2 74 4.3.10 Whole-pixel ์ฝ”๋”ฉ 2 75 4.4 ํ…์ŠคํŠธ ๋ธ”๋ก ์ฝ”๋”ฉ ์‹คํ—˜ ๊ฒฐ๊ณผ 77 ์ œ5์žฅ ๊ทธ๋ฃน ์›€์ง์ž„ ์ถ”์ • ๋ฐฉ๋ฒ• 88 5.1 Block Grouping 94 5.2 Group Matching 97 5.3 Group motion vector calculation 101 5.4 ๊ทธ๋ฃน ์›€์ง์ž„ ์ถ”์ • ๋ฐฉ๋ฒ• ์‹คํ—˜ ๊ฒฐ๊ณผ 104 ์ œ6์žฅ ๊ฒฐ ๋ก  109 ์ฐธ ๊ณ  ๋ฌธ ํ—Œ 112 Abstract 119Docto
    corecore