Rethinking the Pipeline of Demosaicing, Denoising and Super-Resolution

Author: Dong Chao
Ghanem Bernard
Gu Jinjin
Heidrich Wolfgang
Qian Guocheng
Ren Jimmy S.
Wang Yuanhao
Publication date: 29/03/2021

Incomplete color sampling, noise degradation, and limited resolution are the three key problems that are unavoidable in modern camera systems. Demosaicing (DM), denoising (DN), and super-resolution (SR) are core components in a digital image processing pipeline to overcome the three problems above, respectively. Although each of these problems has been studied actively, the mixture problem of DM, DN, and SR, which has higher practical value, lacks enough attention. Such a mixture problem is usually solved by a sequential solution (applying each method independently in a fixed order: DM → DN → SR), or is simply tackled by an end-to-end network without enough analysis of the interactions among tasks, resulting in an undesired performance drop in the final image quality. In this paper, we rethink the mixture problem from a holistic perspective and propose a new image processing pipeline: DN → SR → DM. Extensive experiments show that simply modifying the usual sequential solution by leveraging our proposed pipeline could enhance the image quality by a large margin. We further adopt the proposed pipeline into an end-to-end network, and present the Trinity Enhancement Network (TENet). Quantitative and qualitative experiments demonstrate the superiority of our TENet to the state-of-the-art. Besides, we notice the literature lacks a full color sampled dataset. To this end, we contribute a new high-quality full color sampled real-world dataset, namely PixelShift200. Our experiments show the benefit of the proposed PixelShift200 dataset for raw image processing.

Comment: Code is available at: https://github.com/guochengqian/TENe

arXiv.org e-Print Archive

Contributions in image and video coding

Author: Testoni Vanessa
Publication venue: [s.n.]
Publication date: 19/08/2018

Advisor: Max Henrique Machado Costa. Thesis (doctorate) - Universidade Estadual de Campinas, Faculdade de Engenharia Elétrica e de Computação.

Abstract: The image and video coding community has often been working on new advances that go beyond traditional image and video architectures. This work is a set of contributions to various topics that have received increasing attention from researchers in the community, namely, scalable coding, low-complexity coding for portable devices, multiview video coding and run-time adaptive coding. The first contribution studies the performance of three fast block-based 3-D transforms in a low-complexity video codec, named the Fast Embedded Video Codec (FEVC). New implementation methods and scanning orders are proposed for the transforms. The 3-D coefficients are encoded bit-plane by bit-plane by entropy coders, producing a fully embedded output bitstream. All implementations use 16-bit integer arithmetic. Only additions and bit shifts are necessary, thus lowering computational complexity. Even with these constraints, reasonable rate versus distortion performance can be achieved and the encoding time is significantly smaller (around 160 times) when compared to the H.264/AVC standard. The second contribution is the optimization of a recent approach proposed for multiview video coding in videoconferencing applications or other similar unicast-like applications. The target scenario in this approach is providing realistic 3-D video with free viewpoint at good compression rates. To achieve such an objective, weights are computed for each view and mapped into quantization parameters. In this work, the previously proposed ad-hoc mapping between weights and quantization parameters is shown to be quasi-optimum for a Gaussian source and an optimum mapping is derived for a typical video source. The third contribution explores several strategies for adaptive scanning of transform coefficients in the JPEG XR standard. The original global adaptive scanning order applied in JPEG XR is compared with the localized and hybrid scanning methods proposed in this work. These new orders do not require changes in either the other coding and decoding stages or in the bitstream definition. The fourth and last contribution proposes a hierarchical signal-dependent block-based transform. Hierarchical transforms usually exploit the residual cross-level information at the entropy coding step, but not at the transform step. The transform proposed in this work is an energy compaction technique that can also exploit these cross-resolution-level structural similarities. The core idea of the technique is to include in the hierarchical transform a number of adaptive basis functions derived from the lower resolution of the signal. A full image codec is developed in order to measure the performance of the new transform and the obtained results are discussed in this work.

Doctorate. Telecommunications and Telematics. Doctor in Electrical Engineering.
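
As an illustration of what "only additions and bit shifts" can look like in an integer transform, here is a minimal sketch. The abstract does not name the three 3-D transforms used in FEVC, so the code below is an assumption chosen for illustration: a one-dimensional integer Haar step (often called the S-transform) on a single pair of samples. The function names are hypothetical and are not taken from the thesis.

```python
def s_transform_forward(a: int, b: int) -> tuple[int, int]:
    """Forward integer Haar (S-transform) step on one sample pair.

    Uses only subtraction, addition and an arithmetic right shift.
    Illustrative only: not the transform defined in the thesis.
    """
    high = a - b               # detail (difference) coefficient
    low = b + (high >> 1)      # floor of the pair average; >> 1 floors toward -inf
    return low, high


def s_transform_inverse(low: int, high: int) -> tuple[int, int]:
    """Inverse step: undoes the forward step exactly, again with add/shift only."""
    b = low - (high >> 1)      # subtract the same term the forward step added
    a = b + high
    return a, b


if __name__ == "__main__":
    low, high = s_transform_forward(7, 4)
    print(low, high)                       # coefficients for the pair (7, 4)
    print(s_transform_inverse(low, high))  # reconstructs the original pair
```

For 8-bit samples the difference coefficient stays within [-255, 255] and the low-pass coefficient within [0, 255], so a single step of this kind fits comfortably in 16-bit integers, which is consistent with the constraint the abstract describes. Reconstruction is exact because the inverse subtracts the same shifted term that the forward step added.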

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositorio da Producao Cientifica e Intelectual da Unicamp

Towards a filmic look and feel in real time computer graphics

Author: Farouk Sherief
Publication venue: DigitalCommons@UMaine
Publication date: 01/05/2013

Film footage has a distinct look and feel that audiences can instantly recognize, making its replication desirable for computer-generated graphics. This thesis presents methods capable of replicating significant portions of the film look and feel while fitting within the constraints imposed by real-time computer-generated graphics on consumer hardware.

University of Maine

Efficiency of remote sensing tools for post-fire management along a climatic gradient

Author: Elena Marcos-Porras
José Manuel Fernández-Guisuraga
Leonor Calvo
Susana Suárez-Seoane
Víctor Fernández-García
Ángela Taboada
Publication venue: 'Elsevier BV'
Publication date: 05/02/2019

P. 553-562. Forest managers require reliable tools to evaluate post-fire recovery across different geographic/climatic contexts and define management actions at the landscape scale, which might be highly resource-consuming in terms of data collection. In this sense, remote sensing techniques allow for gathering environmental data over large areas with low collection effort. We aim to assess the applicability of remote sensing tools in post-fire management within and across three mega-fires that occurred in pine fire-prone ecosystems located along an Atlantic-Transition-Mediterranean climatic gradient. Four years after the wildfires, we established 120 2 × 2 m plots in each mega-fire site, where we evaluated: (1) density of pine seedlings, (2) percentage of woody species cover and (3) percentage of dead plant material cover. These variables were modeled following a Bayesian Model Averaging approach on the basis of spectral indices and texture features derived from WorldView-2 satellite imagery at 2 m spatial resolution. We assessed model interpolation and transferability within each mega-fire, as well as model extrapolation between mega-fires along the climatic gradient. Texture features were the predictors that contributed most in all cases. The woody species cover model had the best performance regarding spatial interpolation and transferability within the three study sites, with predictive errors lower than 25% for the two approaches. Model extrapolation between the Transition and Mediterranean sites had low levels of error (from 6% to 19%) for the three field variables, because the landscape in these areas is similar in structure and function and, therefore, in spectral characteristics. However, model extrapolation from the Atlantic site achieved the weakest results (error higher than 30%), due to the large ecological differences between this particular site and the others.
This study demonstrates the potential of fine-grained satellite imagery for land managers to conduct post-fire recovery studies with a high degree of generality across different geographic/climatic contexts.

Crossref

Leon University (Spain)

Webcam Configurations for Ground Texture Visual Servo

Author: Asgari Nasser
Matsumoto Takeshi
Powers David Martin
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2008

Singapore

Crossref

Flinders Academic Commons

Efficient Debanding Filtering for Inverse Tone Mapped High Dynamic Range Videos

Author: Cosman Pamela C
Song Qing
Su Guan-Ming
Publication venue: eScholarship, University of California
Publication date: 01/01/2020

eScholarship - University of California

Image Understanding and Robotics Research at Columbia University

Author: Allen Peter K.
Boult Terrance E.
Ibrahim Hussein
Kender John R.
Publication venue: 'Columbia University Libraries/Information Services'
Publication date: 01/01/1988

The research investigations of the Vision/Robotics Laboratory at Columbia University reflect the diversity of interests of its four faculty members, two staff programmers and 15 Ph.D. students. Several of the projects involve either a visiting computer science post-doc, other faculty members in the department or the university, or researchers at AT&T Bell Laboratories or Philips Laboratories. We list below a summary of our interests and results, together with the principal researchers associated with them. Since it is difficult to separate those aspects of robotic research that are purely visual from those that are vision-like (for example, tactile sensing) or vision-related (for example, integrated vision-robotic systems), we have listed all robotic research that is not purely manipulative.

Columbia University Academic Commons