14 research outputs found

Directional edge and texture representations for image processing

Author: Yao Zhen
Publication venue
Publication date: 01/10/2007
Field of study

An efficient representation for natural images is of fundamental importance in image processing and analysis. The commonly used separable transforms such as wavelets are not best suited for images due to their inability to exploit directional regularities such as edges and oriented textural patterns, while most of the recently proposed directional schemes cannot represent these two types of features in a unified transform. This thesis focuses on the development of directional representations for images which can capture both edges and textures in a multiresolution manner. The thesis first considers the problem of extracting linear features with the multiresolution Fourier transform (MFT). Based on a previous MFT-based linear feature model, the work extends the extraction method to the situation where the image is corrupted by noise. The problem is tackled by the combination of a "Signal+Noise" frequency model, a refinement stage and a robust classification scheme. As a result, the MFT is able to perform linear feature analysis on noisy images on which previous methods failed. A new set of transforms called the multiscale polar cosine transforms (MPCT) is also proposed in order to represent textures. The MPCT can be regarded as a real-valued MFT with similar basis functions of oriented sinusoids. It is shown that the transform can represent textural patches more efficiently than the conventional Fourier basis. With a directional best cosine basis, the MPCT packet (MPCPT) is shown to be an efficient representation for edges and textures, despite its high computational burden. The problem of representing edges and textures in a fixed transform with less complexity is then considered. This is achieved by applying a Gaussian frequency filter, which matches the dispersion of the magnitude spectrum, to the local MFT coefficients. This is particularly effective in denoising natural images, due to its ability to preserve both types of feature.
Further improvements can be made by employing the information given by the linear feature extraction process in the filter's configuration. The denoising results compare favourably against other state-of-the-art directional representations.

Warwick Research Archives Portal Repository

An adaptive minimum spanning tree multi-element method for uncertainty quantification of smooth and discontinuous responses

Author: Halder Y. (Yous) van
Koren B. (Barry)
Sanderse B. (Benjamin)
Publication venue
Publication date: 19/03/2018
Field of study

CWI's Institutional Repository

An adaptive minimum spanning tree multi-element method for uncertainty quantification of smooth and discontinuous responses

Author: Koren Barry
Sanderse Benjamin
van Halder Yous
Publication venue
Publication date: 19/03/2018
Field of study

A novel approach for non-intrusive uncertainty propagation is proposed. Our approach overcomes the limitation of many traditional methods, such as generalised polynomial chaos methods, which may lack sufficient accuracy when the quantity of interest depends discontinuously on the input parameters. As a remedy we propose an adaptive sampling algorithm based on minimum spanning trees combined with a domain decomposition method based on support vector machines. The minimum spanning tree determines new sample locations based on both the probability density of the input parameters and the gradient in the quantity of interest. The support vector machine efficiently decomposes the random space into multiple elements, avoiding the appearance of Gibbs phenomena near discontinuities. On each element, local approximations are constructed by means of least orthogonal interpolation, in order to produce stable interpolation on the unstructured sample set. The resulting minimum spanning tree multi-element method does not require initial knowledge of the behaviour of the quantity of interest and automatically detects whether discontinuities are present. We present several numerical examples that demonstrate the accuracy, efficiency and generality of the method. Comment: 20 pages, 18 figures
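The sampling idea in the abstract above can be illustrated with a small sketch. It is not the authors' algorithm: the abstract says edges are scored using both the input probability density and the gradient of the quantity of interest, whereas this simplified version builds a plain Euclidean minimum spanning tree (Prim's algorithm) and uses a hypothetical refinement rule that places the next sample at the midpoint of the tree edge with the largest jump in response value. The function names `minimum_spanning_tree` and `next_sample` are illustrative only.

```python
import math

def minimum_spanning_tree(points):
    """Prim's algorithm on the complete Euclidean graph; returns (i, j) index pairs."""
    n = len(points)
    if n < 2:
        return []
    in_tree = [False] * n
    best_dist = [math.inf] * n   # cheapest known connection of each node to the tree
    parent = [-1] * n
    best_dist[0] = 0.0
    edges = []
    for _ in range(n):
        # Pick the node outside the tree that is cheapest to attach.
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best_dist[i])
        in_tree[u] = True
        if parent[u] >= 0:
            edges.append((parent[u], u))
        # Relax connection costs of the remaining nodes through u.
        for v in range(n):
            if not in_tree[v]:
                d = math.dist(points[u], points[v])
                if d < best_dist[v]:
                    best_dist[v] = d
                    parent[v] = u
    return edges

def next_sample(points, values):
    """Hypothetical rule (an assumption, not from the paper): midpoint of the
    MST edge across which the response value changes the most."""
    edges = minimum_spanning_tree(points)
    i, j = max(edges, key=lambda e: abs(values[e[0]] - values[e[1]]))
    return tuple((a + b) / 2 for a, b in zip(points[i], points[j]))

# Four samples on a line with a step in the response between x = 1 and x = 2.
points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (3.0, 0.0)]
values = [0.0, 0.0, 1.0, 1.0]
new_point = next_sample(points, values)
```

With this toy input the new sample falls between the two points that straddle the step, which is the qualitative behaviour an adaptive scheme of this kind aims for near a discontinuity.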

arXiv.org e-Print Archive

Repository TU/e

CWI's Institutional Repository

Pure OAI Repository

Multiresolution image models and estimation techniques

Author: Goossens Bart
Publication venue: Ghent University. Faculty of Engineering
Publication date: 01/01/2010
Field of study

Ghent University Academic Bibliography

An adaptive minimum spanning tree multielement method for uncertainty quantification of smooth and discontinuous responses

Author: Halder Y. (Yous) van
Koren B. (Barry)
Sanderse B. (Benjamin)
Publication venue: 'Society for Industrial & Applied Mathematics (SIAM)'
Publication date: 01/01/2019
Field of study

A novel approach for nonintrusive uncertainty propagation is proposed. Our approach overcomes the limitation of many traditional methods, such as generalized polynomial chaos methods, which may lack sufficient accuracy when the quantity of interest depends discontinuously on the input parameters. As a remedy we propose an adaptive sampling algorithm based on minimum spanning trees combined with a domain d

CWI's Institutional Repository

A Hybrid Multibiometric System for Personal Identification Based on Face and Iris Traits. The Development of an automated computer system for the identification of humans by integrating facial and iris features using Localization, Feature Extraction, Handcrafted and Deep learning Techniques.

Author: Nassar Alaa S.N.
Publication venue: School of Electrical Engineering and Computer Science
Publication date: 01/01/2018
Field of study

Multimodal biometric systems have been widely applied in many real-world applications due to their ability to deal with a number of significant limitations of unimodal biometric systems, including sensitivity to noise, population coverage, intra-class variability, non-universality, and vulnerability to spoofing. This PhD thesis is focused on the combination of both the face and the left and right irises in a unified hybrid multimodal biometric identification system, using different fusion approaches at the score and rank level. Firstly, the facial features are extracted using a novel multimodal local feature extraction approach, termed the Curvelet-Fractal approach, which is based on merging the advantages of the Curvelet transform with the Fractal dimension. Secondly, a novel framework based on merging the advantages of local handcrafted feature descriptors with deep learning approaches is proposed, the Multimodal Deep Face Recognition (MDFR) framework, to address the face recognition problem in unconstrained conditions. Thirdly, an efficient deep learning system is employed, termed IrisConvNet, whose architecture is based on a combination of a Convolutional Neural Network (CNN) and a Softmax classifier to extract discriminative features from an iris image. Finally, the performance of the unimodal and multimodal systems has been evaluated by conducting a number of extensive experiments on large-scale unimodal databases (FERET, CAS-PEAL-R1, LFW, CASIA-Iris-V1, CASIA-Iris-V3 Interval, MMU1 and IITD) and on the SDUMLA-HMT multimodal dataset.
The results obtained have demonstrated the superiority of the proposed systems compared to the previous works by achieving new state-of-the-art recognition rates on all the employed datasets with less time required to recognize the person’s identity.
Higher Committee for Education Development in Ira

Bradford Scholars

Super-resolution: A comprehensive survey

Author: Kamal Nasrollahi
Thomas B. Moeslund
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/06/2014
Field of study

Crossref

VBN

Fusion of magnetic resonance and ultrasound images for endometriosis detection

Author: El Mansouri Oumaima
Publication venue
Publication date: 07/12/2020
Field of study

Endometriosis is a gynecologic disorder that typically affects women in their reproductive age and is associated with chronic pelvic pain and infertility. In the context of pre-operative diagnosis and guided surgery, endometriosis is a typical example of a pathology that requires the use of both magnetic resonance (MR) and ultrasound (US) modalities. These modalities are used side by side because they contain complementary information. However, MRI and US images have different spatial resolutions, fields of view and contrasts and are corrupted by different kinds of noise, which results in important challenges related to their analysis by radiologists. The fusion of MR and US images is a way of facilitating the task of medical experts and improving the pre-operative diagnosis and the surgery mapping. The object of this PhD thesis is to propose a new automatic fusion method for MRI and US images. First, we assume that the MR and US images to be fused are aligned, i.e., there is no geometric distortion between these images. We propose a fusion method for MR and US images, which aims at combining the advantages of each modality, i.e., good contrast and signal-to-noise ratio for the MR image and good spatial resolution for the US image. The proposed algorithm is based on an inverse problem, performing a super-resolution of the MR image and a denoising of the US image. A polynomial function is introduced to model the relationships between the gray levels of the MR and US images. However, the proposed fusion method is very sensitive to registration errors. Thus, in a second step, we introduce a joint fusion and registration method for MR and US images. Registration is a complicated task in practical applications. The proposed MR/US image fusion jointly performs super-resolution of the MR image and despeckling of the US image, and is able to automatically account for registration errors.
A polynomial function is used to link ultrasound and MR images in the fusion process, while an appropriate similarity measure is introduced to handle the registration problem. The proposed registration is based on a non-rigid transformation containing a local elastic B-spline model and a global affine transformation. The fusion and registration operations are performed alternately, simplifying the underlying optimization problem. The interest of the joint fusion and registration is analyzed using synthetic and experimental phantom images.

Open Archive Toulouse Archive Ouverte

Super Resolution of Wavelet-Encoded Images and Videos

Author: Atalay Vildan
Publication venue: 'Information Bulletin on Variable Stars (IBVS)'
Publication date: 01/01/2017
Field of study

In this dissertation, we address the multiframe super resolution reconstruction problem for wavelet-encoded images and videos. The goal of multiframe super resolution is to obtain one or more high resolution images by fusing a sequence of degraded or aliased low resolution images of the same scene. Since the low resolution images may be unaligned, a registration step is required before super resolution reconstruction. Therefore, we first explore in-band (i.e. in the wavelet domain) image registration, and then investigate super resolution. Our motivation for analyzing the image registration and super resolution problems in the wavelet domain is the growing trend in wavelet-encoded imaging, and wavelet encoding for image/video compression. Due to drawbacks of the widely used discrete cosine transform in image and video compression, a considerable amount of literature is devoted to wavelet-based methods. However, since wavelets are shift-variant, existing methods cannot utilize wavelet subbands efficiently. In order to overcome this drawback, we establish and explore the direct relationship between the subbands under a translational shift, for image registration and super resolution. We then employ our devised in-band methodology in a motion compensated video compression framework, to demonstrate the effective usage of wavelet subbands. Super resolution can also be used as a post-processing step in video compression in order to decrease the size of the video files to be compressed, with downsampling added as a pre-processing step. Therefore, we present a video compression scheme that utilizes super resolution to reconstruct the high frequency information lost during downsampling. In addition, super resolution is a crucial post-processing step for satellite imagery, due to the fact that it is hard to update imaging devices after a satellite is launched. Thus, we also demonstrate the usage of our devised methods in enhancing the resolution of pansharpened multispectral images.
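The shift-variance of wavelets mentioned in the abstract above can be seen in a few lines. This is a minimal sketch, assuming a one-level orthonormal Haar transform on a 1-D signal of even length; the dissertation itself works with images and does not specify this example. The helper name `haar_level1` is illustrative.

```python
def haar_level1(signal):
    """One level of the orthonormal Haar DWT; returns (approximation, detail)."""
    s = 2 ** 0.5
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail

# A short pulse aligned with the Haar sample pairs, and the same pulse
# circularly shifted by one sample.
x = [0, 0, 1, 1, 0, 0, 0, 0]
shifted = x[-1:] + x[:-1]

_, detail_x = haar_level1(x)
_, detail_shifted = haar_level1(shifted)

# Energy in the detail subband before and after the shift.
energy_x = sum(d * d for d in detail_x)
energy_shifted = sum(d * d for d in detail_shifted)
```

For the aligned pulse every detail coefficient is zero, while the one-sample shift moves energy into the detail subband. The subbands of a shifted signal are therefore not simply shifted copies of the original subbands, which is why a direct relationship between subbands under translation is useful for in-band registration.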

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Acquisition, compression and rendering of depth and texture for multi-view video

Author: Morvan Y.
Publication venue: Technische Universiteit Eindhoven
Publication date: 01/01/2009
Field of study

Three-dimensional (3D) video and imaging technologies are an emerging trend in the development of digital video systems, as we presently witness the appearance of 3D displays, coding systems, and 3D camera setups. Three-dimensional multi-view video is typically obtained from a set of synchronized cameras, which capture the same scene from different viewpoints. This technique especially enables applications such as free-viewpoint video or 3D-TV. Free-viewpoint video applications provide the feature to interactively select and render a virtual viewpoint of the scene. A 3D experience, such as in 3D-TV, is obtained if the data representation and display make it possible to distinguish the relief of the scene, i.e., the depth within the scene. With 3D-TV, the depth of the scene can be perceived using a multi-view display that simultaneously renders several views of the same scene. To render these multiple views on a remote display, an efficient transmission, and thus compression, of the multi-view video is necessary. However, a major problem when dealing with multi-view video is the intrinsically large amount of data to be compressed, decompressed and rendered. We aim at an efficient and flexible multi-view video system, and explore three different aspects. First, we develop an algorithm for acquiring a depth signal from a multi-view setup. Second, we present efficient 3D rendering algorithms for a multi-view signal. Third, we propose coding techniques for 3D multi-view signals, based on the use of an explicit depth signal. Accordingly, the thesis is divided into three parts. The first part (Chapter 3) addresses the problem of 3D multi-view video acquisition. Multi-view video acquisition refers to the task of estimating and recording a 3D geometric description of the scene. A 3D description of the scene can be represented by a so-called depth image, which can be estimated by triangulation of the corresponding pixels in the multiple views.
Initially, we focus on the problem of depth estimation using two views, and present the basic geometric model that enables the triangulation of corresponding pixels across the views. Next, we review two calculation/optimization strategies for determining corresponding pixels: a local and a one-dimensional optimization strategy. Then, to generalize from the two-view case, we introduce a simple geometric model for estimating the depth using multiple views simultaneously. Based on this geometric model, we propose a new multi-view depth-estimation technique, employing a one-dimensional optimization strategy that (1) reduces the noise level in the estimated depth images and (2) enforces consistent depth images across the views. The second part (Chapter 4) details the problem of multi-view image rendering. Multi-view image rendering refers to the process of generating synthetic images using multiple views. Two different rendering techniques are initially explored: a 3D image warping and a mesh-based rendering technique. Each of these methods has its limitations and suffers from either high computational complexity or low image rendering quality. As a consequence, we present two image-based rendering algorithms that improve the balance between these two issues. First, we derive an alternative formulation of the relief texture algorithm, which is extended to the geometry of multiple views. The proposed technique features two advantages: it avoids rendering artifacts ("holes") in the synthetic image and it is suitable for execution on a standard Graphics Processing Unit (GPU). Second, we propose an inverse mapping rendering technique that allows a simple and accurate re-sampling of synthetic pixels. Experimental comparisons with 3D image warping show an improvement of rendering quality of 3.8 dB for the relief texture mapping and 3.0 dB for the inverse mapping rendering technique.
The third part concentrates on the compression problem of multi-view texture and depth video (Chapters 5–7). In Chapter 5, we extend the standard H.264/MPEG-4 AVC video compression algorithm for handling the compression of multi-view video. As opposed to the Multi-view Video Coding (MVC) standard that encodes only the multi-view texture data, the proposed encoder performs the compression of both the texture and the depth multi-view sequences. The proposed extension is based on exploiting the correlation between the multiple camera views. To this end, two different approaches for predictive coding of views have been investigated: a block-based disparity-compensated prediction technique and a View Synthesis Prediction (VSP) scheme. Whereas VSP relies on an accurate depth image, the block-based disparity-compensated prediction scheme can be performed without any geometry information. Our encoder adaptively selects the most appropriate prediction scheme using a rate-distortion criterion for an optimal prediction-mode selection. We present experimental results for several texture and depth multi-view sequences, yielding a quality improvement of up to 0.6 dB for the texture and 3.2 dB for the depth, when compared to solely performing H.264/MPEG-4 AVC disparity-compensated prediction. Additionally, we discuss the trade-off between random access to a user-selected view and the coding efficiency. Experimental results illustrating and quantifying this trade-off are provided. In Chapter 6, we focus on the compression of a depth signal. We present a novel depth image coding algorithm which concentrates on the special characteristics of depth images: smooth regions delineated by sharp edges. The algorithm models these smooth regions using parameterized piecewise-linear functions and sharp edges by a straight line, so that it is more efficient than a conventional transform-based encoder.
To optimize the quality of the coding system for a given bit rate, a special global rate-distortion optimization balances the rate against the accuracy of the signal representation. For typical bit rates, i.e., between 0.01 and 0.25 bit/pixel, experiments have revealed that the coder outperforms a standard JPEG-2000 encoder by 0.6-3.0 dB. Preliminary results were published in the Proceedings of the 26th Symposium on Information Theory in the Benelux. In Chapter 7, we propose a novel joint depth-texture bit-allocation algorithm for the joint compression of texture and depth images. The described algorithm combines the depth and texture Rate-Distortion (R-D) curves to obtain a single R-D surface that allows the optimization of the joint bit-allocation in relation to the obtained rendering quality. Experimental results show an estimated gain of 1 dB compared to a compression performed without joint bit-allocation optimization. Besides this, our joint R-D model can be readily integrated into a multi-view H.264/MPEG-4 AVC coder because it yields the optimal compression setting with a limited computation effort.
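The two-view triangulation mentioned in the first part of the abstract above reduces, in the simplest case, to a one-line formula. The sketch below assumes a rectified pair of identical pinhole cameras with parallel optical axes, so that depth is Z = f * B / d for focal length f (in pixels), baseline B and disparity d; the thesis's own geometric model may be more general. The function name and the numbers in the example are illustrative.

```python
def depth_from_disparity(focal_px, baseline_m, disparity_px):
    """Depth (in metres) of a point seen in two rectified views.

    focal_px     -- focal length expressed in pixels
    baseline_m   -- distance between the two camera centres, in metres
    disparity_px -- horizontal offset between the corresponding pixels
    """
    if disparity_px <= 0:
        raise ValueError("disparity must be positive for a point in front of the cameras")
    return focal_px * baseline_m / disparity_px

# Illustrative values: 1000-pixel focal length, 10 cm baseline.
near = depth_from_disparity(1000.0, 0.1, 20.0)  # large disparity, close point
far = depth_from_disparity(1000.0, 0.1, 5.0)    # small disparity, distant point
```

Because depth is inversely proportional to disparity, a one-pixel matching error matters far more for distant points than for near ones, which is one reason noise reduction and cross-view consistency are emphasised for the estimated depth images.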

Pure OAI Repository