Search CORE

51 research outputs found

Enhancing Image Quality: A Comparative Study of Spatial, Frequency Domain, and Deep Learning Methods

Author: Rashmi Agrawal et al.
Publication venue: Auricle Global Society of Education and Research
Publication date: 02/11/2023
Field of study

Image restoration and noise reduction methods have been created to restore deteriorated images and improve their quality. These methods have garnered substantial significance in recent times, mainly due to the growing utilization of digital imaging across diverse domains, including but not limited to medical imaging, surveillance, satellite imaging, and numerous others. In this paper, we conduct a comparative analysis of three distinct approaches to image restoration: the spatial method, the frequency domain method, and the deep learning method. The study was conducted on a dataset of 10,000 images, and the performance of each method was evaluated using the accuracy and loss metrics. The results show that the deep learning method outperformed the other two methods, achieving a validation accuracy of 72.68% after 10 epochs. The spatial method had the lowest accuracy of the three, achieving a validation accuracy of 69.98% after 10 epochs. The FFT frequency domain method had a validation accuracy of 52.87% after 10 epochs, significantly lower than the other two methods. The study demonstrates that deep learning is a promising approach for image classification tasks and outperforms traditional methods such as spatial and frequency domain techniques

International Journal on Recent and Innovation Trends in Computing and Communication

Video content analysis for intelligent forensics

Author: Muhammad Fraz (7169066)
Publication venue
Publication date: 01/01/2014
Field of study

The networks of surveillance cameras installed in public places and private territories continuously record video data with the aim of detecting and preventing unlawful activities. This enhances the importance of video content analysis applications, either for real time (i.e. analytic) or post-event (i.e. forensic) analysis. In this thesis, the primary focus is on four key aspects of video content analysis, namely; 1. Moving object detection and recognition, 2. Correction of colours in the video frames and recognition of colours of moving objects, 3. Make and model recognition of vehicles and identification of their type, 4. Detection and recognition of text information in outdoor scenes. To address the first issue, a framework is presented in the first part of the thesis that efficiently detects and recognizes moving objects in videos. The framework targets the problem of object detection in the presence of complex background. The object detection part of the framework relies on background modelling technique and a novel post processing step where the contours of the foreground regions (i.e. moving object) are refined by the classification of edge segments as belonging either to the background or to the foreground region. Further, a novel feature descriptor is devised for the classification of moving objects into humans, vehicles and background. The proposed feature descriptor captures the texture information present in the silhouette of foreground objects. To address the second issue, a framework for the correction and recognition of true colours of objects in videos is presented with novel noise reduction, colour enhancement and colour recognition stages. The colour recognition stage makes use of temporal information to reliably recognize the true colours of moving objects in multiple frames. The proposed framework is specifically designed to perform robustly on videos that have poor quality because of surrounding illumination, camera sensor imperfection and artefacts due to high compression. In the third part of the thesis, a framework for vehicle make and model recognition and type identification is presented. As a part of this work, a novel feature representation technique for distinctive representation of vehicle images has emerged. The feature representation technique uses dense feature description and mid-level feature encoding scheme to capture the texture in the frontal view of the vehicles. The proposed method is insensitive to minor in-plane rotation and skew within the image. The capability of the proposed framework can be enhanced to any number of vehicle classes without re-training. Another important contribution of this work is the publication of a comprehensive up to date dataset of vehicle images to support future research in this domain. The problem of text detection and recognition in images is addressed in the last part of the thesis. A novel technique is proposed that exploits the colour information in the image for the identification of text regions. Apart from detection, the colour information is also used to segment characters from the words. The recognition of identified characters is performed using shape features and supervised learning. Finally, a lexicon based alignment procedure is adopted to finalize the recognition of strings present in word images. Extensive experiments have been conducted on benchmark datasets to analyse the performance of proposed algorithms. The results show that the proposed moving object detection and recognition technique superseded well-know baseline techniques. The proposed framework for the correction and recognition of object colours in video frames achieved all the aforementioned goals. The performance analysis of the vehicle make and model recognition framework on multiple datasets has shown the strength and reliability of the technique when used within various scenarios. Finally, the experimental results for the text detection and recognition framework on benchmark datasets have revealed the potential of the proposed scheme for accurate detection and recognition of text in the wild

Loughborough University Institutional Repository

Advancements and Breakthroughs in Ultrasound Imaging

Author
Publication venue: 'IntechOpen'
Publication date: 20/04/2021
Field of study

Ultrasonic imaging is a powerful diagnostic tool available to medical practitioners, engineers and researchers today. Due to the relative safety, and the non-invasive nature, ultrasonic imaging has become one of the most rapidly advancing technologies. These rapid advances are directly related to the parallel advancements in electronics, computing, and transducer technology together with sophisticated signal processing techniques. This book focuses on state of the art developments in ultrasonic imaging applications and underlying technologies presented by leading practitioners and researchers from many parts of the world

Directory of Open Access Books (DOAB)

Deep Depth Completion of a Single RGB-D Image

Author: Funkhouser Thomas
Zhang Yinda
Publication venue
Publication date: 01/01/2018
Field of study

The goal of our work is to complete the depth channel of an RGB-D image. Commodity-grade depth cameras often fail to sense depth for shiny, bright, transparent, and distant surfaces. To address this problem, we train a deep network that takes an RGB image as input and predicts dense surface normals and occlusion boundaries. Those predictions are then combined with raw depth observations provided by the RGB-D camera to solve for depths for all pixels, including those missing in the original observation. This method was chosen over others (e.g., inpainting depths directly) as the result of extensive experiments with a new depth completion benchmark dataset, where holes are filled in training data through the rendering of surface reconstructions created from multiview RGB-D scans. Experiments with different network inputs, depth representations, loss functions, optimization methods, inpainting methods, and deep depth estimation networks show that our proposed approach provides better depth completions than these alternatives.Comment: Accepted by CVPR2018 (Spotlight). Project webpage: http://deepcompletion.cs.princeton.edu/ This version includes supplementary materials which provide more implementation details, quantitative evaluation, and qualitative results. Due to file size limit, please check project website for high-res pape

arXiv.org e-Print Archive

Princeton University Open Access Repository

Crossref

Recent Progress in Image Deblurring

Author: Tao Dacheng
Wang Ruxin
Publication venue
Publication date: 24/09/2014
Field of study

This paper comprehensively reviews the recent development of image deblurring, including non-blind/blind, spatially invariant/variant deblurring techniques. Indeed, these techniques share the same objective of inferring a latent sharp image from one or several corresponding blurry images, while the blind deblurring techniques are also required to derive an accurate blur kernel. Considering the critical role of image restoration in modern imaging systems to provide high-quality images under complex environments such as motion, undesirable lighting conditions, and imperfect system components, image deblurring has attracted growing attention in recent years. From the viewpoint of how to handle the ill-posedness which is a crucial issue in deblurring tasks, existing methods can be grouped into five categories: Bayesian inference framework, variational methods, sparse representation-based methods, homography-based modeling, and region-based methods. In spite of achieving a certain level of development, image deblurring, especially the blind case, is limited in its success by complex application conditions which make the blur kernel hard to obtain and be spatially variant. We provide a holistic understanding and deep insight into image deblurring in this review. An analysis of the empirical evidence for representative methods, practical issues, as well as a discussion of promising future directions are also presented.Comment: 53 pages, 17 figure

arXiv.org e-Print Archive

CiteSeerX

A multi-frame super-resolution method based on the variable-exponent nonlinear diffusion regularizer

Author: Baraka Jacob Maiseli
Huijun Gao
Ogada Achieng Elisha
Publication venue: Springer Nature
Publication date: 01/01/2015
Field of study

Springer - Publisher Connector

Super-resolution:A comprehensive survey

Author: A Adler
A Almansa
A Chakrabarti
A Corduneanu
A Gholipour
A Giachetti
A Lorette
A Marquina
A Panagiotopoulou
A Schatzberg
A Zomet
AJ Patti
AJ Patti
AJ Storkey
AJ Tatem
AK Katsaggelos
ALD Martins
AWMV Eekeren
AWMV Eekeren
B Choi
B Cohen
B Huhle
B Li
B Li
B Narayanan
B Wu
BC Song
BGV Kumar
BK Gunturk
BK Gunturk
BK Gunturk
BK Gunturk
BR Hunt
C Jung
C Liu
C Liu
C Miravet
C Miravet
C Papathanassiou
C Pohl
C Su
C Wang
C Wang
CA Segall
CA Segall
CA Segall
CS Tong
CV Jiji
CV Jiji
CV Jiji
D Calle
D Capel
D Datsenko
D Lin
D Pastina
D Rajan
D Rajan
D Rajan
D Rajan
D Rajan
D Rajan
D Robinson
D Robinson
D Yldrm
D Zhang
D Zhang
DO Walsh
DP Capel
E Salari
E Shechtman
EM Hung
F Champagnat
F Humblot
F Rousseau
F Sroubek
F Sroubek
F Zhou
FM Candocia
G Dedeoglu
G Gilboa
G Ye
GH Costa
GH Costa
GH Costa
GK Chantas
GM Callic
GM Callico
H Bouzari
H Chang
H Demirel
H He
H He
H Huang
H Huanga
H Ji
H Nasir
H Shekarforoush
H Shekarforoush
H Shekarforoush
H Stark
H Su
H Su
H Takeda
H Takeda
H Yang
H Zhang
H Zhang
H Zhao
HF Shen
HK Aghajan
I Begin
J Chen
J Chung
J Cui
J Sun
J Tian
J Tian
J Tian
J Wang
J Wang
J Wu
J Yang
J Yang
J Yu
JA Kennedy
JD Ouwerkerk van
JJ Green
JS Park
K Aizawa
K Choi
K Donaldson
K Jia
K Jia
K Jia
K Kimura
K Nasrollahi
Kamal Nasrollahi
KD Sauer
KH Yap
KI Kim
KV Suresh
L Ma
L Zhang
LC Pickup
LC Pickup
LC Pickup
LJ Karam
M Ben-Ezra
M Ben-Ezra
M Bertero
M Carcenac
M Chappalli
M Elad
M Elad
M Elad
M Elad
M Elad
M Elad
M Gevrekci
M Gevrekci
M Gonzalez-Audcana
M Irani
M Irani
M Jung
M Protter
M Protter
M Shen
M Shen
M Shen
M Singh
MC Chiang
MC Hong
MC Pan
MD Robinson
ME Tipping
ME Tipping
MH Cheng
MK Nema
MK Ng
MK Ng
MK Ng
MM Islam
MV Joshi
MV Joshi
MVW Zibetti
MVW Zibetti
MVW Zibetti
MVW Zibetti
N Bose
N Bose
N Goldberg
N Kulkarni
N Nguyen
N Nguyen
NA Woods
NA Yamany
NK Bose
NK Bose
NK Bose
OA Omer
OA Omer
P Chainais
P Kramer
P Milanfar
P Purkait
P Vandewalle
P Vandewalle
P Vandewalle
PD Santis
PE Eren
PP Gajjar
Q Pan
Q Yuan
Q Yuan
R Fransens
R He
R Molina
R Sasaharay
R Tsai
RC Hardie
RC Hardie
RC Hardie
RR Schultz
RR Schultz
RR Schultz
RS Prendergast
RW Gerchberg
S Baker
S Chaudhuri
S Dai
S Farsiu
S Farsiu
S Farsiu
S Farsiu
S Farsiu
S Kim
S Liu
S Lui
S Mallat
S Peleg
S Pelletier
S Peng
S Rajaram
S Yang
S Zhang
S Zhao
SC Park
SD Babacan
SH Keller
SP Belekos
SW Park
T Akgun
T Gotoh
T Katsuki
T Komatsu
T Szydzik
TA Stephenson
TC Ho
TF Gee
Thomas B. Moeslund
V Patanavijit
V Patanavijit
V Patanavijit
W Fan
W Liu
W Liu
W Wu
W Zhang
W Zhao
WT Freeman
WT Freeman
WWW Zou
WZ Shao
X Gao
X Gao
X Gao
X Li
X Li
X Li
X Ma
X Maa
X Wang
X Zeng
X Zhang
Y Altunbasak
Y He
Y He
Y Hu
Y Hu
Y Huang
Y Mochizuki
Y Zhuang
Y-W Tai
YJ Ma
YR Li
Z Arycan
Z Bi
Z Jiang
Z Lin
Z Lin
Z Wang
Z Wang
Z Xiong
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/06/2014
Field of study

Crossref

VBN

Recommended from our members

Deep learning assisted MRI guided attenuation correction in PET

Author: Mecheter Imene
Publication venue: Brunel University London
Publication date: 01/01/2021
Field of study

This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University LondonPositron emission tomography (PET) is a unique imaging modality that provides physiological and functional details of the tissue at the molecular level. However, the acquired PET images have some limitations such as the attenuation. PET attenuation correction is an essential step to obtain the full potential of PET quantification. With the wide use of hybrid PET/MR scanners, magnetic resonance (MR) images are used to address the problem of PET attenuation correction. The MR images segmentation is one simple and robust approach to create pseudo computed tomography (CT) images, which are used to generate attenuation coefficient maps to correct the PET attenuation. Recently, deep learning has been proposed and used as a promising technique to efficiently perform MR and various medical images segmentation. In this research work, deep learning guided segmentation approaches have been proposed to enhance the bone class segmentation of MR brain images in order to generate accurate pseudo-CT images. The first approach has introduced the combination of handcrafted features with deep learning features to enrich the set of features. Multiresolution analysis techniques, which generate multiscale and multidirectional coefficients of an image such as contourlet and shearlet transforms, are applied and combined with deep convolutional neural network (CNN) features. Different experiments have been conducted to investigate the number of selected coefficients and the insertion location of the handcrafted features. The second approach aims at reducing the segmentation algorithm’s complexity while maintaining the segmentation performance. An attention based convolutional encode-decoder network has been proposed to adaptively recalibrate the deep network features. This attention based network consists of two different squeeze and excitation blocks that excite the features spatially and channel wise. The two blocks are combined sequentially to decrease the number of network’s parameters and reduces the model complexity. The third approach has been focuses on the application of transfer learning from different MR sequences such as T1 weighted (T1-w) and T2 weighted (T2-w) images. A pretrained model with T1-w MR sequences is fine tuned to perform the segmentation of T2-w images. Multiple fine tuning approaches and experiments have been conducted to study the best fine tuning mechanism that is able to build an efficient segmentation model for both T1-w and T2-w segmentation. Clinical datasets of fifty patients with different conditions and diagnosis have been used to carry an objective evaluation to measure the segmentation performance of the results obtained by the three proposed methods. The first and second approaches have been validated with other studies in the literature that applied deep network based segmentation technique to perform MR based attenuation correction for PET images. The proposed methods have shown an enhancement in the bone segmentation with an increase of dice similarity coefficient (DSC) from 0.6179 to 0.6567 using an ensemble of CNNs with an improvement percentage of 6.3%. The proposed excitation-based CNN has decreased the model complexity by decreasing the number of trainable parameters by more than 46% where less computing resources are required to train the model. The proposed hybrid transfer learning method has shown its superiority to build a multi-sequences (T1-w and T2-w) segmentation approach compared to other applied transfer learning methods especially with the bone class where the DSC is increased from 0.3841 to 0.5393. Moreover, the hybrid transfer learning approach requires less computing time than transfer learning using open and conservative fine tuning

Brunel University Research Archive