An optimal factor analysis approach to improve the wavelet-based image resolution enhancement techniques

Author: Jenkins Karl W.
Witwit Wasnaa
Zhao Yifan
Zhao Yitian
Publication venue: Global Journal of Computer Science and Technology
Publication date: 17/12/2016

The existing wavelet-based image resolution enhancement techniques rest on many assumptions, such as restrictions on how low-resolution images are generated and on the choice of wavelet function, which limit their application in different fields. This paper first identifies the factors that affect the performance of these techniques and quantitatively evaluates the impact of the existing assumptions. An approach called Optimal Factor Analysis, which employs a genetic algorithm, is then introduced to increase the applicability and fidelity of the existing methods. Moreover, a new Figure of Merit is proposed to assist the selection of parameters and to better measure overall performance. The experimental results show that the proposed approach improves the performance of the selected image resolution enhancement methods and has the potential to be extended to other methods.

Cranfield CERES
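
The abstract above describes using a genetic algorithm to select enhancement parameters against a figure of merit. As a minimal sketch of that general idea only, the following searches a two-parameter space with elitism, uniform crossover and Gaussian mutation. The score function `toy_figure_of_merit`, the parameter ranges, and all settings are assumptions made for illustration; none of them come from the paper.

```python
import random


def toy_figure_of_merit(params):
    # Hypothetical stand-in score that peaks at (0.3, 0.7).
    # This is NOT the Figure of Merit proposed in the paper.
    x, y = params
    return -((x - 0.3) ** 2 + (y - 0.7) ** 2)


def genetic_search(score, n_params=2, pop_size=20, generations=40, seed=0):
    """Maximise `score` over parameter vectors in [0, 1]^n_params."""
    rng = random.Random(seed)
    pop = [[rng.random() for _ in range(n_params)] for _ in range(pop_size)]
    history = []  # best score seen at the start of each generation
    for _ in range(generations):
        pop.sort(key=score, reverse=True)
        history.append(score(pop[0]))
        elite = pop[: pop_size // 4]  # the best quarter survives unchanged
        children = []
        while len(elite) + len(children) < pop_size:
            a, b = rng.sample(elite, 2)
            # Uniform crossover: each gene is taken from one of the two parents.
            child = [rng.choice(pair) for pair in zip(a, b)]
            # Small Gaussian mutation, clipped back into [0, 1].
            child = [min(1.0, max(0.0, g + rng.gauss(0, 0.05))) for g in child]
            children.append(child)
        pop = elite + children
    return max(pop, key=score), history


best, history = genetic_search(toy_figure_of_merit)
```

Because the elite individuals are carried over unchanged, the best score never decreases from one generation to the next. In a real setting the score function would run the enhancement method with the candidate parameters and evaluate the result.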

Image enhancement from a stabilised video sequence

Author: Scarmana Gabriel
Publication venue: International Society for Photogrammetry and Remote Sensing (ISPRS)
Publication date: 26/08/2012

The aim of video stabilisation is to create a new video sequence in which the motions (i.e. rotations, translations) and scale differences between frames (or parts of a frame) have effectively been removed. These stabilisation effects can be obtained via digital video processing techniques that use information extracted from the video sequence itself, with no need for additional hardware or knowledge of the camera's physical motion. A video sequence usually contains a large overlap between successive frames, and regions of the same scene are sampled at different positions. In this paper, this multiple sampling is combined to achieve images with a higher spatial resolution. Higher-resolution imagery plays an important role in assisting the identification of people, vehicles, structures or objects of interest captured by surveillance cameras or by video cameras used in face recognition, traffic monitoring, traffic law enforcement, driver assistance and automatic vehicle guidance systems.

University of Southern Queensland ePrints

Investigation of a new method for improving image resolution for camera tracking applications

Author: Alzarok Hamza
Fletcher Simon
Longstaff Andrew P.
Myers Alan
Publication venue: EUSPEN
Publication date: 17/03/2015

Camera-based systems have been a preferred choice in many motion tracking applications due to their ease of installation and their ability to work in unprepared environments. These systems work by extracting image information (colour and shape properties) to detect the object location. However, the resolution of the image and the camera field-of-view (FOV) are two main factors that can restrict the tracking applications for which these systems can be used. Resolution can be addressed partially by using higher-resolution cameras, but this may not always be possible or cost effective. This research paper investigates a new method that averages offset images to improve the effective resolution using a standard camera. The initial results show that the minimum detectable position change of a tracked object could be improved by up to 4 times.

University of Huddersfield Repository

Huddersfield Research Portal
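
The Huddersfield abstract above refers to averaging offset images to raise effective resolution. The sketch below illustrates that general idea in one dimension with a simple shift-and-add scheme; it is not the authors' method. The functions `capture` and `shift_and_add`, the step-edge scene, and the assumption that the sub-pixel offsets are known exactly are all hypothetical choices made for illustration.

```python
def capture(scene, factor, offset):
    """Simulate a low-resolution capture of a finely sampled 1-D scene.

    Each low-res pixel averages `factor` consecutive fine samples,
    and the pixel grid starts at fine index `offset`.
    """
    return [
        sum(scene[i:i + factor]) / factor
        for i in range(offset, len(scene) - factor + 1, factor)
    ]


def shift_and_add(captures, factor, length):
    """Fuse (offset, pixels) captures on the fine grid.

    Each low-res pixel is spread back over the fine samples it covered,
    and overlapping contributions are averaged.
    """
    total = [0.0] * length
    count = [0] * length
    for offset, pixels in captures:
        for k, value in enumerate(pixels):
            start = offset + k * factor
            for i in range(start, start + factor):
                total[i] += value
                count[i] += 1
    return [t / c if c else 0.0 for t, c in zip(total, count)]


def mse(estimate, reference):
    """Mean squared error between two equally long sequences."""
    return sum((e - r) ** 2 for e, r in zip(estimate, reference)) / len(reference)


# A step edge on a fine grid, observed with pixels four fine samples wide.
scene = [0.0] * 6 + [1.0] * 10
captures = [(offset, capture(scene, 4, offset)) for offset in range(4)]

single = shift_and_add(captures[:1], 4, len(scene))  # one capture only
fused = shift_and_add(captures, 4, len(scene))       # four offset captures
```

In this toy case the fused estimate follows the edge more closely than the single capture does, which `mse(fused, scene) < mse(single, scene)` confirms. A real system would also have to estimate or control the offsets and deal with noise.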

UG^2: a Video Benchmark for Assessing the Impact of Image Restoration and Enhancement on Automatic Visual Recognition

Author: Banerjee Sreya
Grm Klemen
Scheirer Walter J.
Struc Vitomir
Vidal Rosaura G.
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 06/02/2018

Advances in image restoration and enhancement techniques have led to discussion about how such algorithms can be applied as a pre-processing step to improve automatic visual recognition. In principle, techniques like deblurring and super-resolution should yield improvements by de-emphasizing noise and increasing signal in an input image. But the historically divergent goals of the computational photography and visual recognition communities have created a significant need for more work in this direction. To facilitate new research, we introduce a new benchmark dataset called UG^2, which contains three difficult real-world scenarios: uncontrolled videos taken by UAVs and manned gliders, as well as controlled videos taken on the ground. Over 160,000 annotated frames for hundreds of ImageNet classes are available, which are used for baseline experiments that assess the impact of known and unknown image artifacts and other conditions on common deep learning-based object classification approaches. Further, current image restoration and enhancement techniques are evaluated by determining whether or not they improve baseline classification performance. Results show that there is plenty of room for algorithmic innovation, making this dataset a useful tool going forward.
Comment: Supplemental material: https://goo.gl/vVM1xe, Dataset: https://goo.gl/AjA6En, CVPR 2018 Prize Challenge: ug2challenge.or

arXiv.org e-Print Archive

Crossref

Learning Spatial and Temporal Visual Enhancement

Author: Lai Wei-Sheng
Publication venue: eScholarship, University of California
Publication date: 01/01/2019

Visual enhancement is concerned with improving the visual quality and viewing experience of images and videos. Researchers have been actively working in this area due to its theoretical and practical interest. However, high visual quality often comes at the cost of computational efficiency. With the growth of mobile applications and cloud services, it is crucial to develop effective and efficient algorithms for generating visually attractive images and videos. In this thesis, we address visual enhancement problems in three aspects: the spatial, the temporal, and the joint spatial-temporal domains. We propose efficient algorithms based on deep convolutional neural networks for solving various visual enhancement problems.
First, we address the problem of spatial enhancement for single-image super-resolution. We propose a deep Laplacian Pyramid Network to reconstruct a high-resolution image from a low-resolution input in a coarse-to-fine manner. Our model directly extracts features from input LR images and progressively reconstructs the sub-band residuals. We train the proposed model with multi-scale training, deep supervision, and robust loss functions to achieve state-of-the-art performance. Furthermore, we exploit the recursive learning technique to share parameters across and within pyramid levels to significantly reduce the number of model parameters. As most of the operations are performed in a low-resolution space, our model requires less memory and runs faster than state-of-the-art methods.
Second, we address the temporal enhancement problem by learning the temporal consistency in videos. Given an input video and a per-frame processed video (processed by an existing image-based algorithm), we learn a recurrent network to reduce the temporal flickering and generate a temporally consistent video. We train the proposed network by minimizing both short-term and long-term temporal losses as well as a perceptual loss to strike a balance between temporal coherence and perceptual similarity with the processed frames. At test time, our model does not require computing optical flow and thus runs at 400+ FPS on GPU for high-resolution videos. Our model is task independent: a single model can handle multiple and unseen tasks, including but not limited to artistic style transfer, enhancement, colorization, image-to-image translation and intrinsic image decomposition.
Third, we address the spatial-temporal enhancement problem for video stitching. Inspired by pushbroom cameras, we cast the stitching as a spatial interpolation problem. We propose a pushbroom stitching network to learn dense flow fields to smoothly align the input videos. The stitched videos can be generated from an efficient pushbroom interpolation layer. Our approach generates more temporally stable and visually pleasing results than existing video stitching approaches and commercial software. Furthermore, our algorithm has immediate applications in many areas such as virtual reality, immersive telepresence, autonomous driving, and video surveillance.

eScholarship - University of California
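
The thesis abstract above mentions reconstructing "sub-band residuals" in a coarse-to-fine manner within a Laplacian pyramid. The sketch below shows only the classical Laplacian pyramid decomposition and reconstruction on a 1-D signal, to illustrate what such residuals are. It is not the thesis's network, which learns to predict the residuals. The pair-averaging downsampler, the sample-repetition upsampler, and the requirement that the signal length be divisible by 2 ** levels are assumptions chosen for simplicity.

```python
def downsample(signal):
    """Halve the length by averaging neighbouring pairs."""
    return [(signal[i] + signal[i + 1]) / 2 for i in range(0, len(signal) - 1, 2)]


def upsample(signal):
    """Double the length by repeating every sample."""
    return [value for value in signal for _ in range(2)]


def build_pyramid(signal, levels):
    """Return the coarsest signal and the per-level residuals, finest first.

    Assumes len(signal) is divisible by 2 ** levels.
    """
    residuals = []
    current = list(signal)
    for _ in range(levels):
        coarse = downsample(current)
        predicted = upsample(coarse)
        # The residual is the detail lost by going one level coarser.
        residuals.append([c - p for c, p in zip(current, predicted)])
        current = coarse
    return current, residuals


def reconstruct(coarsest, residuals):
    """Go coarse-to-fine: upsample, then add back that level's residual."""
    current = list(coarsest)
    for residual in reversed(residuals):
        current = [p + r for p, r in zip(upsample(current), residual)]
    return current


signal = [4, 8, 6, 2, 1, 3, 9, 7]
coarsest, residuals = build_pyramid(signal, levels=2)
restored = reconstruct(coarsest, residuals)
```

With the true residuals available, `restored` matches `signal`. In a super-resolution setting only the coarse input is known, so the residuals at each level must be estimated.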

Spatiotemporal super-resolution for low bitrate H.264 video

Author: Anantrasirichai N
Canagarajah CN
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 26/09/2010

Crossref

Explore Bristol Research

Sub-pixel resolving optofluidic microscope for on-chip cell imaging

Author: Lee Seung Ah
Yang Changhuei
Yang Samuel
Zheng Guoan
Publication venue: Royal Society of Chemistry (RSC)
Publication date: 01/01/2010

We report the implementation of a fully on-chip, lensless, sub-pixel resolving optofluidic microscope (SROFM). The device utilizes microfluidic flow to deliver specimens directly across a complementary metal oxide semiconductor (CMOS) sensor to generate a sequence of low-resolution (LR) projection images, where resolution is limited by the sensor's pixel size. This image sequence is then processed with a pixel super-resolution algorithm to reconstruct a single high-resolution (HR) image, in which features beyond the Nyquist rate of the LR images are resolved. We demonstrate the device's capabilities by imaging microspheres, the protist Euglena gracilis, and Entamoeba invadens cysts with sub-cellular resolution, and establish that our prototype has a resolution limit of 0.75 microns. Furthermore, we apply the same pixel super-resolution algorithm to reconstruct HR videos in which the dynamic interaction between the fluid and the sample, including the in-plane and out-of-plane rotation of the sample within the flow, can be monitored in high resolution. We believe that the combination of pixel super-resolution and optofluidic microscopy techniques within our SROFM is a significant step toward a simple, cost-effective, high-throughput and highly compact imaging solution for biomedical and bioscience needs.

Caltech Authors

Video-rate computational super-resolution and integral imaging at longwave-infrared wavelengths

Author: Carles Guillem
Harvey Andrew R.
Preciado Miguel A.
Publication date: 11/08/2017

We report the first computationally super-resolved, multi-camera integral imaging at long-wave infrared (LWIR) wavelengths. A synchronized array of FLIR Lepton cameras was assembled, and computational super-resolution and integral-imaging reconstruction were employed to generate video with light-field imaging capabilities, such as 3D imaging and recognition of partially obscured objects, while also providing a four-fold increase in effective pixel count. This approach to high-resolution imaging enables a fundamental reduction in the track length and volume of an imaging system, while also enabling the use of low-cost lens materials.
Comment: Supplementary multimedia material in http://dx.doi.org/10.6084/m9.figshare.530302

arXiv.org e-Print Archive

Enlighten

Light field super resolution through controlled micro-shifts of light field sensor

Author: Gunturk Bahadir K.
Mukati M. Umair
Publication date: 01/01/2018

Light field cameras enable new capabilities, such as post-capture refocusing and aperture control, by capturing the directional and spatial distribution of light rays in space. The micro-lens array based light field camera design is often preferred due to its light transmission efficiency, cost-effectiveness and compactness. One drawback of micro-lens array based light field cameras is low spatial resolution, because a single sensor is shared to capture both spatial and angular information. To address the low spatial resolution issue, we present a light field imaging approach in which multiple light fields are captured and fused to improve the spatial resolution. For each capture, the light field sensor is shifted by a pre-determined fraction of a micro-lens size using an XY translation stage for optimal performance.

arXiv.org e-Print Archive

İstanbul Medipol University Institutional Repository