9 research outputs found

    Haar-Wavelet-Based Just Noticeable Distortion Model for Transparent Watermark

    Get PDF
    Watermark transparency is required mainly for copyright protection. Based on the characteristics of human visual system, the just noticeable distortion (JND) can be used to verify the transparency requirement. More specifically, any watermarks whose intensities are less than the JND values of an image can be added without degrading the visual quality. It takes extensive experimentations for an appropriate JND model. Motivated by the texture masking effect and the spatial masking effect, which are key factors of JND, Chou and Li (1995) proposed the well-known full-band JND model for the transparent watermark applications. In this paper, we propose a novel JND model based on discrete wavelet transform. Experimental results show that the performance of the proposed JND model is comparable to that of the full-band JND model. However, it has the advantage of saving a lot of computation time; the speed is about 6 times faster than that of the full-band JND model

    Low bit-rate image sequence coding

    Get PDF

    Mode decision for the H.264/AVC video coding standard

    Get PDF
    H.264/AVC video coding standard gives us a very promising future for the field of video broadcasting and communication because of its high coding efficiency compared with other older video coding standards. However, high coding efficiency also carries high computational complexity. Fast motion estimation and fast mode decision are two very useful techniques which can significantly reduce computational complexity. This thesis focuses on the field of fast mode decision. The goal of this thesis is that for very similar RD performance compared with H.264/AVC video coding standard, we aim to find new fast mode decision techniques which can afford significant time savings. [Continues.

    Feature extraction for image quality prediction

    Get PDF

    Content-prioritised video coding for British Sign Language communication.

    Get PDF
    Video communication of British Sign Language (BSL) is important for remote interpersonal communication and for the equal provision of services for deaf people. However, the use of video telephony and video conferencing applications for BSL communication is limited by inadequate video quality. BSL is a highly structured, linguistically complete, natural language system that expresses vocabulary and grammar visually and spatially using a complex combination of facial expressions (such as eyebrow movements, eye blinks and mouth/lip shapes), hand gestures, body movements and finger-spelling that change in space and time. Accurate natural BSL communication places specific demands on visual media applications which must compress video image data for efficient transmission. Current video compression schemes apply methods to reduce statistical redundancy and perceptual irrelevance in video image data based on a general model of Human Visual System (HVS) sensitivities. This thesis presents novel video image coding methods developed to achieve the conflicting requirements for high image quality and efficient coding. Novel methods of prioritising visually important video image content for optimised video coding are developed to exploit the HVS spatial and temporal response mechanisms of BSL users (determined by Eye Movement Tracking) and the characteristics of BSL video image content. The methods implement an accurate model of HVS foveation, applied in the spatial and temporal domains, at the pre-processing stage of a current standard-based system (H.264). Comparison of the performance of the developed and standard coding systems, using methods of video quality evaluation developed for this thesis, demonstrates improved perceived quality at low bit rates. BSL users, broadcasters and service providers benefit from the perception of high quality video over a range of available transmission bandwidths. The research community benefits from a new approach to video coding optimisation and better understanding of the communication needs of deaf people

    Visibility and acceptance of discrete-sampling artifacts in visual displays

    Get PDF
    Ph. D. ThesisDigital visual displays are aimed to provide an illusion of a continuous reality through a discrete presentation of visual information. This thesis explored three topics on (i) angular, (ii) spatial, and (iii) temporal sampling characteristics, related to distortion visibility, acceptance, and discomfort. In the first topic, we addressed the issue of optimizing the view density in continuous parallax visualization by replicating the changing views of a 3-D object for a moving observer. We measured the visibility of the related artifacts and evaluated the performance of full-reference visual quality metrics. We found that the state-of-the-art metrics can indirectly characterize artifact visibility and established a quantitative relationship for threshold estimation on varying conditions. The second topic addressed the relation of the contrast sensitivity function (CSF) to adaptation luminance and specifically its asymptotic behavior at high light levels essential to modern high-luminance displays. Using a custombuilt system, we measured the CSF at relatively high luminance levels and spatial frequency range, integrating our dataset to the existing research. We found a gradual transition among the linear to DeVries-Rose to Weber regions with steeper slopes for higher frequencies and lower luminance. A further decreasing region was located at low to intermediate frequencies. Following this construct, we adopted a model consisting of central elements in the visual signal processing and proposed an eight-parameter form for the CSF in the luminance domain. The final topic addressed the effects of frame rate on distortion acceptance and its impact on visual discomfort during regular display use. We assessed the perceived symptoms, preference, and task performance under varying conditions. The measurements indicated that for nondemanding everyday tasks, the frame rate could be reasonably reduced without severe effects on the observer; however, this tolerance diminished under more dynamic content. A potential association of discomfort with the blinking activity was also discussedThe European Union’s Horizon 2020 research and innovation program under the Marie Skłodowska-Curie grant agreement No 676401, European Training Network on Full Parallax Imaging and Huawei Technologies Co., Lt

    Compaction of C-band synthetic aperture radar based sea ice information for navigation in the Baltic Sea

    Get PDF
    In this work operational sea ice synthetic aperture radar (SAR) data products were improved and developed. A SAR instrument is transmitting electromagnetic radiation at certain wavelengths and measures the radiation which is scattered back towards the instrument from the target, in our case sea and sea ice. The measured backscattering is converted to an image describing the target area through complex signal processing. The images, however, differ from optical images, i.e. photographs, and their visual interpretation is not straightforward. The main idea in this work has been to deliver the essential SAR-based sea ice information to end-users (typically on ships) in a compact and user-friendly format. The operational systems at Finnish Institute of Marine Research (FIMR) are currently based on the data received from a Canadian SAR-satellite, Radarsat-1. The operational sea ice classification, developed by the author with colleagues, has been further developed. One problem with the SAR data is typically that the backscattering varies depending on the incidence angle. The incidence angle is the angle in which the transmitted electromagnetic wave meets the target surface and it varies within each SAR image and between different SAR images depending on the measuring geometry. To improve this situation, an incidence angle correction algorithm to normalize the backscattering over the SAR incidence angle range for Baltic Sea ice has been developed as part of this work. The algorithm is based on SAR backscattering statistics over the Baltic Sea. To locate different sea ice areas in SAR images, a SAR segmentation algorithm based on pulse-coupled neural networks has been developed and tested. The parameters have been tuned suitable for the operational data in use at FIMR. The sea ice classification is based on this segmentation and the classification is segment-wise rather than pixel-wise. To improve SAR-based distinguishing between sea ice and open water an open water detection algorithm based on segmentation and local autocorrelation has been developed. Also ice type classification based on higher-order statistics and independent component analysis have been studied to get an improved SAR-based ice type classification. A compression algorithm for compressing sea ice SAR data for visual use has been developed. This algorithm is based on the wavelet decomposition, zero-tree structure and arithmetic coding. Also some properties of the human visual system were utilized. This algorithm was developed to produce smaller compressed SAR images, with a reasonable visual quality. The transmission of the compressed images to ships with low-speed data connections in reasonable time is then possible. One of the navigationally most important sea ice parameters is the ice thickness. SAR-based ice thickness estimation has been developed and evaluated as part of this work. This ice thickness estimation method uses the ice thickness history derived from digitized ice charts, made daily at the Finnish Ice Service, as its input, and updates this chart based on the novel SAR data. The result is an ice thickness chart representing the ice situation at the SAR acquisition time in higher resolution than in the manually made ice thickness charts. For the evaluation of the results a helicopter-borne ice thickness measuring instrument, based on electromagnetic induction and laser altimeter, was used.reviewe

    Scalable video compression with optimized visual performance and random accessibility

    Full text link
    This thesis is concerned with maximizing the coding efficiency, random accessibility and visual performance of scalable compressed video. The unifying theme behind this work is the use of finely embedded localized coding structures, which govern the extent to which these goals may be jointly achieved. The first part focuses on scalable volumetric image compression. We investigate 3D transform and coding techniques which exploit inter-slice statistical redundancies without compromising slice accessibility. Our study shows that the motion-compensated temporal discrete wavelet transform (MC-TDWT) practically achieves an upper bound to the compression efficiency of slice transforms. From a video coding perspective, we find that most of the coding gain is attributed to offsetting the learning penalty in adaptive arithmetic coding through 3D code-block extension, rather than inter-frame context modelling. The second aspect of this thesis examines random accessibility. Accessibility refers to the ease with which a region of interest is accessed (subband samples needed for reconstruction are retrieved) from a compressed video bitstream, subject to spatiotemporal code-block constraints. We investigate the fundamental implications of motion compensation for random access efficiency and the compression performance of scalable interactive video. We demonstrate that inclusion of motion compensation operators within the lifting steps of a temporal subband transform incurs a random access penalty which depends on the characteristics of the motion field. The final aspect of this thesis aims to minimize the perceptual impact of visible distortion in scalable reconstructed video. We present a visual optimization strategy based on distortion scaling which raises the distortion-length slope of perceptually significant samples. This alters the codestream embedding order during post-compression rate-distortion optimization, thus allowing visually sensitive sites to be encoded with higher fidelity at a given bit-rate. For visual sensitivity analysis, we propose a contrast perception model that incorporates an adaptive masking slope. This versatile feature provides a context which models perceptual significance. It enables scene structures that otherwise suffer significant degradation to be preserved at lower bit-rates. The novelty in our approach derives from a set of "perceptual mappings" which account for quantization noise shaping effects induced by motion-compensated temporal synthesis. The proposed technique reduces wavelet compression artefacts and improves the perceptual quality of video
    corecore