3 research outputs found
Adaptive Variable Degree-k Zero-Trees for Re-Encoding of Perceptually Quantized Wavelet-Packet Transformed Audio and High Quality Speech
A fast, efficient and scalable algorithm is proposed, in this paper, for
re-encoding of perceptually quantized wavelet-packet transform (WPT)
coefficients of audio and high quality speech and is called "adaptive variable
degree-k zero-trees" (AVDZ). The quantization process is carried out by taking
into account some basic perceptual considerations, and achieves good subjective
quality with low complexity. The performance of the proposed AVDZ algorithm is
compared with two other zero-tree-based schemes comprising: 1- Embedded
Zero-tree Wavelet (EZW) and 2- The set partitioning in hierarchical trees
(SPIHT). Since EZW and SPIHT are designed for image compression, some
modifications are incorporated in these schemes for their better matching to
audio signals. It is shown that the proposed modifications can improve their
performance by about 15-25%. Furthermore, it is concluded that the proposed
AVDZ algorithm outperforms these modified versions in terms of both output
average bit-rates and computation times.Comment: 30 pages (Double space), 15 figures, 5 tables, ISRN Signal Processing
(in Press
Frame-synchronous Blind Audio Watermarking for Tamper Proofing and Self-Recovery
This paper presents a lifting wavelet transform (LWT)-based blind audio watermarking scheme designed for tampering detection and self-recovery. Following 3-level LWT decomposition of a host audio, the coefficients in selected subbands are first partitioned into frames for watermarking. To suit different purposes of the watermarking applications, binary information is packed into two groups: frame-related data are embedded in the approximation subband using rational dither modulation; the source-channel coded bit sequence of the host audio is hidden inside the 2nd and 3rd -detail subbands using 2N-ary adaptive quantization index modulation. The frame-related data consists of a synchronization code used for frame alignment and a composite message gathered from four adjacent frames for content authentication. To endow the proposed watermarking scheme with a self-recovering capability, we resort to hashing comparison to identify tampered frames and adopt a Reed–Solomon code to correct symbol errors. The experiment results indicate that the proposed watermarking scheme can accurately locate and recover the tampered regions of the audio signal. The incorporation of the frame synchronization mechanism enables the proposed scheme to resist against cropping and replacement attacks, all of which were unsolvable by previous watermarking schemes. Furthermore, as revealed by the perceptual evaluation of audio quality measures, the quality degradation caused by watermark embedding is merely minor. With all the aforementioned merits, the proposed scheme can find various applications for ownership protection and content authentication
Recommended from our members
An enhanced psychoacoustic model based on the discrete wavelet packet transform
The perception of acoustic information by humans is based on the detailed temporal and spectral analysis provided by the auditory processing of the received signal. The incorporation of this process in psychoacoustical computational models has contributed significantly both in the development of highly efficient audio compression schemes as well as in effective audio watermarking methods. In this paper, we present an approach based on the discrete wavelet packet transform, which closely mimics the multi-resolution properties of the human ear and also includes simultaneous and temporal auditory masking. Experimental results show that the proposed technique offers better masking capabilities and it reduces the signal-to-masking ratio when compared to related approaches, without introducing audible distortion. Those results have implications that are important both for audio compression by permitting further bit rate reduction, and for watermarking by providing greater signal space for information hiding