21 research outputs found
Audio Watermarking Scheme in MDCT Domain
In our proposed watermark embedding system, the original audio is segmented into overlapping frames. The psychoacoustic auditory model has been utilized to calculate the global threshold in modified discrete cosine transform domain. The perceptual insignificant locations have been used to insert the appropriately scaled watermark in the transform domain. Blind detection of watermark has been performed. Simulation results indicate that the proposed watermarking system is perceptually transparent and robust against various kind of attacks such as AWGN addition and MP3 compression
Reviews on Technology and Standard of Spatial Audio Coding
Market demands on a more impressive entertainment media have motivated for delivery of three dimensional (3D) audio content to home consumers through Ultra High Definition TV (UHDTV), the next generation of TV broadcasting, where spatial audio coding plays fundamental role. This paper reviews fundamental concept on spatial audio coding which includes technology, standard, and application. Basic principle of object-based audio reproduction system will also be elaborated, compared to the traditional channel-based system, to provide good understanding on this popular interactive audio reproduction system which gives end users flexibility to render their own preferred audio composition.Keywords : spatial audio, audio coding, multi-channel audio signals, MPEG standard, object-based audi
Signal-Adaptive and Perceptually Optimized Sound Zones with Variable Span Trade-Off Filters
Creating sound zones has been an active research field since the idea was
first proposed. So far, most sound zone control methods rely on either an
optimization of physical metrics such as acoustic contrast and signal
distortion or a mode decomposition of the desired sound field. By using these
types of methods, approximately 15 dB of acoustic contrast between the
reproduced sound field in the target zone and its leakage to other zone(s) has
been reported in practical set-ups, but this is typically not high enough to
satisfy the people inside the zones. In this paper, we propose a sound zone
control method shaping the leakage errors so that they are as inaudible as
possible for a given acoustic contrast. The shaping of the leakage errors is
performed by taking the time-varying input signal characteristics and the human
auditory system into account when the loudspeaker control filters are
calculated. We show how this shaping can be performed using variable span
trade-off filters, and we show theoretically how these filters can be used for
trading signal distortion in the target zone for acoustic contrast. The
proposed method is evaluated based on physical metrics such as acoustic
contrast and perceptual metrics such as STOI. The computational complexity and
processing time of the proposed method for different system set-ups are also
investigated. Lastly, the results of a MUSHRA listening test are reported. The
test results show that the proposed method provides more than 20% perceptual
improvement compared to existing sound zone control methods.Comment: Accepted for publication in IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH,
AND LANGUAGE PROCESSIN