Search CORE

104 research outputs found

DeAR: A Deep-learning-based Audio Re-recording Resilient Watermarking

Author: Fang Han
Liu Chang
Ma Zehua
Yu Nenghai
Zhang Jie
Zhang Weiming
Publication venue
Publication date: 13/12/2022
Field of study

Audio watermarking is widely used for leaking source tracing. The robustness of the watermark determines the traceability of the algorithm. With the development of digital technology, audio re-recording (AR) has become an efficient and covert means to steal secrets. AR process could drastically destroy the watermark signal while preserving the original information. This puts forward a new requirement for audio watermarking at this stage, that is, to be robust to AR distortions. Unfortunately, none of the existing algorithms can effectively resist AR attacks due to the complexity of the AR process. To address this limitation, this paper proposes DeAR, a deep-learning-based audio re-recording resistant watermarking. Inspired by DNN-based image watermarking, we pioneer a deep learning framework for audio carriers, based on which the watermark signal can be effectively embedded and extracted. Meanwhile, in order to resist the AR attack, we delicately analyze the distortions that occurred in the AR process and design the corresponding distortion layer to cooperate with the proposed watermarking framework. Extensive experiments show that the proposed algorithm can resist not only common electronic channel distortions but also AR distortions. Under the premise of high-quality embedding (SNR=25.86dB), in the case of a common re-recording distance (20cm), the algorithm can effectively achieve an average bit recovery accuracy of 98.55%.Comment: Accepted by AAAI202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

A Review of Audio Features and Statistical Models Exploited for Voice Pattern Design

Author: Duong Hien-Thanh
Duong Ngoc Q. K.
Publication venue
Publication date: 24/02/2015
Field of study

Audio fingerprinting, also named as audio hashing, has been well-known as a powerful technique to perform audio identification and synchronization. It basically involves two major steps: fingerprint (voice pattern) design and matching search. While the first step concerns the derivation of a robust and compact audio signature, the second step usually requires knowledge about database and quick-search algorithms. Though this technique offers a wide range of real-world applications, to the best of the authors' knowledge, a comprehensive survey of existing algorithms appeared more than eight years ago. Thus, in this paper, we present a more up-to-date review and, for emphasizing on the audio signal processing aspect, we focus our state-of-the-art survey on the fingerprint design step for which various audio features and their tractable statistical models are discussed.Comment: http://www.iaria.org/conferences2015/PATTERNS15.html ; Seventh International Conferences on Pervasive Patterns and Applications (PATTERNS 2015), Mar 2015, Nice, Franc

arXiv.org e-Print Archive

Semi fragile audio crypto-watermarking based on sparse sampling with partially decomposed Haar matrix structure

Author: Bhatt Mahabaleswara Ram
D. D. Geetha
Jayarani Appadurai Electa Alice
Publication venue: University of Szeged, Institute of Informatics
Publication date: 01/01/2020
Field of study

In the recent era the growth of technology is tremendous and at the same time, the misuse of technology is also increasing with an equal scale. Thus the owners have to protect the multimedia data from the malicious and piracy. This has led the researchers to the new era of cryptography and watermarking. In the traditional security algorithm for the audio, the algorithm is implemented on the digital data after the traditional analog to digital conversion. But in this article, we propose the crypto – watermarking algorithm based on sparse sampling to be implemented during the analog to digital conversion process only. The watermark is generated by exploiting the structure of HAAR transform. The performance of the algorithm is tested on various audio signals and the obtained SNR is greater than 30dB and the algorithm results in good robustness against various signal attacks such as echo addition, noise addition, reverberation etc

University of Szeged

An acoustic data transmission system based on audio data hiding: method and performance evaluation

Author: BS Ko
D Kirovski
H Matsuoka
HS Malvar
HS Yun
HS Yun
Jae Choi
JJ Garcia-Hernandez
K Cho
K Mizutani
Kiho Cho
KK Paliwal
MR Schroeder
N Lazic
Nam Soo Kim
SG McGovern
Y Nakashima
Y Xiang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Implementation of the DSSS method in watermarking digital audio objects

Author: Gajewski Piotr
Piotrowski Zbigniew
Wojtuń Jarosław
Publication venue: 'Uniwersytetu Marii Curie-Sklodowskiej w Lublinie'
Publication date: 01/01/2011
Field of study

The paper presents the results of implementation in the Matlab environment for watermarking embedder and extractor based on the Direct Sequence Spread Spectrum (DSSS). A block diagram of watermarking system, an analysis of watermarked signal reproduced as well as watermarking system robustness to degrading factors: lossy compression, signal-to-noise ratio (SNR) as well as a change in sampling frequency, were shown

Biblioteka Nauki - repozytorium artykuÅÃ³w

Crossref

University of Maria Curie-Skłodowska (UMCS): Scientific e-Journals / Uniwersytet Marii Curie-Skłodowskiej: e-czasopisma naukowe

Evaluation of Audio Compression Artifacts

Author: Martinez M. Herrera
Publication venue: 'Czech Technical University in Prague - Central Library'
Publication date: 01/01/2007
Field of study

This paper deals with subjective evaluation of audio-coding systems. From this evaluation, it is found that, depending on the type of signal and the algorithm of the audio-coding system, different types of audible errors arise. These errors are called coding artifacts. Although three kinds of artifacts are perceivable in the auditory domain, the author proposes that in the coding domain there is only one common cause for the appearance of the artifact, inefficient tracking of transient-stochastic signals. For this purpose, state-of-the art audio coding systems use a wide range of signal processing techniques, including application of the wavelet transform, which is described here.

Directory of Open Access Journals

CTU Open Journal Systems (Czech Technical University, Prague / České vysoké učení technické v Praze)

A study on number theoretic construction and prediction of two dimensional acoustic diffusers for architectural applications

Author: Döşemeciler Ayçe
Publication venue: İzmir Institute of Technology
Publication date: 01/01/2011
Field of study

Thesis (Doctoral)--İzmir Institute of Technology, Architecture, İzmir, 2011Includes bibliographical references (leaves: 72-77)Text in English; Abstract: Turkish and Englishxiv, 172 leavesDefined as the scattering of sound independent from angle, optimum diffusion is very important for the perception of musical sound. For this purpose, Schroeder used mathematical number sequences to propose ʼreflection phase grating diffusersʼ, of two main types: Single plane or one-dimensional (1D) diffusers that scatter sound into a hemi-disc, and two dimensional (2D) diffusers that scatter into a hemisphere to disperse strong specular reflections without removing sound energy from the space, which is the main advantage of these devices. Currently, two methods are used to design 2D diffusers:Product Array and Folding Array Methods. Both are based on number theory and used methodologically in the field of acoustics, producing results that offer limited diffusion characteristics and design solutions for a variety of architectural spaces ranging from concert halls to recording studios where Schroeder diffusers are widely used. This dissertation proposes Distinct Sums Property Method originally devised for watermarking digital images, to construct adoptable 2D diffusers through number theoretical construction and prediction. At first, quadratic residue sequence based on prime number 7 is selected according to its autocorrelation properties as the Fourier transform of good autocorrelation properties gives an even scattered energy distribution. Then Distinct Sums Property Method is applied to construct 2D arrays from this sequence from which well depths and widths are calculated. Third, the aimed scattering and diffusion properties of the modeled 2D diffuser are predicted by Boundary Element Method which gives approximate results in accordance with the measurements based on Audio Engineering Society Standards. Fourth, polar responses (i.e. the scattering diagrams for specific angles) in each octave band frequency are obtained. Finally, predicted diffusion coefficients for uniform scattering are calculated and compared to the reference flat surfaceʼs coefficients and previous studieʼs results

Real-time Loudspeaker Distance Estimation with Stereo Audio

Author: Gaubitch Nikolay
Heusdens Richard
Jensen Søren Holdt
Jensen Tobias Lindstrøm
Martinez Jorge
Nielsen Jesper Kjær
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/09/2015
Field of study

Knowledge on how a number of loudspeakers are positioned relative to a listening position can be used to enhance the listening experience. Usually, these loudspeaker positions are estimated using calibration signals, either audible or psycho-acoustically hidden inside the desired audio signal. In this paper, we propose to use the desired audio signal instead. Specifically, we treat the case of estimating the distance between two loudspeakers playing back a stereo music or speech signal. In this connection, we develop a real-time maximum likelihood estimator and demonstrate that it has a variance in the millimetre range in a real environment for even a modest sampling frequency

CiteSeerX

Crossref

VBN