Search CORE

43 research outputs found

Предельные биполярные последовательности для робастного маркирования цифровых аудиосигналов по методу лоскута

Author: Гофман Максим Викторович
Корниенко Анатолий Адамович
Publication venue: СПб ФИЦ РАН
Publication date: 17/03/2023
Field of study

Ensuring the robustness of digital audio watermarking under the influence of interference, various transformations and possible attacks is an urgent problem. One of the most used and fairly stable marking methods is the patchwork method. Its robustness is ensured by the use of expanding bipolar numerical sequences in the formation and embedding of a watermark in a digital audio and correlation detection in the detection and extraction of a watermark. An analysis of the patchwork method showed that the absolute values of the ratio of the maximum of the autocorrelation function (ACF) to its minimum for expanding bipolar sequences and extended marker sequences used in traditional digital watermarking approach 2 with high accuracy. This made it possible to formulate criteria for searching for special expanding bipolar sequences, which have improved correlation properties and greater robustness. The article developed a mathematical apparatus for searching and constructing limit-expanding bipolar sequences used in solving the problem of robust digital audio watermarking using the patchwork method. Limit bipolar sequences are defined as sequences whose autocorrelation functions have the maximum possible ratios of maximum to minimum in absolute value. Theorems and corollaries from them are formulated and proved: on the existence of an upper bound on the minimum values of autocorrelation functions of limit bipolar sequences and on the values of the first and second petals of the ACF. On this basis, a rigorous mathematical definition of limit bipolar sequences is given. A method for searching for the complete set of limit bipolar sequences based on rational search and a method for constructing limit bipolar sequences of arbitrary length using generating functions are developed. The results of the computer simulation of the assessment of the values of the absolute value of the ratio of the maximum to the minimum of the autocorrelation and cross-correlation functions of the studied bipolar sequences for blind reception are presented. It is shown that the proposed limit bipolar sequences are characterized by better correlation properties in comparison with the traditionally used bipolar sequences and are more robust.Обеспечение устойчивости маркирования цифровых аудиосигналов в условиях действия помех, различных преобразований и возможных атак является актуальной проблемой. Одним из наиболее используемых и достаточно устойчивых методов маркирования является метод лоскута. Его робастность обеспечивается применением расширяющих биполярных числовых последовательностей при формировании и внедрении маркера в цифровой аудиосигнал и корреляционного детектирования при обнаружении и извлечении маркерной последовательности. Анализ свойств биполярных последовательностей, реализуемых в методе лоскута, показал, что абсолютные значения величины отношения максимума автокорреляционной функции (АКФ) к её минимуму для расширяющих биполярных последовательностей и расширенных маркерных последовательностей, используемых при традиционном маркировании, с высокой точностью приближаются к 2. Это позволило сформулировать критерии для поиска специальных расширяющих биполярных последовательностей, обладающих улучшенными корреляционными свойствами и большей устойчивостью. В статье разработан математический аппарат для поиска и построения предельных расширяющих биполярных последовательностей, используемых при решении задачи робастного маркирования цифровых аудиосигналов по методу лоскута. Предельные биполярные последовательности определены как последовательности, у которых автокорреляционные функции обладают максимально возможными по абсолютному значению отношениями максимума к минимуму. Сформулированы и доказаны теоремы и следствия из них: о существовании верхней границы минимальных значений автокорреляционных функций предельных биполярных последовательностей и о значениях первого и второго лепестков АКФ. На этой основе дано строгое математическое определение предельных биполярных последовательностей. Разработаны метод поиска полного множества предельных биполярных последовательностей на основе рационального перебора и метод построения предельных биполярных последовательностей произвольной длины с использованием порождающих функций. Представлены результаты компьютерного моделирования по оценке значений абсолютной величины отношения максимума к минимуму автокорреляционной и взаимной корреляционных функций исследуемых биполярных последовательностей для слепого приема. Показано, что предложенные предельные биполярные последовательности характеризуются лучшими корреляционными свойствами в сравнении с традиционно используемыми биполярными последовательностями и обладают большей устойчивостью

Информатика и автоматизация

Предельные биполярные последовательности для робастного маркирования цифровых аудиосигналов по методу лоскута

Author: Anatolij Kornienko
Maksim Gofman
Publication venue: 'SPIIRAS'
Publication date: 01/03/2023
Field of study

Обеспечение устойчивости маркирования цифровых аудиосигналов в условиях действия помех, различных преобразований и возможных атак является актуальной проблемой. Одним из наиболее используемых и достаточно устойчивых методов маркирования является метод лоскута. Его робастность обеспечивается применением расширяющих биполярных числовых последовательностей при формировании и внедрении маркера в цифровой аудиосигнал и корреляционного детектирования при обнаружении и извлечении маркерной последовательности. Анализ свойств биполярных последовательностей, реализуемых в методе лоскута, показал, что абсолютные значения величины отношения максимума автокорреляционной функции (АКФ) к её минимуму для расширяющих биполярных последовательностей и расширенных маркерных последовательностей, используемых при традиционном маркировании, с высокой точностью приближаются к 2. Это позволило сформулировать критерии для поиска специальных расширяющих биполярных последовательностей, обладающих улучшенными корреляционными свойствами и большей устойчивостью. В статье разработан математический аппарат для поиска и построения предельных расширяющих биполярных последовательностей, используемых при решении задачи робастного маркирования цифровых аудиосигналов по методу лоскута. Предельные биполярные последовательности определены как последовательности, у которых автокорреляционные функции обладают максимально возможными по абсолютному значению отношениями максимума к минимуму. Сформулированы и доказаны теоремы и следствия из них: о существовании верхней границы минимальных значений автокорреляционных функций предельных биполярных последовательностей и о значениях первого и второго лепестков АКФ. На этой основе дано строгое математическое определение предельных биполярных последовательностей. Разработаны метод поиска полного множества предельных биполярных последовательностей на основе рационального перебора и метод построения предельных биполярных последовательностей произвольной длины с использованием порождающих функций. Представлены результаты компьютерного моделирования по оценке значений абсолютной величины отношения максимума к минимуму автокорреляционной и взаимной корреляционных функций исследуемых биполярных последовательностей для слепого приема. Показано, что предложенные предельные биполярные последовательности характеризуются лучшими корреляционными свойствами в сравнении с традиционно используемыми биполярными последовательностями и обладают большей устойчивостью

Directory of Open Access Journals

Journal of Telecommunications and Information Technology, 2011, nr 4

Author
Publication venue: 'National Institute of Telecommunications'
Publication date
Field of study

kwartalni

Biblioteka Cyfrowa Instytutu Łączności / National Institute of Telecomunications: Digital Library

Securing Multi-Layer Communications: A Signal Processing Approach

Author: Mao Yinian
Publication venue
Publication date: 17/07/2006
Field of study

Security is becoming a major concern in this information era. The development in wireless communications, networking technology, personal computing devices, and software engineering has led to numerous emerging applications whose security requirements are beyond the framework of conventional cryptography. The primary motivation of this dissertation research is to develop new approaches to the security problems in secure communication systems, without unduly increasing the complexity and cost of the entire system. Signal processing techniques have been widely applied in communication systems. In this dissertation, we investigate the potential, the mechanism, and the performance of incorporating signal processing techniques into various layers along the chain of secure information processing. For example, for application-layer data confidentiality, we have proposed atomic encryption operations for multimedia data that can preserve standard compliance and are friendly to communications and delegate processing. For multimedia authentication, we have discovered the potential key disclosure problem for popular image hashing schemes, and proposed mitigation solutions. In physical-layer wireless communications, we have discovered the threat of signal garbling attack from compromised relay nodes in the emerging cooperative communication paradigm, and proposed a countermeasure to trace and pinpoint the adversarial relay. For the design and deployment of secure sensor communications, we have proposed two sensor location adjustment algorithms for mobility-assisted sensor deployment that can jointly optimize sensing coverage and secure communication connectivity. Furthermore, for general scenarios of group key management, we have proposed a time-efficient key management scheme that can improve the scalability of contributory key management from O(log n) to O(log(log n)) using scheduling and optimization techniques. This dissertation demonstrates that signal processing techniques, along with optimization, scheduling, and beneficial techniques from other related fields of study, can be successfully integrated into security solutions in practical communication systems. The fusion of different technical disciplines can take place at every layer of a secure communication system to strengthen communication security and improve performance-security tradeoff

Digital Repository at the University of Maryland

Intelligent watermarking of long streams of document images

Author: Vellasques Eduardo
Publication venue: École de technologie supérieure
Publication date
Field of study

Digital watermarking has numerous applications in the imaging domain, including (but not limited to) fingerprinting, authentication, tampering detection. Because of the trade-off between watermark robustness and image quality, the heuristic parameters associated with digital watermarking systems need to be optimized. A common strategy to tackle this optimization problem formulation of digital watermarking, known as intelligent watermarking (IW), is to employ evolutionary computing (EC) to optimize these parameters for each image, with a computational cost that is infeasible for practical applications. However, in industrial applications involving streams of document images, one can expect instances of problems to reappear over time. Therefore, computational cost can be saved by preserving the knowledge of previous optimization problems in a separate archive (memory) and employing that memory to speedup or even replace optimization for future similar problems. That is the basic principle behind the research presented in this thesis. Although similarity in the image space can lead to similarity in the problem space, there is no guarantee of that and for this reason, knowledge about the image space should not be employed whatsoever. Therefore, in this research, strategies to appropriately represent, compare, store and sample from problem instances are investigated. The objective behind these strategies is to allow for a comprehensive representation of a stream of optimization problems in a way to avoid re-optimization whenever a previously seen problem provides solutions as good as those that would be obtained by reoptimization, but at a fraction of its cost. Another objective is to provide IW systems with a predictive capability which allows replacing costly fitness evaluations with cheaper regression models whenever re-optimization cannot be avoided. To this end, IW of streams of document images is first formulated as the problem of optimizing a stream of recurring problems and a Dynamic Particle Swarm Optimization (DPSO) technique is proposed to tackle this problem. This technique is based on a two-tiered memory of static solutions. Memory solutions are re-evaluated for every new image and then, the re-evaluated fitness distribution is compared with stored fitness distribution as a mean of measuring the similarity between both problem instances (change detection). In simulations involving homogeneous streams of bi-tonal document images, the proposed approach resulted in a decrease of 95% in computational burden with little impact in watermarking performace. Optimization cost was severely decreased by replacing re-optimizations with recall to previously seen solutions. After that, the problem of representing the stream of optimization problems in a compact manner is addressed. With that, new optimization concepts can be incorporated into previously learned concepts in an incremental fashion. The proposed strategy to tackle this problem is based on Gaussian Mixture Models (GMM) representation, trained with parameter and fitness data of all intermediate (candidate) solutions of a given problem instance. GMM sampling replaces selection of individual memory solutions during change detection. Simulation results demonstrate that such memory of GMMs is more adaptive and can thus, better tackle the optimization of embedding parameters for heterogeneous streams of document images when compared to the approach based on memory of static solutions. Finally, the knowledge provided by the memory of GMMs is employed as a manner of decreasing the computational cost of re-optimization. To this end, GMM is employed in regression mode during re-optimization, replacing part of the costly fitness evaluations in a strategy known as surrogate-based optimization. Optimization is split in two levels, where the first one relies primarily on regression while the second one relies primarily on exact fitness values and provide a safeguard to the whole system. Simulation results demonstrate that the use of surrogates allows for better adaptation in situations involving significant variations in problem representation as when the set of attacks employed in the fitness function changes. In general lines, the intelligent watermarking system proposed in this thesis is well adapted for the optimization of streams of recurring optimization problems. The quality of the resulting solutions for both, homogeneous and heterogeneous image streams is comparable to that obtained through full optimization but for a fraction of its computational cost. More specifically, the number of fitness evaluations is 97% smaller than that of full optimization for homogeneous streams and 95% for highly heterogeneous streams of document images. The proposed method is general and can be easily adapted to other applications involving streams of recurring problems

Espace ÉTS

Intelligent Circuits and Systems

Author
Publication venue: 'Informa UK Limited'
Publication date: 26/07/2021
Field of study

ICICS-2020 is the third conference initiated by the School of Electronics and Electrical Engineering at Lovely Professional University that explored recent innovations of researchers working for the development of smart and green technologies in the fields of Energy, Electronics, Communications, Computers, and Control. ICICS provides innovators to identify new opportunities for the social and economic benefits of society.　 This conference bridges the gap between academics and R&D institutions, social visionaries, and experts from all strata of society to present their ongoing research activities and foster research relations between them. It provides opportunities for the exchange of new ideas, applications, and experiences in the field of smart technologies and finding global partners for future collaboration. The ICICS-2020 was conducted in two broad categories, Intelligent Circuits & Intelligent Systems and Emerging Technologies in Electrical Engineering

Directory of Open Access Books (DOAB)

An investigation of the utility of monaural sound source separation via nonnegative matrix factorization applied to acoustic echo and reverberation mitigation for hands-free telephony

Author: Cahill Niall M.
Publication venue
Publication date: 01/02/2012
Field of study

In this thesis we investigate the applicability and utility of Monaural Sound Source Separation (MSSS) via Nonnegative Matrix Factorization (NMF) for various problems related to audio for hands-free telephony. We first investigate MSSS via NMF as an alternative acoustic echo reduction approach to existing approaches such as Acoustic Echo Cancellation (AEC). To this end, we present the single-channel acoustic echo problem as an MSSS problem, in which the objective is to extract the users signal from a mixture also containing acoustic echo and noise. To perform separation, NMF is used to decompose the near-end microphone signal onto the union of two nonnegative bases in the magnitude Short Time Fourier Transform domain. One of these bases is for the spectral energy of the acoustic echo signal, and is formed from the in- coming far-end user’s speech, while the other basis is for the spectral energy of the near-end speaker, and is trained with speech data a priori. In comparison to AEC, the speaker extraction approach obviates Double-Talk Detection (DTD), and is demonstrated to attain its maximal echo mitigation performance immediately upon initiation and to maintain that performance during and after room changes for similar computational requirements. Speaker extraction is also shown to introduce distortion of the near-end speech signal during double-talk, which is quantified by means of a speech distortion measure and compared to that of AEC. Subsequently, we address Double-Talk Detection (DTD) for block-based AEC algorithms. We propose a novel block-based DTD algorithm that uses the available signals and the estimate of the echo signal that is produced by NMF-based speaker extraction to compute a suitably normalized correlation-based decision variable, which is compared to a fixed threshold to decide on doubletalk. Using a standard evaluation technique, the proposed algorithm is shown to have comparable detection performance to an existing conventional block-based DTD algorithm. It is also demonstrated to inherit the room change insensitivity of speaker extraction, with the proposed DTD algorithm generating minimal false doubletalk indications upon initiation and in response to room changes in comparison to the existing conventional DTD. We also show that this property allows its paired AEC to converge at a rate close to the optimum. Another focus of this thesis is the problem of inverting a single measurement of a non- minimum phase Room Impulse Response (RIR). We describe the process by which percep- tually detrimental all-pass phase distortion arises in reverberant speech filtered by the inverse of the minimum phase component of the RIR; in short, such distortion arises from inverting the magnitude response of the high-Q maximum phase zeros of the RIR. We then propose two novel partial inversion schemes that precisely mitigate this distortion. One of these schemes employs NMF-based MSSS to separate the all-pass phase distortion from the target speech in the magnitude STFT domain, while the other approach modifies the inverse minimum phase filter such that the magnitude response of the maximum phase zeros of the RIR is not fully compensated. Subjective listening tests reveal that the proposed schemes generally produce better quality output speech than a comparable inversion technique

MURAL - Maynooth University Research Archive Library

Irish Universities

NUI Maynooth Eprint Archive

Maynooth University ePrints and eTheses Archive

An investigation of the utility of monaural sound source separation via nonnegative matrix factorization applied to acoustic echo and reverberation mitigation for hands-free telephony

Author: Cahill Niall M.
Publication venue
Publication date: 01/02/2012
Field of study

MURAL - Maynooth University Research Archive Library