54 research outputs found

Perceptual Echo Control and Delay Estimation

Author: Sakhnov Kirill
Simak Boris
Verteletskay Ekaterina
Publication venue: 'IntechOpen'
Publication date: 05/07/2011

An investigation of the utility of monaural sound source separation via nonnegative matrix factorization applied to acoustic echo and reverberation mitigation for hands-free telephony

Author: Cahill Niall M.
Publication date: 01/02/2012

In this thesis we investigate the applicability and utility of Monaural Sound Source Separation (MSSS) via Nonnegative Matrix Factorization (NMF) for various problems related to audio for hands-free telephony. We first investigate MSSS via NMF as an alternative acoustic echo reduction approach to existing approaches such as Acoustic Echo Cancellation (AEC). To this end, we present the single-channel acoustic echo problem as an MSSS problem, in which the objective is to extract the user's signal from a mixture also containing acoustic echo and noise. To perform separation, NMF is used to decompose the near-end microphone signal onto the union of two nonnegative bases in the magnitude Short Time Fourier Transform domain. One of these bases is for the spectral energy of the acoustic echo signal, and is formed from the incoming far-end user’s speech, while the other basis is for the spectral energy of the near-end speaker, and is trained with speech data a priori. In comparison to AEC, the speaker extraction approach obviates Double-Talk Detection (DTD), and is demonstrated to attain its maximal echo mitigation performance immediately upon initiation and to maintain that performance during and after room changes for similar computational requirements. Speaker extraction is also shown to introduce distortion of the near-end speech signal during double-talk, which is quantified by means of a speech distortion measure and compared to that of AEC. Subsequently, we address DTD for block-based AEC algorithms. We propose a novel block-based DTD algorithm that uses the available signals and the estimate of the echo signal that is produced by NMF-based speaker extraction to compute a suitably normalized correlation-based decision variable, which is compared to a fixed threshold to decide on double-talk.
Using a standard evaluation technique, the proposed algorithm is shown to have comparable detection performance to an existing conventional block-based DTD algorithm. It is also demonstrated to inherit the room change insensitivity of speaker extraction, with the proposed DTD algorithm generating minimal false double-talk indications upon initiation and in response to room changes in comparison to the existing conventional DTD. We also show that this property allows its paired AEC to converge at a rate close to the optimum. Another focus of this thesis is the problem of inverting a single measurement of a non-minimum phase Room Impulse Response (RIR). We describe the process by which perceptually detrimental all-pass phase distortion arises in reverberant speech filtered by the inverse of the minimum phase component of the RIR; in short, such distortion arises from inverting the magnitude response of the high-Q maximum phase zeros of the RIR. We then propose two novel partial inversion schemes that precisely mitigate this distortion. One of these schemes employs NMF-based MSSS to separate the all-pass phase distortion from the target speech in the magnitude STFT domain, while the other approach modifies the inverse minimum phase filter such that the magnitude response of the maximum phase zeros of the RIR is not fully compensated. Subjective listening tests reveal that the proposed schemes generally produce better quality output speech than a comparable inversion technique.

MURAL - Maynooth University Research Archive Library

Irish Universities

NUI Maynooth Eprint Archive

Maynooth University ePrints and eTheses Archive

Residual echo signal in critically sampled subband acoustic echo cancellers based on IIR and FIR filter banks

Author: Baykal B
Chambers JA
Constantinides AG
Tanrikulu O
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/1997

Published version

Crossref

Loughborough University Institutional Repository

Spiral - Imperial College Digital Repository

Adaptive Algorithms for Intelligent Acoustic Interfaces

Author: COMMINIELLO DANILO
Publication date: 16/04/2012

Modern speech communications are evolving towards a new direction which involves users in a more perceptive way. That is the immersive experience, which may be considered as the “last-mile” problem of telecommunications. One of the main features of immersive communications is distant-talking, i.e. hands-free (in the broad sense) speech communication without body-worn or tethered microphones that takes place in a multisource environment where interfering signals may degrade the communication quality and the intelligibility of the desired speech source. In order to preserve speech quality, intelligent acoustic interfaces may be used. An intelligent acoustic interface may comprise multiple microphones and loudspeakers, and its peculiarity is to model the acoustic channel in order to adapt to user requirements and to environment conditions. This is the reason why intelligent acoustic interfaces are based on adaptive filtering algorithms. The acoustic path modelling entails a set of problems which have to be taken into account in designing an adaptive filtering algorithm. Such problems may be basically generated by a linear or a nonlinear process and can be tackled respectively by linear or nonlinear adaptive algorithms. In this work we consider such modelling problems and we propose novel effective adaptive algorithms that allow acoustic interfaces to be robust against any interfering signals, thus preserving the perceived quality of desired speech signals. As regards linear adaptive algorithms, a class of adaptive filters based on the sparse nature of the acoustic impulse response has been recently proposed. We adopt this class of adaptive filters, named proportionate adaptive filters, and derive a general framework from which it is possible to derive any linear adaptive algorithm. Using this framework we also propose some efficient proportionate adaptive algorithms, expressly designed to tackle problems of a linear nature.
On the other hand, in order to address problems deriving from a nonlinear process, we propose a novel filtering model which performs nonlinear transformations by means of functional links. Using this nonlinear model, we propose functional link adaptive filters which provide an efficient solution to the modelling of a nonlinear acoustic channel. Finally, we introduce robust filtering architectures based on adaptive combinations of filters that allow acoustic interfaces to more effectively adapt to environment conditions, thus providing a powerful means for immersive speech communications.

Archivio della ricerca- Università di Roma La Sapienza

Stereophonic hands-free communication system based on microphone array fixed beamforming: real-time implementation and evaluation

Author: A Gilloire
C Wun
E Ferrara
F Bettarelli
Francesco Piazza
H Buchner
H Chen
HS Malvar
J Benesty
J Herre
JA Swets
JJ Shynk
L Gabrielli
L Romoli
Laura Romoli
M Ali
M Brandstein
M Kallinger
M Pirro
MA Iqbal
Matteo Pirro
MMS Doclo
N Tashev
P Oak
S Doclo
S Haykin
SL Gay
Stefano Squartini
T Fawcett
W Chen
W Herbordt
W Hoeg
W Kellerman
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Adaptive Algorithms for Intelligent Acoustic Interfaces

Author: COMMINIELLO DANILO
Publication date: 16/04/2012

Pubblicazioni Aperte Digitali Interateneo Sapienza

Archivio della ricerca- Università di Roma La Sapienza

Single-channel acoustic echo cancellation in noise based on gradient-based adaptive filtering

Author: AWH Khong
B Widrow
C Beaugeant
C Breining
E Hänsler
F Guangzeng
F Lindstrom
G Schmidt
H Yasukawa
JS Garofolo
JS Lim
M Berouti
M Omair Ahmad
M Yukawa
R Martin
R Nath
R Topa
S Boll
S Haykin
S Wu
Shaikh Anowarul Fattah
SM Kuo
SV Vaseghi
U Mahbub
Upal Mahbub
V Myllylä
Wei-Ping Zhu
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Multichannel Bin-Wise Robust Frequency-Domain Adaptive Filtering and Its Application to Adaptive Beamforming

Author: Herbert Buchner
Satoshi Nakamura
Walter Kellermann
Wolfgang Herbordt
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'

Crossref

On the applicability of models for outdoor sound (A)

Author: Rasmussen Karsten Bo
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/01/1999

Crossref

Online Research Database In Technology