Search CORE

5,999 research outputs found

A Class of Optimal Rectangular Filtering Matrices for Single-Channel Signal Enhancement in the Time Domain

Author: Benesty Jacob
Chen Jingdong
Christensen Mads Græsbøll
Jensen Jesper Rindom
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2013
Field of study

Crossref

VBN

A Class of Optimal Rectangular Filtering Matrices for Single-Channel Signal Enhancement in the Time Domain

Author: Jacob Benesty
Jesper Rindom Jensen
Jingdong Chen
Mads Graesboll Christensen
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date
Field of study

Crossref

An LCMV Filter for Single-Channel Noise Cancellation and Reduction in the Time Domain

Author: Benesty Jacob
Chen Jingdong
Christensen Mads Græsbøll
Jensen Jesper Rindom
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2013
Field of study

Crossref

VBN

A brief overview of speech enhancement with linear filtering

Author: Benesty Jacob
Chen Jingdong
Christensen Mads Græsbøll
Jensen Jesper Rindom
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Abstract In this paper, we provide an overview of some recently introduced principles and ideas for speech enhancement with linear filtering and explore how these are related and how they can be used in various applications. This is done in a general framework where the speech enhancement problem is stated as a signal vector estimation problem, i.e., with a filter matrix, where the estimate is obtained by means of a matrix-vector product of the filter matrix and the noisy signal vector. In this framework, minimum distortion, minimum variance distortionless response (MVDR), tradeoff, maximum signal-to-noise ratio (SNR), and Wiener filters are derived from the conventional speech enhancement approach and the recently introduced orthogonal decomposition approach. For each of the filters, we derive their properties in terms of output SNR and speech distortion. We then demonstrate how the ideas can be applied to single- and multichannel noise reduction in both the time and frequency domains as well as binaural noise reduction.</jats:p

Crossref

Springer - Publisher Connector

VBN

Structured Sparsity Models for Multiparty Speech Recovery from Reverberant Recordings

Author: Asaei Afsaneh
Bourlard Hervé
Cevher Volkan
Golbabaee Mohammad
Publication venue
Publication date: 01/01/2012
Field of study

We tackle the multi-party speech recovery problem through modeling the acoustic of the reverberant chambers. Our approach exploits structured sparsity models to perform room modeling and speech recovery. We propose a scheme for characterizing the room acoustic from the unknown competing speech sources relying on localization of the early images of the speakers by sparse approximation of the spatial spectra of the virtual sources in a free-space model. The images are then clustered exploiting the low-rank structure of the spectro-temporal components belonging to each source. This enables us to identify the early support of the room impulse response function and its unique map to the room geometry. To further tackle the ambiguity of the reflection ratios, we propose a novel formulation of the reverberation model and estimate the absorption coefficients through a convex optimization exploiting joint sparsity model formulated upon spatio-spectral sparsity of concurrent speech representation. The acoustic parameters are then incorporated for separating individual speech signals through either structured sparse recovery or inverse filtering the acoustic channels. The experiments conducted on real data recordings demonstrate the effectiveness of the proposed approach for multi-party speech recovery and recognition.Comment: 31 page

arXiv.org e-Print Archive

Edinburgh Research Explorer

A brief overview of speech enhancement with linear filtering

Author: B Kollmeier
GH Golub
H Huang
J Benesty
J Benesty
J Benesty
J Benesty
J Benesty
J Benesty
J Benesty
J Benesty
J Chen
J Chen
J Chen
J Freudenberger
JR Jensen
JR Jensen
K Hermus
LR Rabiner
M Dendrinos
M Souden
P Loizou
P Vary
R Martin
RC Hendriks
RJ McAulay
S Boll
S Doclo
S Srinivasan
SH Jensen
T Long
T Lotter
Y Ephraim
Y Ephraim
Y Hu
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Two-Channel Speech Enhancement and Implementation Considerations: Noise Reduction and Speech Quality

Author: Hammer Martin
Kjeldsen Thomas Lynge
Kjærgaard Jacob Barsøe
Publication venue: Department of Electronic Systems
Publication date: 01/01/2007
Field of study

VBN

Blind MultiChannel Identification and Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function

Author: Gannot Sharon
Horaud Radu
Li Xiaofei
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 12/06/2017
Field of study

This paper addresses the problems of blind channel identification and multichannel equalization for speech dereverberation and noise reduction. The time-domain cross-relation method is not suitable for blind room impulse response identification, due to the near-common zeros of the long impulse responses. We extend the cross-relation method to the short-time Fourier transform (STFT) domain, in which the time-domain impulse responses are approximately represented by the convolutive transfer functions (CTFs) with much less coefficients. The CTFs suffer from the common zeros caused by the oversampled STFT. We propose to identify CTFs based on the STFT with the oversampled signals and the critical sampled CTFs, which is a good compromise between the frequency aliasing of the signals and the common zeros problem of CTFs. In addition, a normalization of the CTFs is proposed to remove the gain ambiguity across sub-bands. In the STFT domain, the identified CTFs is used for multichannel equalization, in which the sparsity of speech signals is exploited. We propose to perform inverse filtering by minimizing the

\ell_1

-norm of the source signal with the relaxed

\ell_2

-norm fitting error between the micophone signals and the convolution of the estimated source signal and the CTFs used as a constraint. This method is advantageous in that the noise can be reduced by relaxing the

\ell_2

-norm to a tolerance corresponding to the noise power, and the tolerance can be automatically set. The experiments confirm the efficiency of the proposed method even under conditions with high reverberation levels and intense noise.Comment: 13 pages, 5 figures, 5 table

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Hal-Diderot

HAL-Rennes 1

The neural space: a physiologically inspired noise reduction strategy based on fractional derivatives

Author: Bleeck Stefan
Hu Hongmei
Sang Jinqiu
Winter I.M.
Wright M.C.M.
Publication venue
Publication date
Field of study

Southampton (e-Prints Soton)