Search CORE

915 research outputs found

An adaptive stereo basis method for convolutive blind audio source separation

Author: Abdallah
Abdallah
Aharon
Amari
Amari
Araki
Bell
Cardoso
Cardoso
Cardoso
Davies
Douglas
Emmanuel Vincent
Hyvärinen
Ikeda
Ikram
Jafari
Jourjine
Knapp
Kurita
Lewicki
Makino
Maria G. Jafari
Mark D. Plumbley
Matsuda
Mike E. Davies
Mitianoudis
Mitianoudis
O’Grady
Parra
Samer A. Abdallah
Saruwatari
Sawada
Schmidt
Smaragdis
Torkkola
Vincent
Vincent
Viste
Yilmaz
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/01/2008
Field of study

NOTICE: this is the author’s version of a work that was accepted for publication in Neurocomputing. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in PUBLICATION, [71, 10-12, June 2008] DOI:neucom.2007.08.02

Crossref

UCL Discovery

Edinburgh Research Explorer

Queen Mary Research Online

Differential fast fixed-point algorithms for underdetermined instantaneous and convolutive partial blind source separation

Author: Deville Y.
Hosseini Shahram
Thomas J.
Publication venue
Publication date: 01/01/2007
Field of study

This paper concerns underdetermined linear instantaneous and convolutive blind source separation (BSS), i.e., the case when the number of observed mixed signals is lower than the number of sources.We propose partial BSS methods, which separate supposedly nonstationary sources of interest (while keeping residual components for the other, supposedly stationary, "noise" sources). These methods are based on the general differential BSS concept that we introduced before. In the instantaneous case, the approach proposed in this paper consists of a differential extension of the FastICA method (which does not apply to underdetermined mixtures). In the convolutive case, we extend our recent time-domain fast fixed-point C-FICA algorithm to underdetermined mixtures. Both proposed approaches thus keep the attractive features of the FastICA and C-FICA methods. Our approaches are based on differential sphering processes, followed by the optimization of the differential nonnormalized kurtosis that we introduce in this paper. Experimental tests show that these differential algorithms are much more robust to noise sources than the standard FastICA and C-FICA algorithms.Comment: this paper describes our differential FastICA-like algorithms for linear instantaneous and convolutive underdetermined mixture

arXiv.org e-Print Archive

CiteSeerX

HAL-INSU

HAL-IRD

Penalty function-based joint diagonalization approach for convolutive blind separation of nonstationary sources

Author: Jonathon Chambers (1251609)
Saeid Sanei (7207403)
Wenwu Wang (4352767)
Publication venue
Publication date: 01/01/2004
Field of study

A new approach for convolutive blind source separation (BSS) by explicitly exploiting the second-order nonstationarity of signals and operating in the frequency domain is proposed. The algorithm accommodates a penalty function within the cross-power spectrum-based cost function and thereby converts the separation problem into a joint diagonalization problem with unconstrained optimization. This leads to a new member of the family of joint diagonalization criteria and a modification of the search direction of the gradient-based descent algorithm. Using this approach, not only can the degenerate solution induced by a unmixing matrix and the effect of large errors within the elements of covariance matrices at low-frequency bins be automatically removed, but in addition, a unifying view to joint diagonalization with unitary or nonunitary constraint is provided. Numerical experiments are presented to verify the performance of the new method, which show that a suitable penalty function may lead the algorithm to a faster convergence and a better performance for the separation of convolved speech signals, in particular, in terms of shape preservation and amplitude ambiguity reduction, as compared with the conventional second-order based algorithms for convolutive mixtures that exploit signal nonstationarity

CiteSeerX

Loughborough University Institutional Repository

Online Research @ Cardiff

Surrey Research Insight

Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function

Author: Gannot Sharon
Girin Laurent
Horaud Radu
Li Xiaofei
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 26/02/2018
Field of study

This paper addresses the problem of speech separation and enhancement from multichannel convolutive and noisy mixtures, \emph{assuming known mixing filters}. We propose to perform the speech separation and enhancement task in the short-time Fourier transform domain, using the convolutive transfer function (CTF) approximation. Compared to time-domain filters, CTF has much less taps, consequently it has less near-common zeros among channels and less computational complexity. The work proposes three speech-source recovery methods, namely: i) the multichannel inverse filtering method, i.e. the multiple input/output inverse theorem (MINT), is exploited in the CTF domain, and for the multi-source case, ii) a beamforming-like multichannel inverse filtering method applying single source MINT and using power minimization, which is suitable whenever the source CTFs are not all known, and iii) a constrained Lasso method, where the sources are recovered by minimizing the

\ell_1

-norm to impose their spectral sparsity, with the constraint that the

\ell_2

-norm fitting cost, between the microphone signals and the mixing model involving the unknown source signals, is less than a tolerance. The noise can be reduced by setting a tolerance onto the noise power. Experiments under various acoustic conditions are carried out to evaluate the three proposed methods. The comparison between them as well as with the baseline methods is presented.Comment: Submitted to IEEE/ACM Transactions on Audio, Speech and Language Processin

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Hal-Diderot

Reverberant Audio Source Separation via Sparse and Low-Rank Modeling

Author: Arberet Simon
Vandergheynst Pierre
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 11/12/2013
Field of study

The performance of audio source separation from underdetermined convolutive mixture assuming known mixing filters can be significantly improved by using an analysis sparse prior optimized by a reweighting l1 scheme and a wideband datafidelity term, as demonstrated by a recent article. In this letter, we show that the performance can be improved even more significantly by exploiting a low-rank prior on the source spectrograms.We present a new algorithm to estimate the sources based on i) an analysis sparse prior, ii) a reweighting scheme so as to increase the sparsity, iii) a wideband data-fidelity term in a constrained form, and iv) a low-rank constraint on the source spectrograms. Evaluation on reverberant music mixtures shows that the resulting algorithm improves state-of-the-art methods by more than 2 dB of signal-to-distortion ratio

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne