2,278 research outputs found

An adaptive stereo basis method for convolutive blind audio source separation

Author: Abdallah
Aharon
Amari
Araki
Bell
Cardoso
Davies
Douglas
Emmanuel Vincent
Hyvärinen
Ikeda
Ikram
Jafari
Jourjine
Knapp
Kurita
Lewicki
Makino
Maria G. Jafari
Mark D. Plumbley
Matsuda
Mike E. Davies
Mitianoudis
O’Grady
Parra
Samer A. Abdallah
Saruwatari
Sawada
Schmidt
Smaragdis
Torkkola
Vincent
Viste
Yilmaz
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/01/2008

NOTICE: this is the author’s version of a work that was accepted for publication in Neurocomputing. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms, may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. A definitive version was subsequently published in Neurocomputing, [71, 10-12, June 2008] DOI:neucom.2007.08.02

Crossref

UCL Discovery

Edinburgh Research Explorer

Queen Mary Research Online

Multichannel Speech Separation and Enhancement Using the Convolutive Transfer Function

Author: Gannot Sharon
Girin Laurent
Horaud Radu
Li Xiaofei
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 26/02/2018

This paper addresses the problem of speech separation and enhancement from multichannel convolutive and noisy mixtures, assuming known mixing filters. We propose to perform the speech separation and enhancement task in the short-time Fourier transform domain, using the convolutive transfer function (CTF) approximation. Compared to time-domain filters, the CTF has far fewer taps; consequently, it has fewer near-common zeros among channels and lower computational complexity. The work proposes three speech-source recovery methods, namely: i) the multichannel inverse filtering method, i.e. the multiple input/output inverse theorem (MINT), is exploited in the CTF domain, and for the multi-source case, ii) a beamforming-like multichannel inverse filtering method applying single-source MINT and using power minimization, which is suitable whenever the source CTFs are not all known, and iii) a constrained Lasso method, where the sources are recovered by minimizing the $\ell_1$-norm to impose their spectral sparsity, with the constraint that the $\ell_2$-norm fitting cost, between the microphone signals and the mixing model involving the unknown source signals, is less than a tolerance. The noise can be reduced by setting a tolerance on the noise power. Experiments under various acoustic conditions are carried out to evaluate the three proposed methods. The comparison between them as well as with the baseline methods is presented. Comment: Submitted to IEEE/ACM Transactions on Audio, Speech and Language Processing

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Hal-Diderot

HAL-Rennes 1

Doubly sparse models for multiple filter estimation in sparse echoic environments

Author: Arberet Simon
Gribonval Rémi
Sudhakar Prasad
Vandergheynst Pierre
Publication venue: HAL CCSD
Publication date: 10/12/2012

We consider the estimation of multiple time-domain sparse filters from echoic mixtures of several unknown sources, when the sources are sparse in the time-frequency domain. We propose a sparse filter estimation framework consisting of two steps: a) a clustering step to group, for each source, the time-frequency points of the mixtures where only that source is active; b) a convex optimisation step to estimate the filters based on a time-frequency domain cross-relation. We propose a new wideband formulation of a frequency-domain cross-relation, in addition to the one based on the classical narrowband approximation. The solutions of the convex optimisation problem, formed using the cross-relation, are characterised. Numerical evaluation shows the benefit of using the wideband cross-relation for sparse echoic filter estimation. Further, the potential of the proposed framework for blind estimation of sparse echoic filters is demonstrated in a controlled experimental setting, wherein the proposed approach outperforms state-of-the-art blind filter estimation techniques when the filters are sufficiently sparse.

INRIA a CCSD electronic archive server

The influence of random element displacement on DOA estimates obtained with (Khatri-Rao-)root-MUSIC

Author: Inghelbrecht Veronique
Rogier Hendrik
Van Hecke Tanja
Verhaevert Jo
Publication venue: 'MDPI AG'
Publication date: 01/01/2014

Although a wide range of direction of arrival (DOA) estimation algorithms has been described for a diverse range of array configurations, no specific stochastic analysis framework has been established to assess the probability density function of the error on DOA estimates due to random errors in the array geometry. Therefore, we propose a stochastic collocation method that relies on a generalized polynomial chaos expansion to connect the statistical distribution of random position errors to the resulting distribution of the DOA estimates. We apply this technique to the conventional root-MUSIC and the Khatri-Rao-root-MUSIC methods. Compared with Monte-Carlo simulations, this novel approach yields a speedup by a factor of more than 100 in terms of CPU time for a one-dimensional case and by a factor of 56 for a two-dimensional case.

Ghent University Academic Bibliography

Directory of Open Access Journals

PubMed Central

GPR Detection of Buried Symmetrically Shaped Mine-like Objects using Selective Independent Component Analysis

Author: Jakobsen Kaj Bjarne
Karlsen Brian
Larsen Jan
Sørensen Helge Bjarup Dissing
Publication venue
Publication date: 01/01/2003

Online Research Database In Technology