3 research outputs found
PSD Estimation and Source Separation in a Noisy Reverberant Environment using a Spherical Microphone Array
In this paper, we propose an efficient technique for estimating individual
power spectral density (PSD) components, i.e., PSD of each desired sound source
as well as of noise and reverberation, in a multi-source reverberant sound
scene with coherent background noise. We formulate the problem in the spherical
harmonics domain to take the advantage of the inherent orthogonality of the
spherical harmonics basis functions and extract the PSD components from the
cross-correlation between the different sound field modes. We also investigate
an implementation issue that occurs at the nulls of the Bessel functions and
offer an engineering solution. The performance evaluation takes place in a
practical environment with a commercial microphone array in order to measure
the robustness of the proposed algorithm against all the deviations incurred in
practice. We also exhibit an application of the proposed PSD estimator through
a source septation algorithm and compare the performance with a contemporary
method in terms of different objective measures
Spatial dissection of a soundfield using spherical harmonic decomposition
A real-world soundfield is often contributed by multiple desired and undesired sound sources. The performance of many acoustic systems such as automatic speech recognition, audio surveillance, and teleconference relies on its ability to extract the desired sound components in such a mixed environment. The existing solutions to the above problem are constrained by various fundamental limitations and require to enforce different priors depending on the acoustic condition such as reverberation and spatial distribution of sound sources. With the growing emphasis and integration of audio applications in diverse technologies such as smart home and virtual reality appliances, it is imperative to advance the source separation technology in order to overcome the limitations of the traditional approaches.
To that end, we exploit the harmonic decomposition model to dissect a mixed soundfield into its underlying desired and undesired components based on source and signal characteristics. By analysing the spatial projection of a soundfield, we achieve multiple outcomes such as (i) soundfield separation with respect to distinct source regions, (ii) source separation in a mixed soundfield using modal coherence model, and (iii) direction of arrival (DOA) estimation of multiple overlapping sound sources through pattern recognition of the modal coherence of a soundfield.
We first employ an array of higher order microphones for soundfield separation in order to reduce hardware requirement and implementation complexity. Subsequently, we develop novel mathematical models for modal coherence of noisy and reverberant soundfields that facilitate convenient ways for estimating DOA and power spectral densities leading to robust source separation algorithms. The modal domain approach to the soundfield/source separation allows us to circumvent several practical limitations of the existing techniques and enhance the performance and robustness of the system. The proposed methods are presented with several practical applications and performance evaluations using simulated and real-life dataset