3,358 research outputs found
Acoustic Speaker Localization with Strong Reverberation and Adaptive Feature Filtering with a Bayes RFS Framework
The thesis investigates the challenges of speaker localization in presence of strong reverberation, multi-speaker tracking, and multi-feature multi-speaker state filtering, using sound recordings from microphones. Novel reverberation-robust speaker localization algorithms are derived from the signal and room acoustics models. A multi-speaker tracking filter and a multi-feature multi-speaker state filter are developed based upon the generalized labeled multi-Bernoulli random finite set framework. Experiments and comparative studies have verified and demonstrated the benefits of the proposed methods
Localization, Detection and Tracking of Multiple Moving Sound Sources with a Convolutional Recurrent Neural Network
This paper investigates the joint localization, detection, and tracking of sound events using a convolutional recurrent neural network (CRNN). We use a CRNN previously proposed for the localization and detection of stationary sources, and show that the recurrent layers enable the spatial tracking of moving sources when trained with dynamic scenes. The tracking performance of the CRNN is compared with a stand-alone tracking method that combines a multi-source (DOA) estimator and a particle filter. Their respective performance is evaluated in various acoustic conditions such as anechoic and reverberant scenarios, stationary and moving sources at several angular velocities, and with a varying number of overlapping sources. The results show that the CRNN manages to track multiple sources more consistently than the parametric method across acoustic scenarios, but at the cost of higher localization error.202
Locating and extracting acoustic and neural signals
This dissertation presents innovate methodologies for locating, extracting, and separating multiple incoherent sound sources in three-dimensional (3D) space; and applications of the time reversal (TR) algorithm to pinpoint the hyper active neural activities inside the brain auditory structure that are correlated to the tinnitus pathology. Specifically, an acoustic modeling based method is developed for locating arbitrary and incoherent sound sources in 3D space in real time by using a minimal number of microphones, and the Point Source Separation (PSS) method is developed for extracting target signals from directly measured mixed signals. Combining these two approaches leads to a novel technology known as Blind Sources Localization and Separation (BSLS) that enables one to locate multiple incoherent sound signals in 3D space and separate original individual sources simultaneously, based on the directly measured mixed signals. These technologies have been validated through numerical simulations and experiments conducted in various non-ideal environments where there are non-negligible, unspecified sound reflections and reverberation as well as interferences from random background noise. Another innovation presented in this dissertation is concerned with applications of the TR algorithm to pinpoint the exact locations of hyper-active neurons in the brain auditory structure that are directly correlated to the tinnitus perception. Benchmark tests conducted on normal rats have confirmed the localization results provided by the TR algorithm. Results demonstrate that the spatial resolution of this source localization can be as high as the micrometer level. This high precision localization may lead to a paradigm shift in tinnitus diagnosis, which may in turn produce a more cost-effective treatment for tinnitus than any of the existing ones
Proceedings of the EAA Spatial Audio Signal Processing symposium: SASP 2019
International audienc
Direction of Arrival Estimation Using Microphone Array Processing for Moving Humanoid Robots
The auditory system of humanoid robots has gained increased attention in
recent years. This system typically acquires the surrounding sound field by
means of a microphone array. Signals acquired by the array are then processed
using various methods. One of the widely applied methods is direction of
arrival estimation. The conventional direction of arrival estimation methods
assume that the array is fixed at a given position during the estimation.
However, this is not necessarily true for an array installed on a moving
humanoid robot. The array motion, if not accounted for appropriately, can
introduce a significant error in the estimated direction of arrival. The
current paper presents a signal model that takes the motion into account. Based
on this model, two processing methods are proposed. The first one compensates
for the motion of the robot. The second method is applicable to periodic
signals and utilizes the motion in order to enhance the performance to a level
beyond that of a stationary array. Numerical simulations and an experimental
study are provided, demonstrating that the motion compensation method almost
eliminates the motion-related error. It is also demonstrated that by using the
motion-based enhancement method it is possible to improve the direction of
arrival estimation performance, as compared to that obtained when using a
stationary array
- …