14 research outputs found
Regression and Classification for Direction-of-Arrival Estimation with Convolutional Recurrent Neural Networks
We present a novel learning-based approach to estimate the
direction-of-arrival (DOA) of a sound source using a convolutional recurrent
neural network (CRNN) trained via regression on synthetic data and Cartesian
labels. We also describe an improved method to generate synthetic data to train
the neural network using state-of-the-art sound propagation algorithms that
model specular as well as diffuse reflections of sound. We compare our model
against three other CRNNs trained using different formulations of the same
problem: classification on categorical labels, and regression on spherical
coordinate labels. In practice, our model achieves up to 43% decrease in
angular error over prior methods. The use of diffuse reflection results in 34%
and 41% reduction in angular prediction errors on LOCATA and SOFA datasets,
respectively, over prior methods based on image-source methods. Our method
results in an additional 3% error reduction over prior schemes that use
classification based networks, and we use 36% fewer network parameters
Reflection-Aware Sound Source Localization
We present a novel, reflection-aware method for 3D sound localization in
indoor environments. Unlike prior approaches, which are mainly based on
continuous sound signals from a stationary source, our formulation is designed
to localize the position instantaneously from signals within a single frame. We
consider direct sound and indirect sound signals that reach the microphones
after reflecting off surfaces such as ceilings or walls. We then generate and
trace direct and reflected acoustic paths using inverse acoustic ray tracing
and utilize these paths with Monte Carlo localization to estimate a 3D sound
source position. We have implemented our method on a robot with a cube-shaped
microphone array and tested it against different settings with continuous and
intermittent sound signals with a stationary or a mobile source. Across
different settings, our approach can localize the sound with an average
distance error of 0.8m tested in a room of 7m by 7m area with 3m height,
including a mobile and non-line-of-sight sound source. We also reveal that the
modeling of indirect rays increases the localization accuracy by 40% compared
to only using direct acoustic rays.Comment: Submitted to ICRA 2018. The working video is available at
(https://youtu.be/TkQ36lMEC-M
Virtual reconstruction of indoor acoustics in cathedrals: the case of the cathedral of Granada
Virtual acoustics provides a highly useful tool for the investigation into the influence that spatial transformations may exert on indoor acoustics of cathedrals, which are remarkable spaces due to their heritage value, complexity and multifunctional character. The spatial organization of cathedrals is primarily governed by the location of the choir, which represents the main musical expression. Following various reforms, certain European cathedrals undertook a relocation of the choir stalls from their original position. The Cathedral of Granada is a highly significant case. Since its original construction, three major changes have occurred due to the relocation of the choir. In this article, simulation of indoor acoustics is employed to recover the soundfield in each of these three configurations. An extensive analysis compares the results of the main acoustic parameters in each of the virtual reconstruction models. Consequently, acoustic models are created and then calibrated based on a campaign of onsite measurements