347 research outputs found
Towards End-to-End Acoustic Localization using Deep Learning: from Audio Signal to Source Position Coordinates
This paper presents a novel approach for indoor acoustic source localization
using microphone arrays and based on a Convolutional Neural Network (CNN). The
proposed solution is, to the best of our knowledge, the first published work in
which the CNN is designed to directly estimate the three dimensional position
of an acoustic source, using the raw audio signal as the input information
avoiding the use of hand crafted audio features. Given the limited amount of
available localization data, we propose in this paper a training strategy based
on two steps. We first train our network using semi-synthetic data, generated
from close talk speech recordings, and where we simulate the time delays and
distortion suffered in the signal that propagates from the source to the array
of microphones. We then fine tune this network using a small amount of real
data. Our experimental results show that this strategy is able to produce
networks that significantly improve existing localization methods based on
\textit{SRP-PHAT} strategies. In addition, our experiments show that our CNN
method exhibits better resistance against varying gender of the speaker and
different window sizes compared with the other methods.Comment: 18 pages, 3 figures, 8 table
High precision hybrid RF and ultrasonic chirp-based ranging for low-power IoT nodes
Hybrid acoustic-RF systems offer excellent ranging accuracy, yet they typically come at a power consumption that is too high to meet the energy constraints of mobile IoT nodes. We combine pulse compression and synchronized wake-ups to achieve a ranging solution that limits the active time of the nodes to 1 ms. Hence, an ultra low-power consumption of 9.015 µW for a single measurement is achieved. The operation time is estimated on 8.5 years on a CR2032 coin cell battery at a 1 Hz update rate, which is over 250 times larger than state-of-the-art RF-based positioning systems. Measurements based on a proof-of-concept hardware platform show median distance error values below 10 cm. Both simulations and measurements demonstrate that the accuracy is reduced at low signal-to-noise ratios and when reflections occur. We introduce three methods that enhance the distance measurements at a low extra processing power cost. Hence, we validate in realistic environments that the centimeter accuracy can be obtained within the energy budget of mobile devices and IoT nodes. The proposed hybrid signal ranging system can be extended to perform accurate, low-power indoor positioning
Robust Indoor Localization in a Reverberant Environment Using Microphone Pairs and Asynchronous Acoustic Beacons
In this paper, a robust indoor localization method using microphone pairs and asynchronous acoustic beacons was proposed. The proposed method is applicable even with a two-channel microphone pair, which is the minimal configuration of a microphone array. The proposed method estimates location by using the cross-correlation functions of the measured signals as location likelihoods. Three experiments were conducted to evaluate the proposed method. Four beacons were located at the corners of a localizing area of 4 m by 4 m and emitted signals with a bandwidth of 2 kHz. The localization results were compared to the previous method with deterministic direction-of-arrival estimation. The 90th percentiles of the localization error were 0.23 m for the proposed method with two microphones, 0.19 m for the proposed method with four microphones, and 0.30 m for the previous method under conditions without significant reverberation. Under a condition with reflective walls, the 90th percentile of the localization error of the previous method increased to 0.49 m, while that of the proposed method was only increased to 0.23 m for two microphones and 0.19 m for four microphones. The proposed method contributes to a robust localization in indoor environments and relieves the constraints of receiver configuration
Acoustic simultaneous localization and mapping (A-SLAM) of a moving microphone array and its surrounding speakers
Acoustic scene mapping creates a representation of positions of audio sources such as talkers within the surrounding environment of a microphone array. By allowing the array to move, the acoustic scene can be explored in order to improve the map. Furthermore, the spatial diversity of the kinematic array allows for estimation of the source-sensor distance in scenarios where source directions of arrival are measured. As sound source localization is performed relative to the array position, mapping of acoustic sources requires knowledge of the absolute position of the microphone array in the room. If the array is moving, its absolute position is unknown in practice. Hence, Simultaneous Localization and Mapping (SLAM) is required in order to localize the microphone array position and map the surrounding sound sources. In realistic environments, microphone arrays receive a convolutive mixture of direct-path speech signals, noise and reflections due to reverberation. A key challenge of Acoustic SLAM (a-SLAM) is robustness against reverberant clutter measurements and missing source detections. This paper proposes a novel bearing-only a-SLAM approach using a Single-Cluster Probability Hypothesis Density filter. Results demonstrate convergence to accurate estimates of the array trajectory and source positions
Simultaneous ranging and self-positioning in unsynchronized wireless acoustic sensor networks
Automatic ranging and self-positioning is a very
desirable property in wireless acoustic sensor networks (WASNs)
where nodes have at least one microphone and one loudspeaker.
However, due to environmental noise, interference and multipath
effects, audio-based ranging is a challenging task. This paper
presents a fast ranging and positioning strategy that makes use
of the correlation properties of pseudo-noise (PN) sequences for
estimating simultaneously relative time-of-arrivals (TOAs) from
multiple acoustic nodes. To this end, a proper test signal design
adapted to the acoustic node transducers is proposed. In addition,
a novel self-interference reduction method and a peak matching
algorithm are introduced, allowing for increased accuracy in
indoor environments. Synchronization issues are removed by
following a BeepBeep strategy, providing range estimates that
are converted to absolute node positions by means of multidimensional
scaling (MDS). The proposed approach is evaluated both
with simulated and real experiments under different acoustical
conditions. The results using a real network of smartphones and
laptops confirm the validity of the proposed approach, reaching
an average ranging accuracy below 1 centimeter.This work was supported by the Spanish Ministry of Economy and Competitiveness under Grant TIN2015-70202-P, TEC2012-37945-C02-02 and FEDER funds
- …