Search CORE

347 research outputs found

Towards End-to-End Acoustic Localization using Deep Learning: from Audio Signal to Source Position Coordinates

Author: Macias-Guarasa Javier
Pizarro Daniel
Vera-Diaz Juan Manuel
Publication venue: 'MDPI AG'
Publication date: 29/07/2018
Field of study

This paper presents a novel approach for indoor acoustic source localization using microphone arrays and based on a Convolutional Neural Network (CNN). The proposed solution is, to the best of our knowledge, the first published work in which the CNN is designed to directly estimate the three dimensional position of an acoustic source, using the raw audio signal as the input information avoiding the use of hand crafted audio features. Given the limited amount of available localization data, we propose in this paper a training strategy based on two steps. We first train our network using semi-synthetic data, generated from close talk speech recordings, and where we simulate the time delays and distortion suffered in the signal that propagates from the source to the array of microphones. We then fine tune this network using a small amount of real data. Our experimental results show that this strategy is able to produce networks that significantly improve existing localization methods based on \textit{SRP-PHAT} strategies. In addition, our experiments show that our CNN method exhibits better resistance against varying gender of the speaker and different window sizes compared with the other methods.Comment: 18 pages, 3 figures, 8 table

arXiv.org e-Print Archive

Directory of Open Access Journals

High precision hybrid RF and ultrasonic chirp-based ranging for low-power IoT nodes

Author: Cox B
De Strycker L
Ottoy G
Van der Perre L
Wielandt S
Publication venue: eScholarship, University of California
Publication date: 29/09/2020
Field of study

Hybrid acoustic-RF systems offer excellent ranging accuracy, yet they typically come at a power consumption that is too high to meet the energy constraints of mobile IoT nodes. We combine pulse compression and synchronized wake-ups to achieve a ranging solution that limits the active time of the nodes to 1 ms. Hence, an ultra low-power consumption of 9.015 µW for a single measurement is achieved. The operation time is estimated on 8.5 years on a CR2032 coin cell battery at a 1 Hz update rate, which is over 250 times larger than state-of-the-art RF-based positioning systems. Measurements based on a proof-of-concept hardware platform show median distance error values below 10 cm. Both simulations and measurements demonstrate that the accuracy is reduced at low signal-to-noise ratios and when reflections occur. We introduce three methods that enhance the distance measurements at a low extra processing power cost. Hence, we validate in realistic environments that the centimeter accuracy can be obtained within the energy budget of mobile devices and IoT nodes. The proposed hybrid signal ranging system can be extended to perform accurate, low-power indoor positioning

eScholarship - University of California

Robust Indoor Localization in a Reverberant Environment Using Microphone Pairs and Asynchronous Acoustic Beacons

Author: Koichi MIZUTANI
Naoto WAKATSUKI
Satoki Ogiso
Tadashi EBIHARA
水谷孝一
海老原格
若槻尚斗
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/06/2019
Field of study

In this paper, a robust indoor localization method using microphone pairs and asynchronous acoustic beacons was proposed. The proposed method is applicable even with a two-channel microphone pair, which is the minimal configuration of a microphone array. The proposed method estimates location by using the cross-correlation functions of the measured signals as location likelihoods. Three experiments were conducted to evaluate the proposed method. Four beacons were located at the corners of a localizing area of 4 m by 4 m and emitted signals with a bandwidth of 2 kHz. The localization results were compared to the previous method with deterministic direction-of-arrival estimation. The 90th percentiles of the localization error were 0.23 m for the proposed method with two microphones, 0.19 m for the proposed method with four microphones, and 0.30 m for the previous method under conditions without significant reverberation. Under a condition with reflective walls, the 90th percentile of the localization error of the previous method increased to 0.49 m, while that of the proposed method was only increased to 0.23 m for two microphones and 0.19 m for four microphones. The proposed method contributes to a robust localization in indoor environments and relieves the constraints of receiver configuration

Tsukuba Repository

Acoustic simultaneous localization and mapping (A-SLAM) of a moving microphone array and its surrounding speakers

Author: Evers C
Moore AH
Naylor PA
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2016
Field of study

Acoustic scene mapping creates a representation of positions of audio sources such as talkers within the surrounding environment of a microphone array. By allowing the array to move, the acoustic scene can be explored in order to improve the map. Furthermore, the spatial diversity of the kinematic array allows for estimation of the source-sensor distance in scenarios where source directions of arrival are measured. As sound source localization is performed relative to the array position, mapping of acoustic sources requires knowledge of the absolute position of the microphone array in the room. If the array is moving, its absolute position is unknown in practice. Hence, Simultaneous Localization and Mapping (SLAM) is required in order to localize the microphone array position and map the surrounding sound sources. In realistic environments, microphone arrays receive a convolutive mixture of direct-path speech signals, noise and reflections due to reverberation. A key challenge of Acoustic SLAM (a-SLAM) is robustness against reverberant clutter measurements and missing source detections. This paper proposes a novel bearing-only a-SLAM approach using a Single-Cluster Probability Hypothesis Density filter. Results demonstrate convergence to accurate estimates of the array trajectory and source positions

Southampton (e-Prints Soton)

Spiral - Imperial College Digital Repository

Acoustic Echo Estimation using the model-based approach with Application to Spatial Map Construction in Robotics

Author: Saqib Usama
Publication venue: Aalborg Universitetsforlag
Publication date: 01/01/2022
Field of study

VBN

Simultaneous ranging and self-positioning in unsynchronized wireless acoustic sensor networks

Author: Belmonte-Fernández Óscar
Cobos Maximo
Pérez Solano Juan J.
Ramos German
Torres Ana M.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2016
Field of study

Automatic ranging and self-positioning is a very desirable property in wireless acoustic sensor networks (WASNs) where nodes have at least one microphone and one loudspeaker. However, due to environmental noise, interference and multipath effects, audio-based ranging is a challenging task. This paper presents a fast ranging and positioning strategy that makes use of the correlation properties of pseudo-noise (PN) sequences for estimating simultaneously relative time-of-arrivals (TOAs) from multiple acoustic nodes. To this end, a proper test signal design adapted to the acoustic node transducers is proposed. In addition, a novel self-interference reduction method and a peak matching algorithm are introduced, allowing for increased accuracy in indoor environments. Synchronization issues are removed by following a BeepBeep strategy, providing range estimates that are converted to absolute node positions by means of multidimensional scaling (MDS). The proposed approach is evaluated both with simulated and real experiments under different acoustical conditions. The results using a real network of smartphones and laptops confirm the validity of the proposed approach, reaching an average ranging accuracy below 1 centimeter.This work was supported by the Spanish Ministry of Economy and Competitiveness under Grant TIN2015-70202-P, TEC2012-37945-C02-02 and FEDER funds

Repositori Institucional de la Universitat Jaume I

RiuNet