    SoundCompass: a distributed MEMS microphone array-based sensor for sound source localization

    Sound source localization is a well-researched subject, with applications ranging from localizing sniper fire in urban battlefields to cataloging wildlife in rural areas. One critical application is the localization of noise pollution sources in urban environments, due to an increasing body of evidence linking noise pollution to adverse effects on human health. Current noise mapping techniques often fail to accurately identify noise pollution sources because they rely on the interpolation of a limited number of scattered sound sensors. Aiming to produce accurate noise pollution maps, we developed the SoundCompass, a low-cost sound sensor capable of measuring local noise levels and sound field directionality. Our first prototype is composed of a sensor array of 52 microelectromechanical-systems (MEMS) microphones, an inertial measurement unit and a low-power field-programmable gate array (FPGA). This article presents the SoundCompass's hardware and firmware design together with a data fusion technique that exploits the sensing capabilities of the SoundCompass in a wireless sensor network to localize noise pollution sources. Live tests produced a sound source localization accuracy of a few centimeters in a 25 m² anechoic chamber, while simulation results accurately located up to five broadband sound sources in a 10,000 m² open field.
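    To make the network-level fusion idea concrete, here is a minimal sketch (not the paper's implementation) of how per-node directionality estimates could be fused into a source position: each node reports a bearing toward the source, and the bearing lines are intersected by linear least squares. The node positions and bearings are hypothetical.

```python
# Minimal sketch: fusing bearing (direction-of-arrival) estimates from
# several distributed sensor nodes into one source position estimate.
import numpy as np

def fuse_bearings(node_positions, bearings_rad):
    """Each node i at p_i reports a bearing theta_i toward the source.
    A point x on node i's bearing line satisfies n_i . x = n_i . p_i,
    where n_i is the unit normal to the bearing direction. Stacking one
    such equation per node gives an overdetermined system A x = b."""
    A, b = [], []
    for p, theta in zip(node_positions, bearings_rad):
        n = np.array([-np.sin(theta), np.cos(theta)])  # normal to bearing
        A.append(n)
        b.append(n @ p)
    x, *_ = np.linalg.lstsq(np.array(A), np.array(b), rcond=None)
    return x  # estimated 2-D source position

nodes = np.array([[0.0, 0.0], [5.0, 0.0], [0.0, 5.0]])  # hypothetical layout
bearings = np.radians([45.0, 135.0, -45.0])             # all point at (2.5, 2.5)
print(fuse_bearings(nodes, bearings))
```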

    Structured Sparsity Models for Multiparty Speech Recovery from Reverberant Recordings

    We tackle the multi-party speech recovery problem through modeling the acoustics of reverberant chambers. Our approach exploits structured sparsity models to perform room modeling and speech recovery. We propose a scheme for characterizing the room acoustics from the unknown competing speech sources, relying on localization of the early images of the speakers by sparse approximation of the spatial spectra of the virtual sources in a free-space model. The images are then clustered by exploiting the low-rank structure of the spectro-temporal components belonging to each source. This enables us to identify the early support of the room impulse response function and its unique mapping to the room geometry. To further tackle the ambiguity of the reflection ratios, we propose a novel formulation of the reverberation model and estimate the absorption coefficients through convex optimization, exploiting a joint sparsity model formulated upon the spatio-spectral sparsity of the concurrent speech representation. The acoustic parameters are then incorporated to separate the individual speech signals through either structured sparse recovery or inverse filtering of the acoustic channels. Experiments conducted on real data recordings demonstrate the effectiveness of the proposed approach for multi-party speech recovery and recognition.
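    As a rough illustration of the sparse-approximation step (not the paper's algorithm), the sketch below locates a few point sources on a candidate grid by greedy sparse approximation against a free-space steering dictionary; the array geometry, frequency, and grid are assumed values.

```python
# Sketch: locate sparse sources on a grid via orthogonal matching pursuit
# against a dictionary of free-space (Green's function) steering vectors.
import numpy as np

rng = np.random.default_rng(1)
c, f = 343.0, 1000.0                       # speed of sound (m/s), frequency (Hz)
k = 2 * np.pi * f / c                      # wavenumber
mics = rng.random((8, 2)) * 2.0            # 8 mic positions in a 2 m square
grid = np.stack(np.meshgrid(np.linspace(0, 2, 21),
                            np.linspace(0, 2, 21)), -1).reshape(-1, 2)

def steering(src):
    """Free-space propagation from a candidate source to all mics."""
    d = np.linalg.norm(mics - src, axis=1)
    g = np.exp(-1j * k * d) / d
    return g / np.linalg.norm(g)           # unit-norm dictionary column

D = np.stack([steering(p) for p in grid], axis=1)   # (8, 441) dictionary

true_idx = [50, 300]                       # illustrative ground truth
x = D[:, true_idx] @ np.array([1.0, 0.7])  # observed snapshot, two sources

# Greedy pursuit: pick the grid point most correlated with the residual,
# refit on the growing support, repeat.
support, r = [], x.copy()
for _ in range(len(true_idx)):
    support.append(int(np.argmax(np.abs(D.conj().T @ r))))
    coef, *_ = np.linalg.lstsq(D[:, support], x, rcond=None)
    r = x - D[:, support] @ coef
print(sorted(support), "vs true", sorted(true_idx))
```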

    Scan and paint: theory and practice of a sound field visualization method

    Sound visualization techniques have played a key role in the development of acoustics throughout history. The development of measurement apparatus and techniques for displaying sound and vibration phenomena has provided excellent tools for building understanding about specific problems. Traditional methods, such as step-by-step measurements or simultaneous multichannel systems, involve a strong tradeoff between time requirements, flexibility, and cost. However, if the sound field can be assumed time stationary, scanning methods allow us to assess variations across space with a single transducer, as long as the position of the sensor is known. The proposed technique, Scan and Paint, is based on the acquisition of sound pressure and particle velocity by manually moving a P-U probe (a combined pressure-particle velocity sensor) across a sound field whilst filming the event with a camera. The sensor position is extracted by applying automatic color tracking to each frame of the recorded video. It is then possible to visualize sound variations across the space in terms of sound pressure, particle velocity, or acoustic intensity. This paper explores not only the theoretical foundations of the method but also its practical applications, such as scanning transfer path analysis, source radiation characterization, operational deflection shapes, virtual phased arrays, material characterization, and acoustic intensity vector field mapping.
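    A minimal sketch of the color-tracking step, assuming OpenCV, a hypothetical recording file, and a hypothetical level_at() helper standing in for the time-synced sound level of each frame:

```python
# Sketch: track a colored marker on the probe per video frame (HSV
# thresholding + centroid of the binary mask), then accumulate the
# concurrent sound level into a spatial grid.
import cv2
import numpy as np

def level_at(frame_idx):
    # Hypothetical stand-in: the SPL (dB) measured by the probe at the
    # instant of this frame; in practice this comes from the synced
    # audio acquisition.
    return 60.0

cap = cv2.VideoCapture("scan_session.mp4")       # hypothetical recording
lo, hi = (35, 80, 80), (85, 255, 255)            # HSV bounds for a green marker
grid = np.zeros((48, 64))                        # spatial accumulation grid
counts = np.zeros_like(grid)

frame_idx = 0
while True:
    ok, frame = cap.read()
    if not ok:
        break
    mask = cv2.inRange(cv2.cvtColor(frame, cv2.COLOR_BGR2HSV), lo, hi)
    m = cv2.moments(mask)
    if m["m00"] > 0:                             # marker visible in frame
        u, v = m["m10"] / m["m00"], m["m01"] / m["m00"]   # centroid (pixels)
        gx = min(int(u / frame.shape[1] * grid.shape[1]), grid.shape[1] - 1)
        gy = min(int(v / frame.shape[0] * grid.shape[0]), grid.shape[0] - 1)
        grid[gy, gx] += level_at(frame_idx)
        counts[gy, gx] += 1
    frame_idx += 1

# Mean level per visited cell; unvisited cells stay NaN.
sound_map = np.divide(grid, counts, out=np.full_like(grid, np.nan),
                      where=counts > 0)
```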

    Runtime reconfigurable beamforming architecture for real-time sound-source localization

    A Noise-Robust Method with Smoothed ℓ1/ℓ2 Regularization for Sparse Moving-Source Mapping

    The method described here performs blind deconvolution of the beamforming output in the frequency domain. To provide accurate blind deconvolution, sparsity priors are introduced with a smoothed ℓ1/ℓ2 regularization term. As the mean of the noise in the power spectrum domain depends on its variance in the time domain, the proposed method includes a variance estimation step, which allows more robust blind deconvolution. The method is validated on both simulated and real data, and its performance is compared with two well-known methods from the literature: the deconvolution approach for the mapping of acoustic sources (DAMAS) and sound density modeling.
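    The paper's exact formulation operates on beamforming power spectra; as a generic illustration of how a smoothed ℓ1/ℓ2 term promotes sparsity in a deconvolution setting, here is projected gradient descent on a small one-dimensional problem (the Gaussian point-spread function and all constants are illustrative, not the paper's values):

```python
# Sketch: minimize ||y - A q||^2 + lam * S1(q)/S2(q), where S1 and S2 are
# epsilon-smoothed l1 and l2 norms, with a nonnegativity projection.
import numpy as np

rng = np.random.default_rng(0)
n = 64
A = np.exp(-0.5 * ((np.arange(n)[:, None] - np.arange(n)[None, :]) / 3.0) ** 2)
q_true = np.zeros(n); q_true[[10, 40]] = [1.0, 0.6]   # two point sources
y = A @ q_true + 0.01 * rng.standard_normal(n)        # blurred, noisy map

eps, lam, step = 1e-3, 0.05, 1e-3
q = np.zeros(n)
for _ in range(5000):
    s1 = np.sum(np.sqrt(q**2 + eps**2))               # smoothed l1 norm
    s2 = np.sqrt(np.sum(q**2) + eps**2)               # smoothed l2 norm
    g1 = q / np.sqrt(q**2 + eps**2)                   # gradient of s1
    g2 = q / s2                                       # gradient of s2
    grad_pen = (g1 * s2 - s1 * g2) / s2**2            # quotient rule for s1/s2
    grad = 2 * A.T @ (A @ q - y) + lam * grad_pen
    q = np.maximum(q - step * grad, 0.0)              # source powers are nonneg
print(np.nonzero(q > 0.1)[0])                         # peaks should sit near 10, 40
```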

    A novel deconvolution beamforming algorithm for virtual phased arrays

    Beamforming techniques using phased microphone arrays are among the most common tools for localizing and quantifying noise sources. However, the use of such devices entails a series of well-known disadvantages, such as their very high cost and transducer mismatch. Virtual Phased Arrays (VPAs) have been proposed as an alternative solution that avoids these difficulties, provided the sound field is time stationary. Several frequency domain beamforming techniques can be adapted to use only the relative phase between a fixed and a moving transducer. The results traditionally obtained using large arrays can therefore be emulated by applying beamforming algorithms to data acquired from only two sensors. This paper presents a novel beamforming algorithm which uses a deconvolution approach to strongly reduce the presence of side lobes. A series of synthetic noise sources with negative source strength are introduced in order to maximize the dynamic range of the deconvolved beamforming map. This iterative sidelobe cleaner algorithm (ISCA) does not require the use of the covariance matrix of the array, hence it can also be applied to a VPA. The performance of ISCA is compared across several simulations with conventional deconvolution algorithms such as DAMAS and NNLS. The results support the robustness and accuracy of the proposed approach, providing clear localization maps in all the conditions evaluated.
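    ISCA itself is the paper's contribution; for context, here is the classic CLEAN-style loop that deconvolution sidelobe cleaners build on: repeatedly find the strongest peak in the dirty map, subtract a scaled copy of the array's point-spread function (PSF) centered there, and record the removed component in a clean map. The Gaussian PSF and source layout below are illustrative.

```python
import numpy as np

def clean(dirty_map, psf, gain=0.5, n_iter=200, threshold=1e-3):
    """CLEAN-style deconvolution. `psf` is peak-normalized and centered
    in an array of the same shape as `dirty_map`."""
    residual = dirty_map.astype(float).copy()
    clean_map = np.zeros_like(residual)
    center = np.array(psf.shape) // 2
    for _ in range(n_iter):
        peak = np.unravel_index(np.argmax(residual), residual.shape)
        amp = gain * residual[peak]
        if amp < threshold:
            break
        # Subtract a scaled, shifted PSF (np.roll wrap-around is acceptable
        # for a sketch) and record the removed component.
        shift = tuple(np.array(peak) - center)
        residual -= amp * np.roll(psf, shift, axis=(0, 1))
        clean_map[peak] += amp
    return clean_map, residual

# Illustrative dirty map: two point sources blurred by a Gaussian PSF.
y, x = np.mgrid[-16:16, -16:16]
psf = np.exp(-(x**2 + y**2) / 8.0)                  # peak of 1 at index (16, 16)
dirty = 1.0 * np.roll(psf, (4, -6), axis=(0, 1)) + 0.5 * psf
clean_map, residual = clean(dirty, psf)
print(np.unravel_index(np.argmax(clean_map), clean_map.shape))  # near (20, 10)
```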

    CABE : a cloud-based acoustic beamforming emulator for FPGA-based sound source localization

    Microphone arrays are gaining in popularity thanks to the availability of low-cost microphones. Applications including sonar, binaural hearing aids, indoor acoustic localization, and speech recognition have been proposed by several research groups and companies. In most of the available implementations, the microphones are assumed to offer an ideal response in a given frequency range. Several toolboxes can be used to obtain the theoretical response of a microphone array with a given beamforming algorithm. However, no tool could be found that facilitates the design of a microphone array while taking its non-ideal characteristics into account. Moreover, to our knowledge, no tool generates packages that facilitate the implementation on field-programmable gate arrays. Visualizing the responses in 2D and 3D also poses an engineering challenge. To alleviate these shortcomings, a scalable Cloud-based Acoustic Beamforming Emulator (CABE) is proposed. The non-ideal characteristics of microphones are considered during the computations, and results are validated with acoustic data captured from microphones. The emulator can also generate hardware description language packages containing delay tables, facilitating the implementation of Delay-and-Sum beamformers in embedded hardware. Truncation error analysis can also be carried out for fixed-point signal processing, and the effects of disabling a given group of microphones within the array can be calculated. Results and packages can be visualized with a dedicated client application. Users can create and configure several parameters of an emulation, including sound source placement, the shape of the microphone array, and the required signal processing flow. Depending on the user configuration, the client application can generate 2D and 3D graphs showing the beamforming results, waterfall diagrams, and performance metrics. The emulations are also validated with data captured from existing microphone arrays.
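    As a rough illustration of the delay-table generation that such packages automate (the array geometry, sample rate, and steering grid below are assumptions, not CABE's defaults): a Delay-and-Sum beamformer needs, for each steering direction, the per-microphone delay of a far-field plane wave, quantized to whole samples.

```python
# Sketch: integer-sample delay table for a circular array and a grid of
# steering directions; the rounding step is the truncation error source
# that a fixed-point analysis would quantify.
import numpy as np

c, fs = 343.0, 48000.0                      # speed of sound (m/s), sample rate (Hz)
n_mics, radius = 8, 0.05                    # 8 mics on a 5 cm circle
ang = 2 * np.pi * np.arange(n_mics) / n_mics
mics = radius * np.column_stack([np.cos(ang), np.sin(ang)])

steer = np.radians(np.arange(0, 360, 5))    # candidate look directions
dirs = np.column_stack([np.cos(steer), np.sin(steer)])

# Far-field delay (s) of each mic relative to the array center, per
# direction, shifted to be nonnegative and rounded to whole samples.
tau = -(mics @ dirs.T) / c                  # shape (n_mics, n_dirs)
delay_table = np.round((tau - tau.min(axis=0)) * fs).astype(int)
print(delay_table.shape, delay_table.max(), "samples max delay")
```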