Search CORE

2,042 research outputs found

Reflection-Aware Sound Source Localization

Author: An Inkyu
Manocha Dinesh
Son Myungbae
Yoon Sung-eui
Publication venue
Publication date: 21/11/2017
Field of study

We present a novel, reflection-aware method for 3D sound localization in indoor environments. Unlike prior approaches, which are mainly based on continuous sound signals from a stationary source, our formulation is designed to localize the position instantaneously from signals within a single frame. We consider direct sound and indirect sound signals that reach the microphones after reflecting off surfaces such as ceilings or walls. We then generate and trace direct and reflected acoustic paths using inverse acoustic ray tracing and utilize these paths with Monte Carlo localization to estimate a 3D sound source position. We have implemented our method on a robot with a cube-shaped microphone array and tested it against different settings with continuous and intermittent sound signals with a stationary or a mobile source. Across different settings, our approach can localize the sound with an average distance error of 0.8m tested in a room of 7m by 7m area with 3m height, including a mobile and non-line-of-sight sound source. We also reveal that the modeling of indirect rays increases the localization accuracy by 40% compared to only using direct acoustic rays.Comment: Submitted to ICRA 2018. The working video is available at (https://youtu.be/TkQ36lMEC-M

arXiv.org e-Print Archive

Crossref

Structured Sparsity Models for Multiparty Speech Recovery from Reverberant Recordings

Author: Asaei Afsaneh
Bourlard Hervé
Cevher Volkan
Golbabaee Mohammad
Publication venue
Publication date: 01/01/2012
Field of study

We tackle the multi-party speech recovery problem through modeling the acoustic of the reverberant chambers. Our approach exploits structured sparsity models to perform room modeling and speech recovery. We propose a scheme for characterizing the room acoustic from the unknown competing speech sources relying on localization of the early images of the speakers by sparse approximation of the spatial spectra of the virtual sources in a free-space model. The images are then clustered exploiting the low-rank structure of the spectro-temporal components belonging to each source. This enables us to identify the early support of the room impulse response function and its unique map to the room geometry. To further tackle the ambiguity of the reflection ratios, we propose a novel formulation of the reverberation model and estimate the absorption coefficients through a convex optimization exploiting joint sparsity model formulated upon spatio-spectral sparsity of concurrent speech representation. The acoustic parameters are then incorporated for separating individual speech signals through either structured sparse recovery or inverse filtering the acoustic channels. The experiments conducted on real data recordings demonstrate the effectiveness of the proposed approach for multi-party speech recovery and recognition.Comment: 31 page

arXiv.org e-Print Archive

Edinburgh Research Explorer

Extraction of acoustic sources for multiple arrays based on the ray space transform

Author: Antonacci F.
Borra F.
Sarti A.
Tubaro S.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

In this paper we present a source extraction technique for multiple uniform linear arrays distributed in space. The technique adopts the Ray Space Transform representation of the sound field, which is inherently based on the Plane Wave Decomposition. The Ray Space Transform gives us an intuitive representation of the acoustic field, thus enabling the adoption of geometrically-motivated constraints in the spatial filter design. The proposed approach is semi-blind since it needs as input an estimate of the source positions. We prove the effectiveness of the proposed solution through simulations using both white noise and speech signals

Archivio istituzionale della ricerca - Politecnico di Milano

Crossref

非球形のアレイ形状を持つAmbisonicsの定式化

Author: Trevino Jorge
Publication venue
Publication date: 25/12/2014
Field of study

Tohoku University鈴木陽一課

Tohoku University Repository (TOUR) / 東北大学機関リポジトリ

Real-time Microphone Array Processing for Sound-field Analysis and Perceptually Motivated Reproduction

Author: McCormack Leo
Publication venue
Publication date: 11/12/2017
Field of study

This thesis details real-time implementations of sound-field analysis and perceptually motivated reproduction methods for visualisation and auralisation purposes. For the former, various methods for visualising the relative distribution of sound energy from one point in space are investigated and contrasted; including a novel reformulation of the cross-pattern coherence (CroPaC) algorithm, which integrates a new side-lobe suppression technique. Whereas for auralisation applications, listening tests were conducted to compare ambisonics reproduction with a novel headphone formulation of the directional audio coding (DirAC) method. The results indicate that the side-lobe suppressed CroPaC method offers greater spatial selectivity in reverberant conditions compared with other popular approaches, and that the new DirAC formulation yields higher perceived spatial accuracy when compared to the ambisonics method

Aaltodoc Publication Archive

Acoustic Imaging with Circular Microphone Array: a new Approach for Sound Field Analysis

Author: Amy Bastine
Augusto Sarti
Fabio Antonacci
Marco Olivieri
Mirco Pezzoli
Thushara Abhayapala
Publication venue
Publication date: 01/01/2024
Field of study

Acoustic imaging is powerful in collecting spatial information of acoustic sources into a visual representation. In this paper, we focus on the analysis of the exterior acoustic field captured by a circular array of microphones. With a proper parametrization based on angles, we map the directions of arrival of sources as a function of the microphone locations, thus obtaining an acoustic image called "angular space". Therefore, we introduce a linear transform to enable analysis and synthesis operations for mapping the microphone pressures onto the angular space using local space-time Fourier analysis. We prove the ability of this representation to combine global information coming from multiple arrays in a single acoustic image that can be processed and manipulated. Examples of source localization applications in simulated and measured scenarios show the effectiveness of the proposed method obtaining results comparable with state-of-the- art methods

Archivio istituzionale della ricerca - Politecnico di Milano