2,238 research outputs found

    Design exploration and performance strategies towards power-efficient FPGA-based achitectures for sound source localization

    Get PDF
    Many applications rely on MEMS microphone arrays for locating sound sources prior to their execution. Those applications not only are executed under real-time constraints but also are often embedded on low-power devices. These environments become challenging when increasing the number of microphones or requiring dynamic responses. Field-Programmable Gate Arrays (FPGAs) are usually chosen due to their flexibility and computational power. This work intends to guide the design of reconfigurable acoustic beamforming architectures, which are not only able to accurately determine the sound Direction-Of-Arrival (DoA) but also capable to satisfy the most demanding applications in terms of power efficiency. Design considerations of the required operations performing the sound location are discussed and analysed in order to facilitate the elaboration of reconfigurable acoustic beamforming architectures. Performance strategies are proposed and evaluated based on the characteristics of the presented architecture. This power-efficient architecture is compared to a different architecture prioritizing performance in order to reveal the unavoidable design trade-offs

    Ad Hoc Microphone Array Calibration: Euclidean Distance Matrix Completion Algorithm and Theoretical Guarantees

    Get PDF
    This paper addresses the problem of ad hoc microphone array calibration where only partial information about the distances between microphones is available. We construct a matrix consisting of the pairwise distances and propose to estimate the missing entries based on a novel Euclidean distance matrix completion algorithm by alternative low-rank matrix completion and projection onto the Euclidean distance space. This approach confines the recovered matrix to the EDM cone at each iteration of the matrix completion algorithm. The theoretical guarantees of the calibration performance are obtained considering the random and locally structured missing entries as well as the measurement noise on the known distances. This study elucidates the links between the calibration error and the number of microphones along with the noise level and the ratio of missing distances. Thorough experiments on real data recordings and simulated setups are conducted to demonstrate these theoretical insights. A significant improvement is achieved by the proposed Euclidean distance matrix completion algorithm over the state-of-the-art techniques for ad hoc microphone array calibration.Comment: In Press, available online, August 1, 2014. http://www.sciencedirect.com/science/article/pii/S0165168414003508, Signal Processing, 201

    A Methodology for Acoustic Measurement and Separation of Background Noise

    Get PDF
    An attempt is made at developing experimental methods for the acoustic measurement and separation of background noise in a wind tunnel. To this end, an array beamforming technique known as delay-and- sum beamforming is identified and tested. The theory underlying delay-and-sum beamforming is discussed. Two linear arrays, the seven microphone linear array and the four microphone minimum redundancy array, are designed. A driver is designed based on Helmholtz resonator theory to provide a source of monochromatic sound. Also, the concept of partial coherence as applicable to the separation of background noise from signal noise is investigated. Array beamforming results show that tests conducted with the two linear arrays in the open field provide good resolution of the sound source Direction Of Arrival (DOA) peaks from the background noise, and provide a semianechoic reference with which to compare wind tunnel results. Beamforming results obtained for the driver placed inside the wind tunnel with the tunnel running at 0, 45, and 81 ft/sec successfully resolved the DOA peaks of the driver from the background noise of the tunnel. At a tunnel velocity of 151 ft/sec, the driver signal is completely buried in the background noise of the tunnel, and beamforming was not successful in resolving the peak corresponding to the driver signal

    EXPERIMENTAL EVALUATION OF MODIFIED PHASE TRANSFORM FOR SOUND SOURCE DETECTION

    Get PDF
    The detection of sound sources with microphone arrays can be enhanced through processing individual microphone signals prior to the delay and sum operation. One method in particular, the Phase Transform (PHAT) has demonstrated improvement in sound source location images, especially in reverberant and noisy environments. Recent work proposed a modification to the PHAT transform that allows varying degrees of spectral whitening through a single parameter, andamp;acirc;, which has shown positive improvement in target detection in simulation results. This work focuses on experimental evaluation of the modified SRP-PHAT algorithm. Performance results are computed from actual experimental setup of an 8-element perimeter array with a receiver operating characteristic (ROC) analysis for detecting sound sources. The results verified simulation results of PHAT- andamp;acirc; in improving target detection probabilities. The ROC analysis demonstrated the relationships between various target types (narrowband and broadband), room reverberation levels (high and low) and noise levels (different SNR) with respect to optimal andamp;acirc;. Results from experiment strongly agree with those of simulations on the effect of PHAT in significantly improving detection performance for narrowband and broadband signals especially at low SNR and in the presence of high levels of reverberation

    MICROPHONE ARRAY OPTIMIZATION IN IMMERSIVE ENVIRONMENTS

    Get PDF
    The complex relationship between array gain patterns and microphone distributions limits the application of traditional optimization algorithms on irregular arrays, which show enhanced beamforming performance for human speech capture in immersive environments. This work analyzes the relationship between irregular microphone geometries and spatial filtering performance with statistical methods. Novel geometry descriptors are developed to capture the properties of irregular microphone distributions showing their impact on array performance. General guidelines and optimization methods for regular and irregular array design are proposed in immersive (near-field) environments to obtain superior beamforming ability for speech applications. Optimization times are greatly reduced through the objective function rules using performance-based geometric descriptions of microphone distributions that circumvent direct array gain computations over the space of interest. In addition, probabilistic descriptions of acoustic scenes are introduced to incorporate various levels of prior knowledge for the source distribution. To verify the effectiveness of the proposed optimization methods, simulated gain patterns and real SNR results of the optimized arrays are compared to corresponding traditional regular arrays and arrays obtained from direct exhaustive searching methods. Results show large SNR enhancements for the optimized arrays over arbitrary randomly generated arrays and regular arrays, especially at low microphone densities. The rapid convergence and acceptable processing times observed during the experiments establish the feasibility of proposed optimization methods for array geometry design in immersive environments where rapid deployment is required with limited knowledge of the acoustic scene, such as in mobile platforms and audio surveillance applications

    Using Acoustic Holography for Vibration Analysis

    Get PDF
    Disertační práce se zabývá bezkontaktní analýzou vibrací pomocí metod akustické holografie v blízkém poli. Akustická holografie v blízkém poli je experimentální metoda, která rekonstruuje akustické pole v těsné blízkosti povrchu vibrujícího předmětu na základě měření akustického tlaku nebo akustické rychlosti v určité vzdálenosti od zkoumaného předmětu. Konkrétní realizace této metody závisí na použitém výpočetním algoritmu. Vlastní práce je zaměřena zejména na rozbor algoritmů, které využívají k rekonstrukci zvukového pole v blízkosti vibrujícího objektu transformaci do domény vlnových čísel (prostorová transformace), kde probíhá vlastní výpočet. V úvodu práce je vysvětlena základní teorie metody akustické holografie v blízkém poli s popisem základních vlastností a dále rozborem konkrétních nejčastěji používaných algoritmům pro lokalizaci a charakterizaci zdroje zvuku a pro následnou vibrační analýzu. Stěžejní část práce se věnuje pokročilým metodám zpracování, které se snaží určitým způsobem optimalizovat přesnost predice zvukového pole v blízkosti vibrujícího předmětu v reálných podmínkách. Jde zejména o problematiku použitého měřicího systému s akustickými snímači, které nejsou ideální, a dále o možnost měření v prostorách s difúzním charakterem zvukového pole. Pro tento případ byla na základě literárního průzkumu optimalizována a ověřena metoda využívající dvouvrstvé mikrofonní pole, které umožňuje oddělení zvukových polí přicházejících z různých stran a tedy úspěšné měření v uzavřených prostorách např. kabin automobilů a letadel. Součástí práce byla také optimalizace, rozšíření a následné ověření algoritmů publikovaných v posledních letech pro měření v reálných podmínkách za použití běžně dostupných akustických snímačů.The main aim of the thesis is application of near-field acoustic holography for non-contact vibration analysis. Near-field acoustic holography is an experimental technique for reconstruction of sound field close to the surface of the vibrating object based on measurement of sound pressure or acoustic particle velocity in certain distance from the examined object. Practical realization of this method depends on used calculation procedure. The thesis is focused on analysis of acoustic holography algorithms with transformation into wavenumber domain (spatial transformation) where the reconstruction of the sound field near vibrating object is calculated. The introductory part of the thesis describes the theory of near-field acoustic holography with general characteristics and with analysis of most common algorithms used for localization and characterization of sound source and consequent vibration analysis. Principal part of the thesis deals with advanced processing methods where these methods try to optimize the accuracy of prediction of sound field near vibrating object in real environment. In this study, real measurement conditions represent the measurement system with non-ideal acoustic sensors and also areas with reverberant sound field. Based on literature study, there has been optimized and verified the new method which uses double layer microphone array to separate incoming and outgoing sound field, thus allows successful measurement in confined space e.g. cabins of cars and airplanes. Part of the thesis has been also focused on optimization, extension and successive experimental validation of selected classical algorithms published in last decade for possible measurement in real conditions and with common acoustic sensors.

    Mathematical modelling ano optimization strategies for acoustic source localization in reverberant environments

    Get PDF
    La presente Tesis se centra en el uso de técnicas modernas de optimización y de procesamiento de audio para la localización precisa y robusta de personas dentro de un entorno reverberante dotado con agrupaciones (arrays) de micrófonos. En esta tesis se han estudiado diversos aspectos de la localización sonora, incluyendo el modelado, la algoritmia, así como el calibrado previo que permite usar los algoritmos de localización incluso cuando la geometría de los sensores (micrófonos) es desconocida a priori. Las técnicas existentes hasta ahora requerían de un número elevado de micrófonos para obtener una alta precisión en la localización. Sin embargo, durante esta tesis se ha desarrollado un nuevo método que permite una mejora de más del 30\% en la precisión de la localización con un número reducido de micrófonos. La reducción en el número de micrófonos es importante ya que se traduce directamente en una disminución drástica del coste y en un aumento de la versatilidad del sistema final. Adicionalmente, se ha realizado un estudio exhaustivo de los fenómenos que afectan al sistema de adquisición y procesado de la señal, con el objetivo de mejorar el modelo propuesto anteriormente. Dicho estudio profundiza en el conocimiento y modelado del filtrado PHAT (ampliamente utilizado en localización acústica) y de los aspectos que lo hacen especialmente adecuado para localización. Fruto del anterior estudio, y en colaboración con investigadores del instituto IDIAP (Suiza), se ha desarrollado un sistema de auto-calibración de las posiciones de los micrófonos a partir del ruido difuso presente en una sala en silencio. Esta aportación relacionada con los métodos previos basados en la coherencia. Sin embargo es capaz de reducir el ruido atendiendo a parámetros físicos previamente conocidos (distancia máxima entre los micrófonos). Gracias a ello se consigue una mejor precisión utilizando un menor tiempo de cómputo. El conocimiento de los efectos del filtro PHAT ha permitido crear un nuevo modelo que permite la representación 'sparse' del típico escenario de localización. Este tipo de representación se ha demostrado ser muy conveniente para localización, permitiendo un enfoque sencillo del caso en el que existen múltiples fuentes simultáneas. La última aportación de esta tesis, es el de la caracterización de las Matrices TDOA (Time difference of arrival -Diferencia de tiempos de llegada, en castellano-). Este tipo de matrices son especialmente útiles en audio pero no están limitadas a él. Además, este estudio transciende a la localización con sonido ya que propone métodos de reducción de ruido de las medias TDOA basados en una representación matricial 'low-rank', siendo útil, además de en localización, en técnicas tales como el beamforming o el autocalibrado

    MULTI-SCALE LOCAL CONTRAST ENHANCEMENT FOR HIGH DYNAMIC RANGE TONE MAPPING

    Get PDF
    A display utilizes virtual primary colors in image zones of a display to produce image colors in the respective zones. The display is broken into the plurality of image zones and then driven by creating a monochrome dynamic primary color per field in each zone. This can enable driving of multiple primary LEDs per frame and remapping of image content so as to dynamically optimize scenes for each frame
    corecore