358 research outputs found
Unattended acoustic sensor systems for noise monitoring in national parks
2017 Spring.Includes bibliographical references.Detection and classification of transient acoustic signals is a difficult problem. The problem is often complicated by factors such as the variety of sources that may be encountered, the presence of strong interference and substantial variations in the acoustic environment. Furthermore, for most applications of transient detection and classification, such as speech recognition and environmental monitoring, online detection and classification of these transient events is required. This is even more crucial for applications such as environmental monitoring as it is often done at remote locations where it is unfeasible to set up a large, general-purpose processing system. Instead, some type of custom-designed system is needed which is power efficient yet able to run the necessary signal processing algorithms in near real-time. In this thesis, we describe a custom-designed environmental monitoring system (EMS) which was specifically designed for monitoring air traffic and other sources of interest in national parks. More specifically, this thesis focuses on the capabilities of the EMS and how transient detection, classification and tracking are implemented on it. The Sparse Coefficient State Tracking (SCST) transient detection and classification algorithm was implemented on the EMS board in order to detect and classify transient events. This algorithm was chosen because it was designed for this particular application and was shown to have superior performance compared to other algorithms commonly used for transient detection and classification. The SCST algorithm was implemented on an Artix 7 FPGA with parts of the algorithm running as dedicated custom logic and other parts running sequentially on a soft-core processor. In this thesis, the partitioning and pipelining of this algorithm is explained. Each of the partitions was tested independently to very their functionality with respect to the overall system. Furthermore, the entire SCST algorithm was tested in the field on actual acoustic data and the performance of this implementation was evaluated using receiver operator characteristic (ROC) curves and confusion matrices. In this test the FPGA implementation of SCST was able to achieve acceptable source detection and classification results despite a difficult data set and limited training data. The tracking of acoustic sources is done through successive direction of arrival (DOA) angle estimation using a wideband extension of the Capon beamforming algorithm. This algorithm was also implemented on the EMS in order to provide real-time DOA estimates for the detected sources. This algorithm was partitioned into several stages with some stages implemented in custom logic while others were implemented as software running on the soft-core processor. Just as with SCST, each partition of this beamforming algorithm was verified independently and then a full system test was conducted to evaluate whether it would be able to track an airborne source. For the full system test, a model airplane was flown at various trajectories relative to the EMS and the trajectories estimated by the system were compared to the ground truth. Although in this test the accuracy of the DOA estimates could not be evaluated, it was show that the algorithm was able to approximately form the general trajectory of a moving source which is sufficient for our application as only a general heading of the acoustic sources is desired
Sparse Array Design for Wideband Beamforming with Reduced Complexity in Tapped Delay-lines
Sparse wideband array design for sensor location optimization is highly nonlinear and it is traditionally solved by genetic algorithms (GAs) or other similar optimization methods. This is an extremely time-consuming process and an optimum solution is not always guaranteed. In this work, this problem is studied from the viewpoint of compressive sensing (CS). Although there have been CS-based methods proposed for the design of sparse narrowband arrays, its extension to the wideband case is not straightforward, as there are multiple coefficients associated with each sensor and they have to be simultaneously minimized in order to discard the corresponding sensor locations. At first, sensor location optimization for both general wideband beamforming and frequency invariant beamforming is considered. Then, sparsity in the tapped delay-line (TDL) coefficients associated with each sensor is considered in order to reduce the implementation complexity of each TDL. Finally, design of robust wideband arrays against norm-bounded steering vector errors is addressed. Design examples are provided to verify the effectiveness of the proposed methods, with comparisons drawn with a GA-based design method
Bayesian sparse regularization in near-field wideband aeroacoustic imaging for wind tunnel test
International audienc
Low-power SNN-based audio source localisation using a Hilbert Transform spike encoding scheme
Sound source localisation is used in many consumer electronics devices, to
help isolate audio from individual speakers and to reject noise. Localization
is frequently accomplished by "beamforming" algorithms, which combine
microphone audio streams to improve received signal power from particular
incident source directions. Beamforming algorithms generally use knowledge of
the frequency components of the audio source, along with the known microphone
array geometry, to analytically phase-shift microphone streams before combining
them. A dense set of band-pass filters is often used to obtain known-frequency
"narrowband" components from wide-band audio streams. These approaches achieve
high accuracy, but state of the art narrowband beamforming algorithms are
computationally demanding, and are therefore difficult to integrate into
low-power IoT devices. We demonstrate a novel method for sound source
localisation in arbitrary microphone arrays, designed for efficient
implementation in ultra-low-power spiking neural networks (SNNs). We use a
novel short-time Hilbert transform (STHT) to remove the need for demanding
band-pass filtering of audio, and introduce a new accompanying method for audio
encoding with spiking events. Our beamforming and localisation approach
achieves state-of-the-art accuracy for SNN methods, and comparable with
traditional non-SNN super-resolution approaches. We deploy our method to
low-power SNN audio inference hardware, and achieve much lower power
consumption compared with super-resolution methods. We demonstrate that signal
processing approaches can be co-designed with spiking neural network
implementations to achieve high levels of power efficiency. Our new
Hilbert-transform-based method for beamforming promises to also improve the
efficiency of traditional DSP-based signal processing
Array signal processing for source localization and enhancement
“A common approach to the wide-band microphone array problem is to assume a certain array geometry and then design optimal weights (often in subbands) to meet a set of desired criteria. In addition to weights, we consider the geometry of the microphone arrangement to be part of the optimization problem. Our approach is to use particle swarm optimization (PSO) to search for the optimal geometry while using an optimal weight design to design the weights for each particle’s geometry. The resulting directivity indices (DI’s) and white noise SNR gains (WNG’s) form the basis of the PSO’s fitness function. Another important consideration in the optimal weight design are several regularization parameters. By including those parameters in the particles, we optimize their values as well in the operation of the PSO. The proposed method allows the user great flexibility in specifying desired DI’s and WNG’s over frequency by virtue of the PSO fitness function.
Although the above method discusses beam and nulls steering for fixed locations, in real time scenarios, it requires us to estimate the source positions to steer the beam position adaptively. We also investigate source localization of sound and RF sources using machine learning techniques. As for the RF source localization, we consider radio frequency identification (RFID) antenna tags. Using a planar RFID antenna array with beam steering capability and using received signal strength indicator (RSSI) value captured for each beam position, the position of each RFID antenna tag is estimated. The proposed approach is also shown to perform well under various challenging scenarios”--Abstract, page iv
Raking the Cocktail Party
We present the concept of an acoustic rake receiver---a microphone beamformer
that uses echoes to improve the noise and interference suppression. The rake
idea is well-known in wireless communications; it involves constructively
combining different multipath components that arrive at the receiver antennas.
Unlike spread-spectrum signals used in wireless communications, speech signals
are not orthogonal to their shifts. Therefore, we focus on the spatial
structure, rather than temporal. Instead of explicitly estimating the channel,
we create correspondences between early echoes in time and image sources in
space. These multiple sources of the desired and the interfering signal offer
additional spatial diversity that we can exploit in the beamformer design.
We present several "intuitive" and optimal formulations of acoustic rake
receivers, and show theoretically and numerically that the rake formulation of
the maximum signal-to-interference-and-noise beamformer offers significant
performance boosts in terms of noise and interference suppression. Beyond
signal-to-noise ratio, we observe gains in terms of the \emph{perceptual
evaluation of speech quality} (PESQ) metric for the speech quality. We
accompany the paper by the complete simulation and processing chain written in
Python. The code and the sound samples are available online at
\url{http://lcav.github.io/AcousticRakeReceiver/}.Comment: 12 pages, 11 figures, Accepted for publication in IEEE Journal on
Selected Topics in Signal Processing (Special Issue on Spatial Audio
- …