28,841 research outputs found
Online Localization and Tracking of Multiple Moving Speakers in Reverberant Environments
We address the problem of online localization and tracking of multiple moving
speakers in reverberant environments. The paper has the following
contributions. We use the direct-path relative transfer function (DP-RTF), an
inter-channel feature that encodes acoustic information robust against
reverberation, and we propose an online algorithm well suited for estimating
DP-RTFs associated with moving audio sources. Another crucial ingredient of the
proposed method is its ability to properly assign DP-RTFs to audio-source
directions. Towards this goal, we adopt a maximum-likelihood formulation and we
propose to use an exponentiated gradient (EG) to efficiently update
source-direction estimates starting from their currently available values. The
problem of multiple speaker tracking is computationally intractable because the
number of possible associations between observed source directions and physical
speakers grows exponentially with time. We adopt a Bayesian framework and we
propose a variational approximation of the posterior filtering distribution
associated with multiple speaker tracking, as well as an efficient variational
expectation-maximization (VEM) solver. The proposed online localization and
tracking method is thoroughly evaluated using two datasets that contain
recordings performed in real environments.Comment: IEEE Journal of Selected Topics in Signal Processing, 201
Comparison of Gravitational Wave Detector Network Sky Localization Approximations
Gravitational waves emitted during compact binary coalescences are a
promising source for gravitational-wave detector networks. The accuracy with
which the location of the source on the sky can be inferred from gravitational
wave data is a limiting factor for several potential scientific goals of
gravitational-wave astronomy, including multi-messenger observations. Various
methods have been used to estimate the ability of a proposed network to
localize sources. Here we compare two techniques for predicting the uncertainty
of sky localization -- timing triangulation and the Fisher information matrix
approximations -- with Bayesian inference on the full, coherent data set. We
find that timing triangulation alone tends to over-estimate the uncertainty in
sky localization by a median factor of for a set of signals from
non-spinning compact object binaries ranging up to a total mass of , and the over-estimation increases with the mass of the system. We
find that average predictions can be brought to better agreement by the
inclusion of phase consistency information in timing-triangulation techniques.
However, even after corrections, these techniques can yield significantly
different results to the full analysis on specific mock signals. Thus, while
the approximate techniques may be useful in providing rapid, large scale
estimates of network localization capability, the fully coherent Bayesian
analysis gives more robust results for individual signals, particularly in the
presence of detector noise.Comment: 11 pages, 7 Figure
Space Time MUSIC: Consistent Signal Subspace Estimation for Wide-band Sensor Arrays
Wide-band Direction of Arrival (DOA) estimation with sensor arrays is an
essential task in sonar, radar, acoustics, biomedical and multimedia
applications. Many state of the art wide-band DOA estimators coherently process
frequency binned array outputs by approximate Maximum Likelihood, Weighted
Subspace Fitting or focusing techniques. This paper shows that bin signals
obtained by filter-bank approaches do not obey the finite rank narrow-band
array model, because spectral leakage and the change of the array response with
frequency within the bin create \emph{ghost sources} dependent on the
particular realization of the source process. Therefore, existing DOA
estimators based on binning cannot claim consistency even with the perfect
knowledge of the array response. In this work, a more realistic array model
with a finite length of the sensor impulse responses is assumed, which still
has finite rank under a space-time formulation. It is shown that signal
subspaces at arbitrary frequencies can be consistently recovered under mild
conditions by applying MUSIC-type (ST-MUSIC) estimators to the dominant
eigenvectors of the wide-band space-time sensor cross-correlation matrix. A
novel Maximum Likelihood based ST-MUSIC subspace estimate is developed in order
to recover consistency. The number of sources active at each frequency are
estimated by Information Theoretic Criteria. The sample ST-MUSIC subspaces can
be fed to any subspace fitting DOA estimator at single or multiple frequencies.
Simulations confirm that the new technique clearly outperforms binning
approaches at sufficiently high signal to noise ratio, when model mismatches
exceed the noise floor.Comment: 15 pages, 10 figures. Accepted in a revised form by the IEEE Trans.
on Signal Processing on 12 February 1918. @IEEE201
A robust sequential hypothesis testing method for brake squeal localisation
This contribution deals with the in situ detection and localisation of brake squeal in an automobile. As brake squeal is emitted from regions known a priori, i.e., near the wheels, the localisation is treated as a hypothesis testing problem. Distributed microphone arrays, situated under the automobile, are used to capture the directional properties of the sound field generated by a squealing brake. The spatial characteristics of the sampled sound field is then used to formulate the hypothesis tests. However, in contrast to standard hypothesis testing approaches of this kind, the propagation environment is complex and time-varying. Coupled with inaccuracies in the knowledge of the sensor and source positions as well as sensor gain mismatches, modelling the sound field is difficult and standard approaches fail in this case. A previously proposed approach implicitly tried to account for such incomplete system knowledge and was based on ad hoc likelihood formulations. The current paper builds upon this approach and proposes a second approach, based on more solid theoretical foundations, that can systematically account for the model uncertainties. Results from tests in a real setting show that the proposed approach is more consistent than the prior state-of-the-art. In both approaches, the tasks of detection and localisation are decoupled for complexity reasons. The localisation (hypothesis testing) is subject to a prior detection of brake squeal and identification of the squeal frequencies. The approaches used for the detection and identification of squeal frequencies are also presented. The paper, further, briefly addresses some practical issues related to array design and placement. (C) 2019 Author(s)
How to Train a CAT: Learning Canonical Appearance Transformations for Direct Visual Localization Under Illumination Change
Direct visual localization has recently enjoyed a resurgence in popularity
with the increasing availability of cheap mobile computing power. The
competitive accuracy and robustness of these algorithms compared to
state-of-the-art feature-based methods, as well as their natural ability to
yield dense maps, makes them an appealing choice for a variety of mobile
robotics applications. However, direct methods remain brittle in the face of
appearance change due to their underlying assumption of photometric
consistency, which is commonly violated in practice. In this paper, we propose
to mitigate this problem by training deep convolutional encoder-decoder models
to transform images of a scene such that they correspond to a previously-seen
canonical appearance. We validate our method in multiple environments and
illumination conditions using high-fidelity synthetic RGB-D datasets, and
integrate the trained models into a direct visual localization pipeline,
yielding improvements in visual odometry (VO) accuracy through time-varying
illumination conditions, as well as improved metric relocalization performance
under illumination change, where conventional methods normally fail. We further
provide a preliminary investigation of transfer learning from synthetic to real
environments in a localization context. An open-source implementation of our
method using PyTorch is available at https://github.com/utiasSTARS/cat-net.Comment: In IEEE Robotics and Automation Letters (RA-L) and presented at the
IEEE International Conference on Robotics and Automation (ICRA'18), Brisbane,
Australia, May 21-25, 201
Recommended from our members
Development and Demonstration of a TDOA-Based GNSS Interference Signal Localization System
Background theory, a reference design, and demonstration
results are given for a Global Navigation Satellite
System (GNSS) interference localization system comprising a
distributed radio-frequency sensor network that simultaneously
locates multiple interference sources by measuring their signals’
time difference of arrival (TDOA) between pairs of nodes in
the network. The end-to-end solution offered here draws from
previous work in single-emitter group delay estimation, very long
baseline interferometry, subspace-based estimation, radar, and
passive geolocation. Synchronization and automatic localization
of sensor nodes is achieved through a tightly-coupled receiver
architecture that enables phase-coherent and synchronous sampling
of the interference signals and so-called reference signals
which carry timing and positioning information. Signal and crosscorrelation
models are developed and implemented in a simulator.
Multiple-emitter subspace-based TDOA estimation techniques
are developed as well as emitter identification and localization
algorithms. Simulator performance is compared to the CramérRao
lower bound for single-emitter TDOA precision. Results are
given for a test exercise in which the system accurately locates
emitters broadcasting in the amateur radio band in Austin, TX.Aerospace Engineering and Engineering Mechanic
- …