11,120 research outputs found
Probabilistic Modeling Paradigms for Audio Source Separation
This is the author's final version of the article, first published as E. Vincent, M. G. Jafari, S. A. Abdallah, M. D. Plumbley, M. E. Davies. Probabilistic Modeling Paradigms for Audio Source Separation. In W. Wang (Ed), Machine Audition: Principles, Algorithms and Systems. Chapter 7, pp. 162-185. IGI Global, 2011. ISBN 978-1-61520-919-4. DOI: 10.4018/978-1-61520-919-4.ch007file: VincentJafariAbdallahPD11-probabilistic.pdf:v\VincentJafariAbdallahPD11-probabilistic.pdf:PDF owner: markp timestamp: 2011.02.04file: VincentJafariAbdallahPD11-probabilistic.pdf:v\VincentJafariAbdallahPD11-probabilistic.pdf:PDF owner: markp timestamp: 2011.02.04Most sound scenes result from the superposition of several sources, which can be separately perceived and analyzed by human listeners. Source separation aims to provide machine listeners with similar skills by extracting the sounds of individual sources from a given scene. Existing separation systems operate either by emulating the human auditory system or by inferring the parameters of probabilistic sound models. In this chapter, the authors focus on the latter approach and provide a joint overview of established and recent models, including independent component analysis, local time-frequency models and spectral template-based models. They show that most models are instances of one of the following two general paradigms: linear modeling or variance modeling. They compare the merits of either paradigm and report objective performance figures. They also,conclude by discussing promising combinations of probabilistic priors and inference algorithms that could form the basis of future state-of-the-art systems
Semi-blind speech-music separation using sparsity and continuity priors
In this paper we propose an approach for the problem of single channel source separation of speech and music signals. Our approach is based on representing each source's power spectral density using dictionaries and nonlinearly projecting the mixture signal spectrum onto the combined span of the dictionary entries. We encourage sparsity and continuity of the dictionary coefficients using penalty terms (or log-priors) in an optimization framework. We propose to use a novel coordinate descent technique for optimization, which nicely handles nonnegativity constraints and nonquadratic penalty terms. We use an adaptive Wiener filter, and spectral subtraction to reconstruct both of the sources from the mixture data after corresponding power spectral densities (PSDs) are estimated for each source. Using conventional metrics, we measure the performance of the system on simulated mixtures of single person speech and piano music sources. The results indicate that the proposed method is a promising technique for low speech-to-music ratio conditions and that sparsity and continuity priors help improve the performance of the proposed system
Adaptive Langevin Sampler for Separation of t-Distribution Modelled Astrophysical Maps
We propose to model the image differentials of astrophysical source maps by
Student's t-distribution and to use them in the Bayesian source separation
method as priors. We introduce an efficient Markov Chain Monte Carlo (MCMC)
sampling scheme to unmix the astrophysical sources and describe the derivation
details. In this scheme, we use the Langevin stochastic equation for
transitions, which enables parallel drawing of random samples from the
posterior, and reduces the computation time significantly (by two orders of
magnitude). In addition, Student's t-distribution parameters are updated
throughout the iterations. The results on astrophysical source separation are
assessed with two performance criteria defined in the pixel and the frequency
domains.Comment: 12 pages, 6 figure
Power-law Template for Infrared Point-source Clustering
We perform a combined fit to angular power spectra of unresolved infrared (IR) point sources from the Planck
satellite (at 217, 353, 545, and 857 GHz, over angular scales 100 ≾ ℓ ≾ 2200), the Balloon-borne Large-Aperture
Submillimeter Telescope (BLAST; 250, 350, and 500μm; 1000 ≾ ℓ ≾ 9000), and from correlating BLAST and Atacama Cosmology Telescope (ACT; 148 and 218 GHz) maps. We find that the clustered power over the range of angular scales and frequencies considered is well fitted by a simple power law of the form C^(clust)_ℓ ∝ ℓ^(-n) with n = 1.25 ± 0.06. While the IR sources are understood to lie at a range of redshifts, with a variety of dust properties, we find that the frequency dependence of the clustering power can be described by the square of a modified blackbody, ν^(β)B(ν, T_(eff)), with a single emissivity index β = 2.20 ± 0.07 and effective temperature T_(eff) = 9.7 K. Our predictions for the clustering amplitude are consistent with existing ACT and South Pole Telescope results at around 150 and 220 GHz, as is our prediction for the effective dust spectral index, which we find to be α_(150–220) = 3.68±0.07 between 150 and 220 GHz. Our constraints on the clustering shape and frequency dependence can be used to model the IR clustering as a contaminant in cosmic microwave background anisotropy measurements. The combined Planck and BLAST data also rule out a linear bias clustering model
Compact source detection in multi-channel microwave surveys: from SZ clusters to polarized sources
In this paper we describe the state-of-the art status of multi-frequency
detection techniques for compact sources in microwave astronomy. From the
simplest cases where the spectral behaviour is well-known (i.e. thermal SZ
clusters) to the more complex cases where there is little a priori information
(i.e. polarized radio sources) we will review the main advances and the most
recent results in the detection problem.Comment: 13 pages, 4 figures. Accepted for publication in the Special Issue
"Astrophysical Foregrounds in Microwave Surveys" of the journal Advances in
Astronom
Bayesian separation of spectral sources under non-negativity and full additivity constraints
This paper addresses the problem of separating spectral sources which are
linearly mixed with unknown proportions. The main difficulty of the problem is
to ensure the full additivity (sum-to-one) of the mixing coefficients and
non-negativity of sources and mixing coefficients. A Bayesian estimation
approach based on Gamma priors was recently proposed to handle the
non-negativity constraints in a linear mixture model. However, incorporating
the full additivity constraint requires further developments. This paper
studies a new hierarchical Bayesian model appropriate to the non-negativity and
sum-to-one constraints associated to the regressors and regression coefficients
of linear mixtures. The estimation of the unknown parameters of this model is
performed using samples generated using an appropriate Gibbs sampler. The
performance of the proposed algorithm is evaluated through simulation results
conducted on synthetic mixture models. The proposed approach is also applied to
the processing of multicomponent chemical mixtures resulting from Raman
spectroscopy.Comment: v4: minor grammatical changes; Signal Processing, 200
Sparse Gaussian Process Audio Source Separation Using Spectrum Priors in the Time-Domain
Gaussian process (GP) audio source separation is a time-domain approach that
circumvents the inherent phase approximation issue of spectrogram based
methods. Furthermore, through its kernel, GPs elegantly incorporate prior
knowledge about the sources into the separation model. Despite these compelling
advantages, the computational complexity of GP inference scales cubically with
the number of audio samples. As a result, source separation GP models have been
restricted to the analysis of short audio frames. We introduce an efficient
application of GPs to time-domain audio source separation, without compromising
performance. For this purpose, we used GP regression, together with spectral
mixture kernels, and variational sparse GPs. We compared our method with
LD-PSDTF (positive semi-definite tensor factorization), KL-NMF
(Kullback-Leibler non-negative matrix factorization), and IS-NMF (Itakura-Saito
NMF). Results show that the proposed method outperforms these techniques.Comment: Paper submitted to the 44th International Conference on Acoustics,
Speech, and Signal Processing, ICASSP 2019. To be held in Brighton, United
Kingdom, between May 12 and May 17, 201
Component separation methods for the Planck mission
The Planck satellite will map the full sky at nine frequencies from 30 to 857
GHz. The CMB intensity and polarization that are its prime targets are
contaminated by foreground emission. The goal of this paper is to compare
proposed methods for separating CMB from foregrounds based on their different
spectral and spatial characteristics, and to separate the foregrounds into
components of different physical origin. A component separation challenge has
been organized, based on a set of realistically complex simulations of sky
emission. Several methods including those based on internal template
subtraction, maximum entropy method, parametric method, spatial and harmonic
cross correlation methods, and independent component analysis have been tested.
Different methods proved to be effective in cleaning the CMB maps from
foreground contamination, in reconstructing maps of diffuse Galactic emissions,
and in detecting point sources and thermal Sunyaev-Zeldovich signals. The power
spectrum of the residuals is, on the largest scales, four orders of magnitude
lower than that of the input Galaxy power spectrum at the foreground minimum.
The CMB power spectrum was accurately recovered up to the sixth acoustic peak.
The point source detection limit reaches 100 mJy, and about 2300 clusters are
detected via the thermal SZ effect on two thirds of the sky. We have found that
no single method performs best for all scientific objectives. We foresee that
the final component separation pipeline for Planck will involve a combination
of methods and iterations between processing steps targeted at different
objectives such as diffuse component separation, spectral estimation and
compact source extraction.Comment: Matches version accepted by A&A. A version with high resolution
figures is available at http://people.sissa.it/~leach/compsepcomp.pd
Semi-blind Bayesian inference of CMB map and power spectrum
We present a new blind formulation of the Cosmic Microwave Background (CMB)
inference problem. The approach relies on a phenomenological model of the
multi-frequency microwave sky without the need for physical models of the
individual components. For all-sky and high resolution data, it unifies parts
of the analysis that have previously been treated separately, such as component
separation and power spectrum inference. We describe an efficient sampling
scheme that fully explores the component separation uncertainties on the
inferred CMB products such as maps and/or power spectra. External information
about individual components can be incorporated as a prior giving a flexible
way to progressively and continuously introduce physical component separation
from a maximally blind approach. We connect our Bayesian formalism to existing
approaches such as Commander, SMICA and ILC, and discuss possible future
extensions.Comment: 11 pages, 9 figure
- …