Studies in Signal Processing Techniques for Speech Enhancement: A comparative study
Speech enhancement is essential for suppressing background noise, increasing speech intelligibility, and reducing listening fatigue. Speech enhancement algorithms range from simple ones such as spectral subtraction to complex ones such as Bayesian magnitude estimators based on Minimum Mean Square Error (MMSE) and its variants. Research is ongoing, and new algorithms continue to emerge for enhancing speech recorded against background noise in environments such as industrial sites, vehicles, and aircraft cockpits. In the aviation industry, speech enhancement plays a vital role in recovering crucial information from pilots' conversation after an incident or accident by suppressing engine and other cockpit instrument noise. This work proposes a new approach to speech enhancement that makes use of the harmonic wavelet transform and Bayesian estimators. The performance indicators, SNR and listening tests, confirm that the newly modified algorithms using the harmonic wavelet transform show better results than existing methods. Further, the harmonic wavelet transform is computationally efficient and simple to implement because of its built-in decimation-interpolation operations, compared with a filter-bank approach to realizing sub-bands.
Robust Bayesian target detection algorithm for depth imaging from sparse single-photon data
This paper presents a new Bayesian model and associated algorithm for depth
and intensity profiling using full waveforms from time-correlated single-photon
counting (TCSPC) measurements in the limit of very low photon counts (i.e.,
typically fewer than 20 photons per pixel). The model represents each Lidar
waveform as an unknown constant background level which, in the presence of a
target, is combined with a known impulse response weighted by the target
intensity, and is finally corrupted by Poisson noise. The joint target detection
and depth imaging problem is expressed as a pixel-wise model selection and
estimation problem which is solved using Bayesian inference. Prior knowledge
about the problem is embedded in a hierarchical model that describes the
dependence structure between the model parameters while accounting for their
constraints. In particular, Markov random fields (MRFs) are used to model the
joint distribution of the background levels and of the target presence labels,
which are both expected to exhibit significant spatial correlations. An
adaptive Markov chain Monte Carlo algorithm including reversible-jump updates
is then proposed to compute the Bayesian estimates of interest. This algorithm
is equipped with a stochastic optimization adaptation mechanism that
automatically adjusts the parameters of the MRFs by maximum marginal likelihood
estimation. Finally, the benefits of the proposed methodology are demonstrated
through a series of experiments using real data.
Comment: arXiv admin note: text overlap with arXiv:1507.0251
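The observation model in this abstract (constant background, plus a known impulse response scaled by the target intensity, observed through Poisson noise) can be illustrated with a minimal sketch. This is not the paper's algorithm: the function names, the toy impulse response, and the assumption that background and intensity are already known are hypothetical simplifications, and the hierarchical priors, MRFs, and reversible-jump MCMC updates are not reproduced. The sketch only picks the depth bin that maximizes the Poisson log-likelihood.

```python
import math

def expected_counts(num_bins, background, intensity, depth, impulse):
    """Mean photon count per time bin: background plus a shifted, scaled impulse response."""
    lam = [background] * num_bins
    for k, g in enumerate(impulse):
        t = depth + k
        if 0 <= t < num_bins:
            lam[t] += intensity * g
    return lam

def poisson_loglik(counts, lam):
    """Log-likelihood of integer counts under independent Poisson bins (requires lam > 0)."""
    return sum(y * math.log(l) - l - math.lgamma(y + 1) for y, l in zip(counts, lam))

def best_depth(counts, background, intensity, impulse):
    """Brute-force maximum-likelihood depth bin, assuming background and intensity are known."""
    num_bins = len(counts)
    candidates = range(num_bins - len(impulse) + 1)
    return max(
        candidates,
        key=lambda d: poisson_loglik(
            counts, expected_counts(num_bins, background, intensity, d, impulse)
        ),
    )

# Hypothetical sparse waveform: 5 photons in total, clustered around bins 3 to 5.
counts = [0, 0, 0, 1, 3, 1, 0, 0, 0, 0]
impulse = [0.2, 0.6, 0.2]  # assumed (toy) instrument response
depth = best_depth(counts, background=0.1, intensity=5.0, impulse=impulse)
```

In a Bayesian treatment such as the one the abstract describes, the background, intensity, and target-presence label would be unknowns with their own priors rather than fixed inputs.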
Improved speech presence probability estimation based on wavelet denoising
A reliable estimator for speech presence probability (SPP) can significantly improve the performance of many speech enhancement algorithms. Previous work showed that a good SPP estimator can be obtained by using a smooth a-posteriori signal-to-noise ratio (SNR) function, which can be achieved by reducing the noise variance when estimating the speech power spectrum. In this paper, a wavelet-based denoising algorithm is proposed for this purpose. We first apply the wavelet transform to the periodogram of a noisy speech signal to generate an oracle indicating the locations of the noise floor in the periodogram. We then use that oracle to selectively remove the wavelet coefficients of the noise floor in the log multitaper spectrum (MTS) of the noisy speech. The remaining wavelet coefficients are used to reconstruct a denoised MTS and, in turn, to generate a smooth a-posteriori SNR function. Simulation results show that the new SPP estimator outperforms the traditional approaches and enables a significant improvement in the quality and intelligibility of the enhanced speech. © 2012 IEEE.
Sequential joint signal detection and signal-to-noise ratio estimation
The sequential analysis of the problem of joint signal detection and
signal-to-noise ratio (SNR) estimation for a linear Gaussian observation model
is considered. The problem is posed as an optimization setup where the goal is
to minimize the number of samples required to achieve the desired (i) type I
and type II error probabilities and (ii) mean squared error performance. This
optimization problem is reduced to a more tractable formulation by transforming
the observed signal and noise sequences to a single sequence of Bernoulli
random variables; joint detection and estimation is then performed on the
Bernoulli sequence. This transformation renders the problem easily solvable,
and results in a computationally simpler sufficient statistic compared to the
one based on the (untransformed) observation sequences. Experimental results
demonstrate the advantages of the proposed method, making it feasible for
applications having strict constraints on data storage and computation.
Comment: 5 pages, Proceedings of IEEE International Conference on Acoustics,
Speech, and Signal Processing (ICASSP), 201
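The abstract does not give the transformation to Bernoulli variables or the resulting statistic, so the following is only a generic illustration of sequential testing on a Bernoulli sequence: Wald's classical sequential probability ratio test with the usual approximate thresholds. The function name and the hypothesized success probabilities `p0` and `p1` are assumptions made for the example, and the joint SNR estimation step is not covered.

```python
import math

def bernoulli_sprt(samples, p0, p1, alpha, beta):
    """Wald's SPRT for H0: p = p0 versus H1: p = p1 on a 0/1 sequence.

    alpha and beta are the target type I and type II error probabilities.
    Returns (decision, samples_used); decision is None if the data run out first.
    """
    upper = math.log((1 - beta) / alpha)   # cross above: accept H1
    lower = math.log(beta / (1 - alpha))   # cross below: accept H0
    llr = 0.0
    n = 0
    for n, x in enumerate(samples, start=1):
        # Log-likelihood-ratio increment for one Bernoulli observation.
        llr += math.log(p1 / p0) if x else math.log((1 - p1) / (1 - p0))
        if llr >= upper:
            return "H1", n
        if llr <= lower:
            return "H0", n
    return None, n

decision, used = bernoulli_sprt([1, 1, 1, 1, 1], p0=0.2, p1=0.8, alpha=0.05, beta=0.05)
```

The test stops as soon as the accumulated evidence crosses a threshold, which is what makes the sample count a quantity to be minimized, as in the problem statement above.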
Joint Bayesian endmember extraction and linear unmixing for hyperspectral imagery
This paper studies a fully Bayesian algorithm for endmember extraction and
abundance estimation for hyperspectral imagery. Each pixel of the hyperspectral
image is decomposed as a linear combination of pure endmember spectra following
the linear mixing model. The estimation of the unknown endmember spectra is
conducted in a unified manner by generating the posterior distribution of
abundances and endmember parameters under a hierarchical Bayesian model. This
model assumes conjugate prior distributions for these parameters, accounts for
non-negativity and full-additivity constraints, and exploits the fact that the
endmember proportions lie on a lower dimensional simplex. A Gibbs sampler is
proposed to overcome the complexity of evaluating the resulting posterior
distribution. This sampler generates samples distributed according to the
posterior distribution and estimates the unknown parameters using these
generated samples. The accuracy of the joint Bayesian estimator is illustrated
by simulations conducted on synthetic and real AVIRIS images.
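The linear mixing model and its two constraints (non-negativity and full additivity) can be made concrete with a small sketch. It is not the Gibbs sampler from the abstract: it assumes exactly two endmembers whose spectra are already known, in which case the constrained least-squares abundance has a closed form (projection of the pixel onto the segment joining the two spectra). The function names and the toy three-band spectra are hypothetical.

```python
def mix(abundances, endmembers):
    """Linear mixing model: pixel spectrum = sum_k a_k * m_k (noise-free)."""
    num_bands = len(endmembers[0])
    return [
        sum(a * m[band] for a, m in zip(abundances, endmembers))
        for band in range(num_bands)
    ]

def unmix_two_endmembers(pixel, m1, m2):
    """Least-squares abundances (a1, a2) with a1, a2 >= 0 and a1 + a2 = 1."""
    diff = [a - b for a, b in zip(m1, m2)]
    denom = sum(d * d for d in diff)
    if denom == 0:
        raise ValueError("endmember spectra must be distinct")
    # Unconstrained solution along the line m2 + a1 * (m1 - m2) ...
    raw = sum((y - b) * d for y, b, d in zip(pixel, m2, diff)) / denom
    # ... clipped to [0, 1] so both abundances stay on the simplex.
    a1 = min(1.0, max(0.0, raw))
    return a1, 1.0 - a1

m1 = [1.0, 0.0, 0.0]  # toy endmember spectra (assumed known here)
m2 = [0.0, 1.0, 0.0]
pixel = mix([0.3, 0.7], [m1, m2])
a1, a2 = unmix_two_endmembers(pixel, m1, m2)
```

With more endmembers, or with unknown endmember spectra as in the abstract, no such closed form exists, which is one reason a sampling approach is attractive.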
Wavelet Packet Transform based Speech Enhancement via Two-Dimensional SPP Estimator with Generalized Gamma Priors
Although various speech enhancement techniques have been developed for different applications, existing methods are limited in noisy environments with high ambient noise levels. Speech presence probability (SPP) estimation is a speech enhancement technique that reduces speech distortion, especially in low signal-to-noise ratio (SNR) scenarios. In this paper, we propose a new two-dimensional (2D) Teager energy operator (TEO) improved SPP estimator for speech enhancement in the time-frequency (T-F) domain. The wavelet packet transform (WPT), a multiband decomposition technique, is used to concentrate the energy distribution of speech components. A minimum mean-square error (MMSE) estimator is obtained based on the generalized gamma distribution speech model in the WPT domain. In addition, speech samples corrupted by environmental and occupational noise (i.e., machine shop, factory and station) at different input SNRs are used to validate the proposed algorithm. Results suggest that the proposed method achieves a significant enhancement in perceptual quality compared with four conventional speech enhancement algorithms (i.e., MMSE-84, MMSE-04, Wiener-96, and BTW).
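For readers unfamiliar with the Teager energy operator named above, here is a minimal sketch of its standard one-dimensional discrete form, psi[n] = x[n]^2 - x[n-1] * x[n+1]. The abstract does not specify how the paper's two-dimensional variant is defined or how it enters the SPP estimator, so neither is attempted here; the function name is an assumption.

```python
import math

def teager_energy(x):
    """Discrete Teager energy operator: psi[n] = x[n]**2 - x[n-1] * x[n+1].

    The output is two samples shorter than the input because the first and
    last samples have no neighbor on one side.
    """
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]

# For a pure tone A*cos(w*n + phi) the operator returns the constant
# A**2 * sin(w)**2, so it tracks both amplitude and frequency.
amplitude, omega = 2.0, 0.3
tone = [amplitude * math.cos(omega * n + 0.1) for n in range(50)]
energy = teager_energy(tone)
```

Because the output grows with both amplitude and frequency, the operator tends to emphasize speech-like components over slowly varying noise, which is a common motivation for using it in speech activity measures.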