1,993 research outputs found

    Subband particle filtering for speech enhancement

    Get PDF
    Journal ArticleABSTRACT Particle filters have recently been applied to speech enhancement when the input speech signal is modeled as a time-varying autoregressive process with stochastically evolving parameters. This type of modeling results in a nonlinear and conditionally Gaussian statespace system that is not amenable to analytical solutions. Prior work in this area involved signal processing in the fullband domain and assumed white Gaussian noise with known variance. This paper extends such ideas to subband domain particle filters and colored noise. Experimental results indicate that the subband particle filter achieves higher segmental SNR than the fullband algorithm and is effective in dealing with colored noise without increasing the computational complexity

    <strong>Non-Gaussian, Non-stationary and Nonlinear Signal Processing Methods - with Applications to Speech Processing and Channel Estimation</strong>

    Get PDF

    Signal enhancement using single and multi-sensor measurements

    Get PDF
    Includes bibliographical references (leaf 38).Research supported by in part by the Defense Advanced Research Projects Agency and monitored by the Office of Naval Research. N00014-89-J-1489 Research supported in part by Lockheed/Sanders, Inc. Research supported in part by the Office of Naval Research. N00014-90-J-1109E. Weinstein, A.V. Oppenheim and M. Feder

    Implementation and evaluation of a dual-sensor time-adaptive EM algorithm for signal enhancement

    Get PDF
    Submitted in partial fulfillment of the requirements for the degree of Master of Science at the Massachusetts Institute of Technology and the Woods Hole Oceanographic Institution August 1991This thesis describes the implementation and evaluation of an adaptive time-domain algorithm for signal enhancement from multiple-sensor observations. The algorithm is first derived as a noncausal time-domain algorithm, then converted into a causal, recursive form. A more computationally efficient gradient-based parameter estimation step is also presented. The results of several experiments using synthetic data are shown. These experiments first illustrate that the algorithm works on data meeting all the assumptions made by the algorithm, then provide a basis for comparing the performance of the algorithm against the performance of a noncausal frequency-domain algorithm solving the same problem. Finally, an evaluation is made of the performance of the simpler gradient-based parameter estimation step

    Modulation-domain speech enhancement using a kalman filter with a bayesian update of speech and noise in the log-spectral domain

    Get PDF
    We present a Bayesian estimator that performs log-spectrum esti- mation of both speech and noise, and is used as a Bayesian Kalman filter update step for single-channel speech enhancement in the mod- ulation domain. We use Kalman filtering in the log-power spectral domain rather than in the amplitude or power spectral domains. In the Bayesian Kalman filter update step, we define the posterior dis- tribution of the clean speech and noise log-power spectra as a two- dimensional multivariate Gaussian distribution. We utilize a Kalman filter observation constraint surface in the three-dimensional space, where the third dimension is the phase factor. We evaluate the re- sults of the phase-sensitive log-spectrum Kalman filter by comparing them with the results obtained by traditional noise suppression tech- niques and by an alternative Kalman filtering technique that assumes additivity of speech and noise in the power spectral domain

    Single Channel Speech Enhancement using Kalman Filter

    Get PDF
    The quality and intelligibility of speech conversation are generally degraded by the surrounding noises. The main objective of speech enhancement (SE) is to eliminate or reduce such disturbing noises from the degraded speech. Various SE methods have been proposed in literature. Among them, the Kalman filter (KF) is known to be an efficient SE method that uses the minimum mean square error (MMSE). However, most of the conventional KF based speech enhancement methods need access to clean speech and additive noise information for the state-space model parameters, namely, the linear prediction coefficients (LPCs) and the additive noise variance estimation, which is impractical in the sense that in practice, we can access only the noisy speech. Moreover, it is quite difficult to estimate these model parameters efficiently in the presence of adverse environmental noises. Therefore, the main focus of this thesis is to develop single channel speech enhancement algorithms using Kalman filter, where the model parameters are estimated in noisy conditions. Depending on these parameter estimation techniques, the proposed SE methods are classified into three approaches based on non-iterative, iterative, and sub-band iterative KF. In the first approach, a non-iterative Kalman filter based speech enhancement algorithm is presented, which operates on a frame-by-frame basis. In this proposed method, the state-space model parameters, namely, the LPCs and noise variance, are estimated first in noisy conditions. For LPC estimation, a combined speech smoothing and autocorrelation method is employed. A new method based on a lower-order truncated Taylor series approximation of the noisy speech along with a difference operation serving as high-pass filtering is introduced for the noise variance estimation. The non-iterative Kalman filter is then implemented with these estimated parameters effectively. In order to enhance the SE performance as well as parameter estimation accuracy in noisy conditions, an iterative Kalman filter based single channel SE method is proposed as the second approach, which also operates on a frame-by-frame basis. For each frame, the state-space model parameters of the KF are estimated through an iterative procedure. The Kalman filtering iteration is first applied to each noisy speech frame, reducing the noise component to a certain degree. At the end of this first iteration, the LPCs and other state-space model parameters are re-estimated using the processed speech frame and the Kalman filtering is repeated for the same processed frame. This iteration continues till the KF converges or a maximum number of iterations is reached, giving further enhanced speech frame. The same procedure will repeat for the following frames until the last noisy speech frame being processed. For further improving the speech enhancement performance, a sub-band iterative Kalman filter based SE method is also proposed as the third approach. A wavelet filter-bank is first used to decompose the noisy speech into a number of sub-bands. To achieve the best trade-off among the noise reduction, speech intelligibility and computational complexity, a partial reconstruction scheme based on consecutive mean squared error (CMSE) is proposed to synthesize the low-frequency (LF) and highfrequency (HF) sub-bands such that the iterative KF is employed only to the partially reconstructed HF sub-band speech. Finally, the enhanced HF sub-band speech is combined with the partially reconstructed LF sub-band speech to reconstruct the full-band enhanced speech. Experimental results have shown that the proposed KF based SE methods are capable of reducing adverse environmental noises for a wide range of input SNRs, and the overall performance of the proposed methods in terms of different evaluation metrics is superior to some existing state-of-the art SE methods
    corecore