19 research outputs found

    Single channel speech separation with a frame-based pitch range estimation method in modulation frequency

    Computational Auditory Scene Analysis (CASA) has attracted considerable interest for segregating speech from monaural mixtures. In this paper, we propose a new method for single-channel speech separation with frame-based pitch range estimation in the modulation frequency domain. This range is estimated in each frame of the modulation spectrum of speech by analyzing onsets and offsets. In the proposed method, the target speaker is separated from the interfering speaker by filtering the mixture signal with a mask extracted from the modulation spectrogram of the mixture signal. Systematic evaluation shows an acceptable level of separation compared with classic methods.

    Spectral Domain Speech Enhancement Using HMM State-Dependent Super-Gaussian Priors

    Adaptive Speech Enhancement Using Partial Differential Equations and Back Propagation Neural Networks

    In this work, we propose a new approach to improve the performance of a speech enhancement technique based on partial differential equations. Real-world noise is highly random in nature, so we focus on reducing white Gaussian noise. The proposed method was evaluated on several speakers. The subjective and objective results show that the new method substantially improves speech enhancement. Comparisons of several methods are reported.

    The Application of Nonlinear Spectral Subtraction Method on Millimeter Wave Conducted Speech Enhancement

    A nonlinear multiband spectral subtraction method is investigated in this study to reduce the colored electronic noise in millimeter wave (MMW) radar conducted speech. Because the over-subtraction factor of each Bark frequency band can be adaptively adjusted, the nonuniform effects of colored noise on the spectrum of the MMW radar speech can be taken into account in the enhancement process. Both the time-frequency distribution analysis and the perceptual evaluation test suggest that better noise reduction is obtained across the whole frequency range and that the perceptually annoying musical noise is efficiently reduced, with little distortion to speech information, compared with the other standard speech enhancement algorithm.
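The idea of a per-band over-subtraction factor can be illustrated with a minimal sketch. It is not the paper's implementation. The function names, the `band_edges` argument (bin indices standing in for Bark band boundaries), the spectral floor value, and the Berouti-style rule that maps band SNR to the factor are all assumptions made for illustration.

```python
import numpy as np

def oversubtraction_factor(snr_db):
    """Assumed Berouti-style rule: subtract more aggressively at low band SNR."""
    return float(np.clip(4.0 - 0.15 * snr_db, 1.0, 4.75))

def multiband_spectral_subtraction(noisy_power, noise_power, band_edges, floor=0.002):
    """Subtract a noise power estimate from one frame, band by band.

    noisy_power, noise_power: 1-D power spectra of the same length.
    band_edges: increasing bin indices; band i covers bins
                band_edges[i] .. band_edges[i + 1] - 1 (hypothetical layout).
    floor: fraction of the noisy power kept as a spectral floor.
    """
    noisy_power = np.asarray(noisy_power, dtype=float)
    noise_power = np.asarray(noise_power, dtype=float)
    enhanced = noisy_power.copy()  # bins outside all bands are left unchanged
    for lo, hi in zip(band_edges[:-1], band_edges[1:]):
        y = noisy_power[lo:hi]
        n = noise_power[lo:hi]
        # Band SNR in dB, guarded against division by zero and log of zero.
        snr_db = 10.0 * np.log10(max(y.sum(), 1e-12) / max(n.sum(), 1e-12))
        alpha = oversubtraction_factor(snr_db)
        # Over-subtract, then clamp to the floor to avoid negative power.
        enhanced[lo:hi] = np.maximum(y - alpha * n, floor * y)
    return enhanced

frame = multiband_spectral_subtraction(
    noisy_power=[10.0, 10.0, 2.0, 2.0],
    noise_power=[1.0, 1.0, 1.0, 1.0],
    band_edges=[0, 2, 4],
)
```

In this toy frame the first band has a high SNR and is only lightly reduced, while the second band has a low SNR, receives a larger factor, and is clamped to the spectral floor. Such a floor is commonly used to limit musical noise.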