23 research outputs found

    Low-delay nonuniform pseudo-QMF banks with application to speech enhancement

    Get PDF
    Journal ArticleAbstract-This paper presents a method for designing low-delay nonuniform pseudo quadrature mirror filter (QMF) banks. This method is motivated by the work of Li, Nguyen, and Tantaratana, in which the nonuniform filter bank is realized by combining an appropriate number of adjacent sub-bands of a uniform pseudo-QMF bank. In prior work, the prototype filter of the uniform pseudo-QMF bank was constrained to have linear phase and the overall delay associated with the filter bank was often unacceptably large for filter banks with a large number of sub-bands. This paper proposes a pseudo-QMF filter bank design technique that significantly reduces the delay by relaxing the linear phase constraints. An example in which an oversampled critical-band nonuniform filter bank is designed and applied to a two-state modeling speech enhancement system is presented in this paper. Comparison of the performance of this system to competing methods employing tree-structured, linear phase multiresolution analysis indicates that the approach described in this paper strikes a good balance between system performance and low delay

    Design of low-delay nonuniform pseudo QMF banks

    Get PDF
    Journal ArticleABSTRACT This paper presents a method for designing low-delay nonuniform pseudo QMF banks. The method is motivated by the work of Li, Nguyen and Tantaratana, in which the nonuniform filter bank is realized by combining an appropriate number of adjacent subbands of a uniform pseudo QMF filter bank. In prior work, the prototype filter of the uniform pseudo QMF is constrained to have linear phase and the overall delay associated with the filter bank was often unacceptably large for filter banks with a large number of subbands. By relaxing the linear phase constraints, this paper proposes a pseudo QMF filter bank design technique that significantly reduces the delay. An example that experimentally verifies the capabilities of the design technique is presented

    Efficient Multiband Algorithms for Blind Source Separation

    Get PDF
    The problem of blind separation refers to recovering original signals, called source signals, from the mixed signals, called observation signals, in a reverberant environment. The mixture is a function of a sequence of original speech signals mixed in a reverberant room. The objective is to separate mixed signals to obtain the original signals without degradation and without prior information of the features of the sources. The strategy used to achieve this objective is to use multiple bands that work at a lower rate, have less computational cost and a quicker convergence than the conventional scheme. Our motivation is the competitive results of unequal-passbands scheme applications, in terms of the convergence speed. The objective of this research is to improve unequal-passbands schemes by improving the speed of convergence and reducing the computational cost. The first proposed work is a novel maximally decimated unequal-passbands scheme.This scheme uses multiple bands that make it work at a reduced sampling rate, and low computational cost. An adaptation approach is derived with an adaptation step that improved the convergence speed. The performance of the proposed scheme was measured in different ways. First, the mean square errors of various bands are measured and the results are compared to a maximally decimated equal-passbands scheme, which is currently the best performing method. The results show that the proposed scheme has a faster convergence rate than the maximally decimated equal-passbands scheme. Second, when the scheme is tested for white and coloured inputs using a low number of bands, it does not yield good results; but when the number of bands is increased, the speed of convergence is enhanced. Third, the scheme is tested for quick changes. It is shown that the performance of the proposed scheme is similar to that of the equal-passbands scheme. Fourth, the scheme is also tested in a stationary state. The experimental results confirm the theoretical work. For more challenging scenarios, an unequal-passbands scheme with over-sampled decimation is proposed; the greater number of bands, the more efficient the separation. The results are compared to the currently best performing method. Second, an experimental comparison is made between the proposed multiband scheme and the conventional scheme. The results show that the convergence speed and the signal-to-interference ratio of the proposed scheme are higher than that of the conventional scheme, and the computation cost is lower than that of the conventional scheme

    Perceptual models in speech quality assessment and coding

    Get PDF
    The ever-increasing demand for good communications/toll quality speech has created a renewed interest into the perceptual impact of rate compression. Two general areas are investigated in this work, namely speech quality assessment and speech coding. In the field of speech quality assessment, a model is developed which simulates the processing stages of the peripheral auditory system. At the output of the model a "running" auditory spectrum is obtained. This represents the auditory (spectral) equivalent of any acoustic sound such as speech. Auditory spectra from coded speech segments serve as inputs to a second model. This model simulates the information centre in the brain which performs the speech quality assessment. [Continues.

    Subband Adaptive Modeling of Digital Hearing Aids

    Get PDF
    In this thesis, the application of a subband adaptive model to characterize compression behaviour of five digital hearing aids is investigated. Using a signal-to-error ratio metric, modeling performance is determined by varying the number of analysis bands in the subband structure as well as consideration of three adaptive algorithms. The normalized least mean-squares (NLMS), the affine projection algorithm (APA), and the recursive least-squares (RLS) algorithms are employed using a range of parameters to determine the impact on modeling performance. Using the subband adaptive model to estimate the time-varying frequency response of each hearing aid allows the Perceptual Evaluation of Speech Quality (PESQ) mean-opinion score (MOS) to be computed. The PESQ MOS facilitates an estimation of a subjective assessment of speech quality using an objective score. Initial results suggest the PESQ MOS score is able to differentiate speech processed by hearing aids allowing them to be ranked accordingly. Further work is required to obtain subjective assessments of the processed speech signals and determine if possible correlations exist

    Asynchronous Representation and Processing of Analog Sparse Signals Using a Time-Scale Framework

    Get PDF
    In this dissertation we investigate the problem of asynchronous representation and processing of analog sparse signals using a time-scale framework. Recently, in the design of signal representations the focus has been on the use of application-driven constraints for optimality purposes. Appearing in many fields such as neuroscience, implantable biomedical diagnostic devices, and sensor network applications, sparse or burst--like signals are of great interest. A common challenge in the representation of such signals is that they exhibit non--stationary behavior with frequency--varying spectra. By ignoring that the maximum frequency of their spectra is changing with time, uniformly sampling sparse signals collects samples in quiescent segments and results in high power dissipation. Also, continuous monitoring of signals challenges data acquisition, storage, and processing; especially if remote monitoring is desired, as this would require that a large number of samples be generated, stored and transmitted. Power consumption and the type of processing imposed by the size of the devices in the aforementioned applications has motivated the use of asynchronous approaches in our research. First, we work on establishing a new paradigm for the representation of analog sparse signals using a time-frequency representation. Second, we develop a scale-based signal decomposition framework which uses filter-bank structures for the representation-analysis-compression scheme of the sparse information. Using an asynchronous signal decomposition scheme leads to reduced computational requirements and lower power consumption; thus it is promising for hardware implementation. In addition, the proposed algorithm does not require prior knowledge of the bandwidth of the signal and the effect of noise can still be alleviated. Finally, we consider the synthesis step, where the target signal is reconstructed from compressed data. We implement a perfect reconstruction filter bank based on Slepian wavelets to use in the reconstruction of sparse signals from non--uniform samples. In this work, experiments on primary biomedical signal applications, such as electrocardiogram (EEG), swallowing signals and heart sound recordings have achieved significant improvements over traditional methods in the sensing and processing of sparse data. The results are also promising in applications including compression and denoising
    corecore