450 research outputs found
System Identification with Applications in Speech Enhancement
As the increasing popularity of integrating hands-free telephony on mobile portable devices
and the rapid development of voice over internet protocol, identification of acoustic
systems has become desirable for compensating distortions introduced to speech signals
during transmission, and hence enhancing the speech quality. The objective of this research
is to develop system identification algorithms for speech enhancement applications
including network echo cancellation and speech dereverberation.
A supervised adaptive algorithm for sparse system identification is developed for
network echo cancellation. Based on the framework of selective-tap updating scheme
on the normalized least mean squares algorithm, the MMax and sparse partial update
tap-selection strategies are exploited in the frequency domain to achieve fast convergence
performance with low computational complexity. Through demonstrating how
the sparseness of the network impulse response varies in the transformed domain, the
multidelay filtering structure is incorporated to reduce the algorithmic delay.
Blind identification of SIMO acoustic systems for speech dereverberation in the
presence of common zeros is then investigated. First, the problem of common zeros is
defined and extended to include the presence of near-common zeros. Two clustering algorithms
are developed to quantify the number of these zeros so as to facilitate the study
of their effect on blind system identification and speech dereverberation. To mitigate such
effect, two algorithms are developed where the two-stage algorithm based on channel
decomposition identifies common and non-common zeros sequentially; and the forced
spectral diversity approach combines spectral shaping filters and channel undermodelling
for deriving a modified system that leads to an improved dereverberation performance.
Additionally, a solution to the scale factor ambiguity problem in subband-based blind system identification is developed, which motivates further research on subbandbased
dereverberation techniques. Comprehensive simulations and discussions demonstrate
the effectiveness of the aforementioned algorithms. A discussion on possible directions
of prospective research on system identification techniques concludes this thesis
Wavelets, ridgelets and curvelets on the sphere
We present in this paper new multiscale transforms on the sphere, namely the
isotropic undecimated wavelet transform, the pyramidal wavelet transform, the
ridgelet transform and the curvelet transform. All of these transforms can be
inverted i.e. we can exactly reconstruct the original data from its
coefficients in either representation. Several applications are described. We
show how these transforms can be used in denoising and especially in a Combined
Filtering Method, which uses both the wavelet and the curvelet transforms, thus
benefiting from the advantages of both transforms. An application to component
separation from multichannel data mapped to the sphere is also described in
which we take advantage of moving to a wavelet representation.Comment: Accepted for publication in A&A. Manuscript with all figures can be
downloaded at http://jstarck.free.fr/aa_sphere05.pd
Underwater target detection using multichannel subband adaptive filtering and high-order correlation schemes
Includes bibliographical references.In this paper, new pre- and post-processing schemes are developed to process shallow-water sonar data to improve the accuracy of target detection. A multichannel subband adaptive filtering is applied to preprocess the data in order to isolate the potential target returns from the acoustic backscattered signals and improve the signal-to-reverberation ratio. This is done by estimating the time delays associated with the reflections in different subbands. The preprocessed results are then beamformed to generate an image for each ping of the sonar. The testing results on both the simulated and real data revealed the efficiency of this scheme in time-delay estimation and its capability in removing most of the competing reverberations and noise. To improve detection rate while significantly minimizing the incident of false detections, a high-order correlation (HOC) method for postprocessing the beamformed images is then developed. This method determines the consistency in occurrence of the target returns in several consecutive pings. The application of the HOC process to the real beamformed sonar data showed the ability of this method for removing the clutter and at the same time boosting the target returns in several consecutive pings. The algorithm is simple, fast, and easy to implement.This work was supported by the Office of Naval Research (ONR 321TS) under Contract N61331-94-K-0018
A nonlinear Stein based estimator for multichannel image denoising
The use of multicomponent images has become widespread with the improvement
of multisensor systems having increased spatial and spectral resolutions.
However, the observed images are often corrupted by an additive Gaussian noise.
In this paper, we are interested in multichannel image denoising based on a
multiscale representation of the images. A multivariate statistical approach is
adopted to take into account both the spatial and the inter-component
correlations existing between the different wavelet subbands. More precisely,
we propose a new parametric nonlinear estimator which generalizes many reported
denoising methods. The derivation of the optimal parameters is achieved by
applying Stein's principle in the multivariate case. Experiments performed on
multispectral remote sensing images clearly indicate that our method
outperforms conventional wavelet denoising technique
Design of FIR paraunitary filter banks for subband coding using a polynomial eigenvalue decomposition
The problem of paraunitary filter bank design for subband coding has received considerable attention in recent years, not least because of the energy preserving property of this class of filter banks. In this paper, we consider the design of signal-adapted, finite impulse response (FIR), paraunitary filter banks using polynomial matrix EVD (PEVD) techniques. Modifications are proposed to an iterative, time-domain PEVD method, known as the sequential best rotation (SBR2) algorithm, which enables its effective application to the problem of FIR orthonormal filter bank design for efficient subband coding. By choosing an optimisation scheme that maximises the coding gain at each stage of the algorithm, it is shown that the resulting filter bank behaves more and more like the infiniteorder principle component filter bank (PCFB). The proposed method is compared to state-of-the-art techniques, namely the iterative greedy algorithm (IGA), the approximate EVD (AEVD), standard SBR2 and a fast algorithm for FIR compaction filter design, called the window method (WM). We demonstrate that for the calculation of the subband coder, the WM approach offers a low-cost alternative at lower coding gains, while at moderate to high complexity, the proposed approach outperforms the benchmarkers. In terms of run-time complexity, AEVD performs well at low orders, while the proposed algorithm offers a better coding gain than the benchmarkers at moderate to high filter order for a number of simulation scenarios
Blind MultiChannel Identification and Equalization for Dereverberation and Noise Reduction based on Convolutive Transfer Function
This paper addresses the problems of blind channel identification and
multichannel equalization for speech dereverberation and noise reduction. The
time-domain cross-relation method is not suitable for blind room impulse
response identification, due to the near-common zeros of the long impulse
responses. We extend the cross-relation method to the short-time Fourier
transform (STFT) domain, in which the time-domain impulse responses are
approximately represented by the convolutive transfer functions (CTFs) with
much less coefficients. The CTFs suffer from the common zeros caused by the
oversampled STFT. We propose to identify CTFs based on the STFT with the
oversampled signals and the critical sampled CTFs, which is a good compromise
between the frequency aliasing of the signals and the common zeros problem of
CTFs. In addition, a normalization of the CTFs is proposed to remove the gain
ambiguity across sub-bands. In the STFT domain, the identified CTFs is used for
multichannel equalization, in which the sparsity of speech signals is
exploited. We propose to perform inverse filtering by minimizing the
-norm of the source signal with the relaxed -norm fitting error
between the micophone signals and the convolution of the estimated source
signal and the CTFs used as a constraint. This method is advantageous in that
the noise can be reduced by relaxing the -norm to a tolerance
corresponding to the noise power, and the tolerance can be automatically set.
The experiments confirm the efficiency of the proposed method even under
conditions with high reverberation levels and intense noise.Comment: 13 pages, 5 figures, 5 table
Spherical microphone array acoustic rake receivers
Several signal independent acoustic rake receivers are proposed for speech dereverberation using spherical microphone arrays. The proposed rake designs take advantage of multipaths, by separately capturing and combining early reflections with the direct path. We investigate several approaches in combining reflections with the direct path source signal, including the development of beam patterns that point nulls at all preceding reflections. The proposed designs are tested in experimental simulations and their dereverberation performances evaluated using objective measures. For the tested configuration, the proposed designs achieve higher levels of dereverberation compared to conventional signal independent beamforming systems; achieving up to 3.6 dB improvement in the direct-to-reverberant ratio over the plane-wave decomposition beamformer
- …