7,549 research outputs found
A Multidelay Double-Talk Detector Combined with the MDF Adaptive Filter
The multidelay block frequency-domain (MDF) adaptive filter is an excellent candidate for both acoustic and network echo cancellation. There is a need for a very good double-talk detector (DTD) to be combined efficiently with the MDF algorithm. Recently, a DTD based on a normalized cross-correlation vector was proposed and it was shown that this DTD performs much better than the Geigel algorithm and other DTDs based on the cross-correlation coefficient. In this paper, we show how to extend the definition of a normalized cross-correlation vector in the frequency domain for the general case where the block size of the Fourier transform is smaller than the length of the adaptive filter. The resulting DTD has an MDF structure, which makes it easy to implement, and a good fit with an echo canceler based on the MDF algorithm. We also analyze resource requirements (computational complexity and memory requirement) and compare the MDF algorithm with the normalized least mean square algorithm (NLMS) from this point of view.</p
Simple and efficient solutions to the problems associated with acoustic echo cancellation
This dissertation is a collection of papers that addresses several important problems associated with acoustic/line echo cancellation (AEC/LEC), specifically double-talk and echo-path change detection. A double-talk detector is used to freeze AEC filter\u27s adaptation during periods of near-end speech. This dissertation presents three different novel double-talk detection schemes. Simulations demonstrate the efficiency of the proposed algorithms --Abstract, page iii
Analysis of Seeing-Induced Polarization Cross-Talk and Modulation Scheme Performance
We analyze the generation of polarization cross-talk in Stokes polarimeters
by atmospheric seeing, and its effects on the noise statistics of
spectropolarimetric measurements for both single-beam and dual-beam
instruments. We investigate the time evolution of seeing-induced correlations
between different states of one modulation cycle, and compare the response to
these correlations of two popular polarization modulation schemes in a
dual-beam system. Extension of the formalism to encompass an arbitrary number
of modulation cycles enables us to compare our results with earlier work. Even
though we discuss examples pertinent to solar physics, the general treatment of
the subject and its fundamental results might be useful to a wider community.Comment: 33 pages, 7 figures; accepted in Astrophys.
Normalized Double-Talk Detection Based on Microphone and AEC Error Cross-correlation
In this paper, we present two different double-talk detection schemes for Acoustic Echo Cancellation (AEC). First, we present a novel normalized detection statistic based on the cross-correlation coefficient between the microphone signal and the cancellation error. The decision statistic is designed in such a way that it meets the needs of an optimal double-talk detector. We also show that the proposed detection statistic converges to the recently proposed normalized cross-correlation based double-talk detector, the best known cross-correlation based detector. Next, we present a new hybrid double-talk detection scheme based on a cross-correlation coefficient and two signal detectors. The hybrid algorithm not only detects double-talk but also detects and tracks any echo-path variations efficiently. We compare our results with other cross-correlation based double-talk detectors to show their effectiveness
Quantum Measurement Theory in Gravitational-Wave Detectors
The fast progress in improving the sensitivity of the gravitational-wave (GW)
detectors, we all have witnessed in the recent years, has propelled the
scientific community to the point, when quantum behaviour of such immense
measurement devices as kilometer-long interferometers starts to matter. The
time, when their sensitivity will be mainly limited by the quantum noise of
light is round the corner, and finding the ways to reduce it will become a
necessity. Therefore, the primary goal we pursued in this review was to
familiarize a broad spectrum of readers with the theory of quantum measurements
in the very form it finds application in the area of gravitational-wave
detection. We focus on how quantum noise arises in gravitational-wave
interferometers and what limitations it imposes on the achievable sensitivity.
We start from the very basic concepts and gradually advance to the general
linear quantum measurement theory and its application to the calculation of
quantum noise in the contemporary and planned interferometric detectors of
gravitational radiation of the first and second generation. Special attention
is paid to the concept of Standard Quantum Limit and the methods of its
surmounting.Comment: 147 pages, 46 figures, 1 table. Published in Living Reviews in
Relativit
Speckle interferometry
We have presented the basic mathematical treatment of interferometry in the
optical domain. Its applications in astronomical observations using both the
single aperture, as well as the diluted apertures are described in detail. We
have also described about the shortcomings of this technique in the presence of
Earth's atmosphere. A short descriptions of the atmospheric turbulence and its
effect on the flat wavefront from a stellar source is given. The formation of
speckle which acts as carrier of information is defined. Laboratory experiments
with phase modulation screens, as well as the resultant intensity distributions
due to point source are demonstrated. The experimental method to freeze the
speckles, as well as data processing techniques for both Fourier modulus and
Fourier phase are described. We have also discussed the technique of the
aperture synthesis using non-redundant aperture masks at the pupil plane of the
telescope, emphasizing set on the comparison with speckle interferometry. The
various methods of image restoration and their comparisons are also discussed.
Finally, we have touched upon certain astrophysical problems which can be
tackled with the newly developed speckle interferometer using the 2.34 meter
Vainu Bappu Telescope (VBT), situated at the Vainu Bappu Observatory (VBO),
Kavalur, India.Comment: 32 pages tex files including figure
An investigation of the utility of monaural sound source separation via nonnegative matrix factorization applied to acoustic echo and reverberation mitigation for hands-free telephony
In this thesis we investigate the applicability and utility of Monaural Sound Source Separation (MSSS) via Nonnegative Matrix Factorization (NMF) for various problems related to audio for hands-free telephony. We first investigate MSSS via NMF as an alternative acoustic echo reduction approach to existing approaches such as Acoustic Echo Cancellation (AEC). To this end, we present the single-channel acoustic echo problem as an MSSS problem, in which the objective is to extract the users signal from a mixture also containing acoustic echo and noise. To perform separation, NMF is used to decompose the near-end microphone signal onto the union of two nonnegative bases in the magnitude Short Time Fourier Transform domain. One of these bases is for the spectral energy of the acoustic echo signal, and is formed from the in- coming far-end user’s speech, while the other basis is for the spectral energy of the near-end speaker, and is trained with speech data a priori. In comparison to AEC, the speaker extraction approach obviates Double-Talk Detection (DTD), and is demonstrated to attain its maximal echo mitigation performance immediately upon initiation and to maintain that performance during and after room changes for similar computational requirements. Speaker extraction is also shown to introduce distortion of the near-end speech signal during double-talk, which is quantified by means of a speech distortion measure and compared to that of AEC. Subsequently, we address Double-Talk Detection (DTD) for block-based AEC algorithms. We propose a novel block-based DTD algorithm that uses the available signals and the estimate of the echo signal that is produced by NMF-based speaker extraction to compute a suitably normalized correlation-based decision variable, which is compared to a fixed threshold to decide on doubletalk. Using a standard evaluation technique, the proposed algorithm is shown to have comparable detection performance to an existing conventional block-based DTD algorithm. It is also demonstrated to inherit the room change insensitivity of speaker extraction, with the proposed DTD algorithm generating minimal false doubletalk indications upon initiation and in response to room changes in comparison to the existing conventional DTD. We also show that this property allows its paired AEC to converge at a rate close to the optimum. Another focus of this thesis is the problem of inverting a single measurement of a non- minimum phase Room Impulse Response (RIR). We describe the process by which percep- tually detrimental all-pass phase distortion arises in reverberant speech filtered by the inverse of the minimum phase component of the RIR; in short, such distortion arises from inverting the magnitude response of the high-Q maximum phase zeros of the RIR. We then propose two novel partial inversion schemes that precisely mitigate this distortion. One of these schemes employs NMF-based MSSS to separate the all-pass phase distortion from the target speech in the magnitude STFT domain, while the other approach modifies the inverse minimum phase filter such that the magnitude response of the maximum phase zeros of the RIR is not fully compensated. Subjective listening tests reveal that the proposed schemes generally produce better quality output speech than a comparable inversion technique
An investigation of the utility of monaural sound source separation via nonnegative matrix factorization applied to acoustic echo and reverberation mitigation for hands-free telephony
In this thesis we investigate the applicability and utility of Monaural Sound Source Separation (MSSS) via Nonnegative Matrix Factorization (NMF) for various problems related to audio for hands-free telephony. We first investigate MSSS via NMF as an alternative acoustic echo reduction approach to existing approaches such as Acoustic Echo Cancellation (AEC). To this end, we present the single-channel acoustic echo problem as an MSSS problem, in which the objective is to extract the users signal from a mixture also containing acoustic echo and noise. To perform separation, NMF is used to decompose the near-end microphone signal onto the union of two nonnegative bases in the magnitude Short Time Fourier Transform domain. One of these bases is for the spectral energy of the acoustic echo signal, and is formed from the in- coming far-end user’s speech, while the other basis is for the spectral energy of the near-end speaker, and is trained with speech data a priori. In comparison to AEC, the speaker extraction approach obviates Double-Talk Detection (DTD), and is demonstrated to attain its maximal echo mitigation performance immediately upon initiation and to maintain that performance during and after room changes for similar computational requirements. Speaker extraction is also shown to introduce distortion of the near-end speech signal during double-talk, which is quantified by means of a speech distortion measure and compared to that of AEC. Subsequently, we address Double-Talk Detection (DTD) for block-based AEC algorithms. We propose a novel block-based DTD algorithm that uses the available signals and the estimate of the echo signal that is produced by NMF-based speaker extraction to compute a suitably normalized correlation-based decision variable, which is compared to a fixed threshold to decide on doubletalk. Using a standard evaluation technique, the proposed algorithm is shown to have comparable detection performance to an existing conventional block-based DTD algorithm. It is also demonstrated to inherit the room change insensitivity of speaker extraction, with the proposed DTD algorithm generating minimal false doubletalk indications upon initiation and in response to room changes in comparison to the existing conventional DTD. We also show that this property allows its paired AEC to converge at a rate close to the optimum. Another focus of this thesis is the problem of inverting a single measurement of a non- minimum phase Room Impulse Response (RIR). We describe the process by which percep- tually detrimental all-pass phase distortion arises in reverberant speech filtered by the inverse of the minimum phase component of the RIR; in short, such distortion arises from inverting the magnitude response of the high-Q maximum phase zeros of the RIR. We then propose two novel partial inversion schemes that precisely mitigate this distortion. One of these schemes employs NMF-based MSSS to separate the all-pass phase distortion from the target speech in the magnitude STFT domain, while the other approach modifies the inverse minimum phase filter such that the magnitude response of the maximum phase zeros of the RIR is not fully compensated. Subjective listening tests reveal that the proposed schemes generally produce better quality output speech than a comparable inversion technique
- …