5 research outputs found

    Annulation d'échos acoustiques par filtrage non linéaire

    Get PDF
    - De nos jours, l'utilisation d'un téléphone portable en mode main libre fait intervenir un filtre linéaire adaptatif, dans le but de compenser les échos acoustiques. Dans ce mode de fonctionnement, le signal émis par le haut parleur est de fort niveau; il en résulte que des distortions non linéaires interviennent dans la propagation du signal, entre le haut parleur et le microphone. Différents types de filtres sont proposés pour compenser ces échos, incluant les filtres non linéaires en cascade ainsi que le filtre bilinéaire. Le filtre bilinéaire est un sous modèle du NARMAX (Nonlinear Autoregessive Moving Average with eXogenous inputs). Par la suite, nous présenterons une évaluation des performances basée sur une mesure standard ERLE (Echo Return Loss Enhancement), entre les différents filtres utilisés

    Efficient Multidimensional Regularization for Volterra Series Estimation

    Full text link
    This paper presents an efficient nonparametric time domain nonlinear system identification method. It is shown how truncated Volterra series models can be efficiently estimated without the need of long, transient-free measurements. The method is a novel extension of the regularization methods that have been developed for impulse response estimates of linear time invariant systems. To avoid the excessive memory needs in case of long measurements or large number of estimated parameters, a practical gradient-based estimation method is also provided, leading to the same numerical results as the proposed Volterra estimation method. Moreover, the transient effects in the simulated output are removed by a special regularization method based on the novel ideas of transient removal for Linear Time-Varying (LTV) systems. Combining the proposed methodologies, the nonparametric Volterra models of the cascaded water tanks benchmark are presented in this paper. The results for different scenarios varying from a simple Finite Impulse Response (FIR) model to a 3rd degree Volterra series with and without transient removal are compared and studied. It is clear that the obtained models capture the system dynamics when tested on a validation dataset, and their performance is comparable with the white-box (physical) models

    주파수 및 시간적 상관관계에 기반한 음향학적 에코 억제 기법

    Get PDF
    학위논문 (박사)-- 서울대학교 대학원 : 전기·컴퓨터공학부, 2016. 8. 김남수.In the past decades, a number of approaches have been dedicated to acoustic echo cancellation and suppression which reduce the negative effects of acoustic echo, namely the acoustic coupling between the loudspeaker and microphone in a room. In particular, the increasing use of full-duplex telecommunication systems has led to the requirement of faster and more reliable acoustic echo cancellation algorithms. The solutions have been based on adaptive filters, but the length of these filters has to be long enough to consider most of the echo signal and linear filtering in these algorithms may be limited to remove the echo signal in various environments. In this thesis, a novel stereophonic acoustic echo suppression (SAES) technique based on spectral and temporal correlations is proposed in the short-time Fourier transform (STFT) domain. Unlike traditional stereophonic acoustic echo cancellation, the proposed algorithm estimates the echo spectra in the STFT domain and uses a Wiener filter to suppress echo without performing any explicit double-talk detection. The proposed approach takes account of interdependencies among components in adjacent time frames and frequency bins, which enables more accurate estimation of the echo signals. Due to the limitations of power amplifiers or loudspeakers, the echo signals captured in the microphones are not in a linear relationship with the far-end signals even when the echo path is perfectly linear. The nonlinear components of the echo cannot be successfully removed by a linear acoustic echo canceller. The remaining echo components in the output of acoustic echo suppression (AES) can be further suppressed by applying residual echo suppression (RES) algorithms. In this thesis, we propose an optimal RES gain estimation based on deep neural network (DNN) exploiting both the far-end and the AES output signals in all frequency bins. A DNN structure is introduced as a regression function representing the complex nonlinear mapping from these signals to the optimal RES gain. Because of the capability of the DNN, the spectro-temporal correlations in the full-band can be considered while finding the nonlinear function. The proposed method does not require any explicit double-talk detectors to deal with single-talk and double-talk situations. One of the well-known approaches for nonlinear acoustic echo cancellation is an adaptive Volterra filtering and various algorithms based on the Volterra filter were proposed to describe the characteristics of nonlinear echo and showed the better performance than the conventional linear filtering. However, the performance might be not satisfied since these algorithms could not consider the full correlation for the nonlinear relationship between the input signal and far-end signal in time-frequency domain. In this thesis, we propose a novel DNN-based approach for nonlinear acoustic echo suppression (NAES), extending the proposed RES algorithm. Instead of estimating the residual gain for suppressing the nonlinear echo components, the proposed algorithm straightforwardly recovers the near-end speech signal through the direct gain estimation obtained from DNN frameworks on the input and far-end signal. For echo aware training, a priori and a posteriori signal-to-echo ratio (SER) are introduced as additional inputs of the DNN for tracking the change of the echo signal. In addition, the multi-task learning (MTL) to the DNN-based NAES is combined to the DNN incorporating echo aware training for robustness. In the proposed system, an additional task of double-talk detection is jointly trained with the primary task of the gain estimation for NAES. The DNN can learn the good representations which can suppress more in single-talk periods and improve the gain estimates in double-talk periods through the MTL framework. Besides, the proposed NAES using echo aware training and MTL with double-talk detection makes the DNN be more robust in various conditions. The proposed techniques show significantly better performance than the conventional AES methods in both single- and double-talk periods. As a pre-processing of various applications such as speech recognition and speech enhancement, these approaches can help to transmit the clean speech and provide an acceptable communication in full-duplex real environments.Chapter 1 Introduction 1 1.1 Background 1 1.2 Scope of thesis 3 Chapter 2 Conventional Approaches for Acoustic Echo Suppression 7 2.1 Single Channel Acoustic Echo Cancellation and Suppression 8 2.1.1 Single Channel Acoustic Echo Cancellation 8 2.1.2 Adaptive Filters for Acoustic Echo Cancellation 10 2.1.3 Acoustic Echo Suppression Based on Spectral Modication 11 2.2 Residual Echo Suppression 13 2.2.1 Spectral Feature-based Nonlinear Residual Echo Suppression 15 2.3 Stereophonic Acoustic Echo Cancellation 17 2.4 Wiener Filtering for Stereophonic Acoustic Echo Suppression 20 Chapter 3 Stereophonic Acoustic Echo Suppression Incorporating Spectro-Temporal Correlations 25 3.1 Introduction 25 3.2 Linear Time-Invariant Systems in the STFT Domain with Crossband Filtering 26 3.3 Enhanced SAES (ESAES) Utilizing Spectro-Temporal Correlations 29 3.3.1 Problem Formulation 31 3.3.2 Estimation of Extended PSD Matrices, Echo Spectra, and Gain Function 34 3.3.3 Complexity of the Proposed ESAES Algorithm 36 3.4 Experimental Results 37 3.5 Summary 41 Chapter 4 Nonlinear Residual Echo Suppression Based on Deep Neural Network 43 4.1 Introduction 43 4.2 A Brief Review on RES 45 4.3 Deep Neural Networks 46 4.4 Nonlinear RES using Deep Neural Network 49 4.5 Experimental Results 52 4.5.1 Combination with Stereophonic Acoustic Echo Suppression 59 4.6 Summary 61 Chapter 5 Enhanced Deep Learning Frameworks for Nonlinear Acoustic Echo Suppression 69 5.1 Introduction 69 5.2 DNN-based Nonlinear Acoustic Echo Suppression using Echo Aware Training 72 5.3 Multi-Task Learning for NAES 75 5.4 Experimental Results 78 5.5 Summary 82 Chapter 6 Conclusions 89 Bibliography 91 요약 101Docto

    An Immune-inspired, Information-theoretic Framework For Blind Inversion Of Wiener Systems

    No full text
    This work proposes a new approach to the blind inversion of Wiener systems. A Wiener system is composed of a linear time-invariant (LTI) sub-system followed by a memoryless nonlinear function. The goal is to recover the input signal by knowing just the output of the Wiener system, and the straightforward scheme to achieve this is called the Hammerstein system - apply a memoryless nonlinear mapping followed by a LTI sub-system to the output signal of the Wiener system. If the input of the Wiener system is originally iid and some mild conditions are satisfied, the inversion is possible. Based on this statement and the limitations of relevant previous works, a solution is proposed combining (i) immune-inspired optimization algorithms, (ii) information theory and (iii) IIR filters that yield a robust scheme with a relatively reduced risk of local convergence. Experimental results indicated a similar or superior performance of the new approach, in comparison with two other blind methodologies.1131831Haykin, S., (2001) Adaptive Filter Theory, , 4th ed. Prentice HallJutten, C., Karhunen, J., Advances in blind source separation (BSS) and independent component analysis (ICA) for nonlinear mixtures (2004) Int.J. Neural Syst., 14 (5), pp. 267-292Romano, J.M.T., Attux, R., Cavalcante, C., Suyama, R., (2011) Unsupervised Signal Processing: Channel Equalization and Source Separation, , CRC PressTaleb, A., Solé-Casals, J., Jutten, C., Quasi-nonparametric blind inversion of Wiener systems (2001) IEEE Trans. Signal Process., 49 (5), pp. 917-924Taleb, A., Jutten, C., Source separation in post-nonlinear mixtures (1999) IEEE Trans. Signal Process., 47 (10), pp. 2807-2820Hosseini, S., Deville, Y., Blind separation of linear-quadratic mixtures of real sources using a recurrent structure (2003) Artificial Neural Nets Problem Solving Methods, Lecture Notes in Computer Science, 2687, pp. 241-248Duarte, L., Jutten, C., Moussaoui, S., A Bayesian nonlinear source separation method for smart ion-selective electrode arrays (2009) IEEE Sens. J., 9 (12), pp. 1763-1771Hunter, I.W., Korenberg, M.J., The identification of nonlinear biological systems Wiener and Hammerstein cascade models (1986) Biol. Cybern., 55 (23), pp. 135-144Chatterjee, S.K., Ghosh, S., Das, S., Manzella, V., Vitaletti, A., Masi, E., Santopolo, L., Maharatna, K., Forward and inverse modelling approaches for prediction of light stimulus from electrophysiological response in plants (2014) Measurement, 53, pp. 101-116Bars, R., Nonlinear and long range control of a distillation pilot plant (1990) Proceedings of the 9th IFAC/IFORS Symposium on Identification and System Parameter Estimation, pp. 848-853Das, S., Mukherjee, S., Pan, I., Gupta, A., Identification of the core temperature in a fractional order noisy environment for thermal feedback in nuclear reactors (2011) IEEE Technology Students' Symposium, pp. 180-186Ma, W., Lim, H., Sznaier, M., Camps, O., Risk Adjusted Identification of Wiener Systems (2006) 45th IEEE Conference on Decision and Control, pp. 2512-2517Príncipe, J., (2010) Information Theoretic Learning Renyi's Entropy and Kernel Perspectives, , Springer VerlagCover, T., Thomas, J., (2006) Elements of Information Theory, , 2nd ed. Wiley-InterscienceSolé-Casals, J., Jutten, C., Taleb, A., Parametric approach to blind deconvolution of nonlinear channels (2002) Neurocomputing, 48 (14), pp. 339-355Rojas, F., An evolutionary approach for blind inversion of Wiener systems (2007) Independent Component Analysis and Signal Separation, Lecture Notes in Computer Science, 4666, pp. 260-267Babaie-Zadeh, M., Blind inversion of Wiener system using a minimization-projection (MP) approach (2003) ICA 2003, pp. 681-686Parzen, E., On estimation of a probability density function and mode (1962) Ann. Math. Stat., 33 (3), pp. 1065-1076Solé-Casals, J., Faundez-Zanuy, M., Application of the mutual information minimization to speaker recognition/identification improvement (2006) Neurocomputing, 69 (1315), pp. 1467-1474Zhang, K., Practical method for blind inversion of Wiener systems (2004) 2004 International Joint Conference on Neural Networks, pp. 2163-2168Solé-Casals, J., Jutten, C., Pham, D., Fast approximation of nonlinearities for improving inversion algorithms of pnl mixtures and Wiener systems (2005) Signal Process., 85 (9), pp. 1780-1786Solé-Casals, J., Caiafa, C., A fast gradient approximation for nonlinear blind signal processing (2012) Cogn. Comput., 4 (1063), pp. 1-10Rojas, F., A canonical genetic algorithm for blind inversion of linear channels (2006) Independent Component Analysis and Signal Separation, Lecture Notes in Computer Science, 3889, pp. 238-245Bäck, T., Schwefel, H., An overview of evolutionary algorithms for parameter optimization (1993) Evol. Comput., 1 (1), pp. 1-23Comon, P., Jutten, C., (2010) Handbook of Blind Source Separation, , Academic PressDe Castro, L.N., Von Zuben, F.J., Learning and optimization using the clonal selection principle (2002) IEEE Trans. Evol. Comput., 6 (13), pp. 239-251Darbellay, G.A., Vajda, I., Estimation of the information by an adaptive partitioning of the observation space (1999) IEEE Trans. Inf. Theory, 45 (4), pp. 1315-1321Pham, D.-T., Blind separation of instantaneous mixture of sources based on order statistics (2000) IEEE Trans. Signal Process., 48 (2), pp. 363-375Leon-Garcia, A., (1994) Probability and Random Processes for Electrical Engineering, , Addison WesleyLarue, A., Mars, J., Jutten, C., Frequency-domain blind deconvolution based on mutual information rate (2006) IEEE Trans. Signal Process., 54 (5), pp. 1771-1781Burnet, F.M., Clonal selection and after (1978) Theoretical Immunology, pp. 63-85De Castro, L.N., Timmis, J., (2002) Artificial Immune Systems A New Computational Intelligence Approach, , SpringerDe França, F.O., On the diversity mechanisms of opt-aiNet: A comparative study with fitness sharing (2010) 2010 IEEE Congress on Evolutionary Computation (CEC), pp. 1-8Dias, T., Attux, R., Romano, J., Suyama, R., Blind source separation of post-nonlinear mixtures using evolutionary computation and Gaussianization (2009) Independent Component Analysis and Signal Separation, Lecture Notes in Computer Science, 5441, pp. 235-242Wada, C., Consolaro, D., Ferrari, R., Suyama, R., Attux, R., Zuben, F., Nonlinear blind source deconvolution using recurrent prediction-error filters and an artificial immune system (2009) Independent Component Analysis and Signal Separation, Lecture Notes in Computer Science, 5441, pp. 371-378Wolpert, D., Macready, W., No free lunch theorems for optimization (1997) IEEE Trans. Evol. Comput., 1 (1), pp. 67-82Darbellay, G.A., Tichavsky, P., Independent component analysis through direct estimation of the mutual information (2000) ICA 2000, pp. 69-75Costa, J.P., Lagrange, A., Arliaud, A., Acoustic echo cancellation using nonlinear cascade filters (2003) Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP'03, pp. 389-392Sill, J., Monotonic networks (1998) Advances in Neural Information Processing Systems, 10, pp. 661-667Lang, B., Monotonic multi-layer perceptron networks as universal approximators (2005) Artificial Neural Networks: Formal Models and Their Applications, ICANN 2005, pp. 31-3
    corecore