Search CORE

1,296 research outputs found

Audio Source Separation Using Sparse Representations

Author: Jafari MG
Nesbit A
Plumbley MD
Vincent E
Publication venue: 'IGI Global'
Publication date: 01/01/2010
Field of study

This is the author's final version of the article, first published as A. Nesbit, M. G. Jafari, E. Vincent and M. D. Plumbley. Audio Source Separation Using Sparse Representations. In W. Wang (Ed), Machine Audition: Principles, Algorithms and Systems. Chapter 10, pp. 246-264. IGI Global, 2011. ISBN 978-1-61520-919-4. DOI: 10.4018/978-1-61520-919-4.ch010file: NesbitJafariVincentP11-audio.pdf:n\NesbitJafariVincentP11-audio.pdf:PDF owner: markp timestamp: 2011.02.04file: NesbitJafariVincentP11-audio.pdf:n\NesbitJafariVincentP11-audio.pdf:PDF owner: markp timestamp: 2011.02.04The authors address the problem of audio source separation, namely, the recovery of audio signals from recordings of mixtures of those signals. The sparse component analysis framework is a powerful method for achieving this. Sparse orthogonal transforms, in which only few transform coefficients differ significantly from zero, are developed; once the signal has been transformed, energy is apportioned from each transform coefficient to each estimated source, and, finally, the signal is reconstructed using the inverse transform. The overriding aim of this chapter is to demonstrate how this framework, as exemplified here by two different decomposition methods which adapt to the signal to represent it sparsely, can be used to solve different problems in different mixing scenarios. To address the instantaneous (neither delays nor echoes) and underdetermined (more sources than mixtures) mixing model, a lapped orthogonal transform is adapted to the signal by selecting a basis from a library of predetermined bases. This method is highly related to the windowing methods used in the MPEG audio coding framework. In considering the anechoic (delays but no echoes) and determined (equal number of sources and mixtures) mixing case, a greedy adaptive transform is used based on orthogonal basis functions that are learned from the observed data, instead of being selected from a predetermined library of bases. This is found to encode the signal characteristics, by introducing a feedback system between the bases and the observed data. Experiments on mixtures of speech and music signals demonstrate that these methods give good signal approximations and separation performance, and indicate promising directions for future research

HAL-CentraleSupelec

INRIA a CCSD electronic archive server

Surrey Research Insight

HAL-Rennes 1

MISEP - Linear and Nonlinear ICA Based on Mutual Information

Author: Almeida Luis B.
Publication venue: Elsevier
Publication date: 01/12/2002
Field of study

MISEP is a method for linear and nonlinear ICA, that is able to handle a large variety of situations. It is an extension of the well known INFOMAX method, in two directions: (1) handling of nonlinear mixtures, and (2) learning the nonlinearities to be used at the outputs. The method can therefore separate linear and nonlinear mixtures of components with a wide range of statistical distributions. This paper presents the basis of the MISEP method, as well as experimental results obtained with it. The results illustrate the applicability of the method to various situations, and show that, although the nonlinear blind separation problem is ill-posed, use of regularization allows the problem to be solved when the nonlinear mixture is relatively smooth

CiteSeerX

CogPrints Cognitive Sciences Eprint Archive

Hybrid solutions to instantaneous MIMO blind separation and decoding: narrowband, QAM and square cases

Author: Zhao Xu
Publication venue: The University of Edinburgh
Publication date: 01/01/2009
Field of study

Future wireless communication systems are desired to support high data rates and high quality transmission when considering the growing multimedia applications. Increasing the channel throughput leads to the multiple input and multiple output and blind equalization techniques in recent years. Thereby blind MIMO equalization has attracted a great interest.Both system performance and computational complexities play important roles in real time communications. Reducing the computational load and providing accurate performances are the main challenges in present systems. In this thesis, a hybrid method which can provide an affordable complexity with good performance for Blind Equalization in large constellation MIMO systems is proposed first. Saving computational cost happens both in the signal sep- aration part and in signal detection part. First, based on Quadrature amplitude modulation signal characteristics, an efficient and simple nonlinear function for the Independent Compo- nent Analysis is introduced. Second, using the idea of the sphere decoding, we choose the soft information of channels in a sphere, and overcome the so- called curse of dimensionality of the Expectation Maximization (EM) algorithm and enhance the final results simultaneously. Mathematically, we demonstrate in the digital communication cases, the EM algorithm shows Newton -like convergence.Despite the widespread use of forward -error coding (FEC), most multiple input multiple output (MIMO) blind channel estimation techniques ignore its presence, and instead make the sim- plifying assumption that the transmitted symbols are uncoded. However, FEC induces code structure in the transmitted sequence that can be exploited to improve blind MIMO channel estimates. In final part of this work, we exploit the iterative channel estimation and decoding performance for blind MIMO equalization. Experiments show the improvements achievable by exploiting the existence of coding structures and that it can access the performance of a BCJR equalizer with perfect channel information in a reasonable SNR range. All results are confirmed experimentally for the example of blind equalization in block fading MIMO systems

Adaptive Multiple Subtraction: Unification And Comparison Of Matching Filters Based On The L(q)-norm And Statistical Independence

Author: Batany
D
H
JMT
LT
YM
Publication venue: 'Society of Exploration Geophysicists'
Publication date: 06/12/2016
Field of study

An adaptive multiple subtraction step is necessary for almost all methods that predict seismic multiple reflected waves. We aim at giving a better understanding of matching filters based on l(q)-norms and on statistical independence. We found that the formulation of all of these techniques can be gathered in a mutual framework by introducing a space-time operator, called the primary enhancer, acting on the estimated primaries. The differences between the considered matching filters become more intuitive because this operator behaves as a simple amplitude compressor. In this perspective, all the methods tend to uncorrelate the predicted multiples and the enhanced estimated primaries. The study of these matching-filter methods can be narrowed to the study of the primary enhancer operator because it is the only difference. Moreover, we have emphasized the role of using adjacent traces or windowing approaches in terms of statistics, and we show that an adequate windowing strategy may overbear the choice of the objective function. Indeed, our analysis showed that setting a good windowing strategy may be more important than changing the classical least-squares adaptation criterion to other approaches based on l(q)-norm minimization or independent component analysis.81V43V54PetrobrasFrench National Research AgencyCGGTotalSchlumberge