933 research outputs found
Application of the Mutual Information Minimization to speaker recognition / identification improvement
In this paper we propose the inversion of nonlinear distortions in
order to improve the recognition rates of a speaker recognizer system. We
study the effect of saturations on the test signals, trying to take into account real
situations where the training material has been recorded in a controlled
situation but the testing signals present some mismatch with the input signal
level (saturations). The experimental results for speaker recognition shows that
a combination of several strategies can improve the recognition rates with
saturated test sentences from 80% to 89.39%, while the results with clean
speech (without saturation) is 87.76% for one microphone, and for speaker
identification can reduce the minimum detection cost function with saturated
test sentences from 6.42% to 4.15%, while the results with clean speech
(without saturation) is 5.74% for one microphone and 7.02% for the other one
A Sparsity-Based Method for Blind Compensation of a Memoryless Nonlinear Distortion: Application to Ion-Selective Electrodes
International audience— In this paper, we propose a method for blind compensation of a memoryless nonlinear distortion. We assume as prior information that the desired signal admits a sparse representation in a transformed domain that should be known in advance. Then, given that a nonlinear distortion tends to generate signals that are less sparse than the desired one, our proposal is to build a compensating function model that gives rise to a maximally sparse signal. The implementation of this proposal has, as central elements, a criterion built upon an approximation of the 0-norm, the use of polynomial functions as compensating structures, and an optimization strategy based on sequential quadratic programming. We provide a theoretic analysis for an 0-norm criterion and results considering synthetic data. We also employ the method in an actual application related to chemical analysis via ion-selective electrode arrays
Statistical single channel source separation
PhD ThesisSingle channel source separation (SCSS) principally is one of the challenging fields
in signal processing and has various significant applications. Unlike conventional
SCSS methods which were based on linear instantaneous model, this research sets out
to investigate the separation of single channel in two types of mixture which is
nonlinear instantaneous mixture and linear convolutive mixture. For the nonlinear
SCSS in instantaneous mixture, this research proposes a novel solution based on a
two-stage process that consists of a Gaussianization transform which efficiently
compensates for the nonlinear distortion follow by a maximum likelihood estimator to
perform source separation. For linear SCSS in convolutive mixture, this research
proposes new methods based on nonnegative matrix factorization which decomposes a
mixture into two-dimensional convolution factor matrices that represent the spectral
basis and temporal code. The proposed factorization considers the convolutive mixing
in the decomposition by introducing frequency constrained parameters in the model.
The method aims to separate the mixture into its constituent spectral-temporal source
components while alleviating the effect of convolutive mixing. In addition, family of
Itakura-Saito divergence has been developed as a cost function which brings the
beneficial property of scale-invariant. Two new statistical techniques are proposed,
namely, Expectation-Maximisation (EM) based algorithm framework which
maximizes the log-likelihood of a mixed signals, and the maximum a posteriori
approach which maximises the joint probability of a mixed signal using multiplicative
update rules. To further improve this research work, a novel method that incorporates
adaptive sparseness into the solution has been proposed to resolve the ambiguity and
hence, improve the algorithm performance. The theoretical foundation of the proposed
solutions has been rigorously developed and discussed in details. Results have
concretely shown the effectiveness of all the proposed algorithms presented in this
thesis in separating the mixed signals in single channel and have outperformed others
available methods.Universiti Teknikal Malaysia Melaka(UTeM),
Ministry of Higher Education of Malaysi
Flexible methods for blind separation of complex signals
One of the main matter in Blind Source Separation (BSS) performed with a neural network approach is the choice of the nonlinear activation function (AF). In fact if the shape of the activation function is chosen as the cumulative density function (c.d.f.) of the original source the problem is solved.
For this scope in this thesis a flexible approach is introduced and the shape of the
activation functions is changed during the learning process using the so-called “spline functions”.
The problem is complicated in the case of separation of complex sources where there is the problem of the dichotomy between analyticity and boundedness of the complex activation functions. The problem is solved introducing the “splitting function” model as activation function. The “splitting function” is a couple of “spline function” which wind off the real and the imaginary part of the complex activation function, each of one depending from the real and imaginary variable.
A more realistic model is the “generalized splitting function”, which is formed by a couple of two bi-dimensional functions (surfaces), one for the real and one for
the imaginary part of the complex function, each depending by both the real and imaginary part of the complex variable.
Unfortunately the linear environment is unrealistic in many practical applications.
In this way there is the need of extending BSS problem in the nonlinear environment: in this case both the activation function than the nonlinear distorting function are realized by the “splitting function” made of “spline function”.
The complex and instantaneous separation in linear and nonlinear environment allow us to perform a complex-valued extension of the well-known INFOMAX algorithm in several practical situations, such as convolutive mixtures, fMRI signal analysis and bandpass signal transmission.
In addition advanced characteristics on the proposed approach are introduced and deeply described. First of all it is shows as splines are universal nonlinear functions for BSS problem: they are able to perform separation in anyway. Then it is analyzed as the “splitting solution” allows the algorithm to obtain a phase recovery:
usually there is a phase ambiguity. Finally a Cramér-Rao lower bound for ICA is discussed.
Several experimental results, tested by different objective indexes, show the
effectiveness of the proposed approaches
Flexible methods for blind separation of complex signals
One of the main matter in Blind Source Separation (BSS) performed with a neural network approach is the choice of the nonlinear activation function (AF). In fact if the shape of the activation function is chosen as the cumulative density function (c.d.f.) of the original source the problem is solved.
For this scope in this thesis a flexible approach is introduced and the shape of the
activation functions is changed during the learning process using the so-called “spline functions”.
The problem is complicated in the case of separation of complex sources where there is the problem of the dichotomy between analyticity and boundedness of the complex activation functions. The problem is solved introducing the “splitting function” model as activation function. The “splitting function” is a couple of “spline function” which wind off the real and the imaginary part of the complex activation function, each of one depending from the real and imaginary variable.
A more realistic model is the “generalized splitting function”, which is formed by a couple of two bi-dimensional functions (surfaces), one for the real and one for
the imaginary part of the complex function, each depending by both the real and imaginary part of the complex variable.
Unfortunately the linear environment is unrealistic in many practical applications.
In this way there is the need of extending BSS problem in the nonlinear environment: in this case both the activation function than the nonlinear distorting function are realized by the “splitting function” made of “spline function”.
The complex and instantaneous separation in linear and nonlinear environment allow us to perform a complex-valued extension of the well-known INFOMAX algorithm in several practical situations, such as convolutive mixtures, fMRI signal analysis and bandpass signal transmission.
In addition advanced characteristics on the proposed approach are introduced and deeply described. First of all it is shows as splines are universal nonlinear functions for BSS problem: they are able to perform separation in anyway. Then it is analyzed as the “splitting solution” allows the algorithm to obtain a phase recovery:
usually there is a phase ambiguity. Finally a Cramér-Rao lower bound for ICA is discussed.
Several experimental results, tested by different objective indexes, show the
effectiveness of the proposed approaches
A unified approach to sparse signal processing
A unified view of the area of sparse signal processing is presented in tutorial form by bringing together various fields in which the property of sparsity has been successfully exploited. For each of these fields, various algorithms and techniques, which have been developed to leverage sparsity, are described succinctly. The common potential benefits of significant reduction in sampling rate and processing manipulations through sparse signal processing are revealed. The key application domains of sparse signal processing are sampling, coding, spectral estimation, array processing, compo-nent analysis, and multipath channel estimation. In terms of the sampling process and reconstruction algorithms, linkages are made with random sampling, compressed sensing and rate of innovation. The redundancy introduced by channel coding i
- …