Search CORE

2,619 research outputs found

Information Theoretical Estimators Toolbox

Author: Szabo Zoltan
Publication venue
Publication date: 01/01/2014
Field of study

We present ITE (information theoretical estimators) a free and open source, multi-platform, Matlab/Octave toolbox that is capable of estimating many different variants of entropy, mutual information, divergence, association measures, cross quantities, and kernels on distributions. Thanks to its highly modular design, ITE supports additionally (i) the combinations of the estimation techniques, (ii) the easy construction and embedding of novel information theoretical estimators, and (iii) their immediate application in information theoretical optimization problems. ITE also includes a prototype application in a central problem class of signal processing, independent subspace analysis and its extensions.Comment: 5 pages; ITE toolbox: https://bitbucket.org/szzoli/ite

arXiv.org e-Print Archive

UCL Discovery

BMICA-independent component analysis based on B-spline mutual information estimator

Author: Li Yan
Walters-Williams Janett
Publication venue: 'Academy and Industry Research Collaboration Center (AIRCC)'
Publication date: 01/04/2012
Field of study

The information theoretic concept of mutual information provides a general framework to evaluate dependencies between variables. Its estimation however using B-Spline has not been used before in creating an approach for Independent Component Analysis. In this paper we present a B-Spline estimator for mutual information to find the independent components in mixed signals. Tested using electroencephalography (EEG) signals the resulting BMICA (B-Spline Mutual Information Independent Component Analysis) exhibits better performance than the standard Independent Component Analysis algorithms of FastICA, JADE, SOBI and EFICA in similar simulations. BMICA was found to be also more reliable than the 'renown' FastICA

University of Southern Queensland ePrints

Least Dependent Component Analysis Based on Mutual Information

Author: A. Cichocki
A. Cichocki
A. Hyvärinen
A. Hyvärinen
A. K. Jain
A. Ziehe
Alexander Kraskov
E. Ott
F. R. Bach
H. Kantz
Harald Stögbauer
J. Chen
J.-F. Cardoso
J.-F. Cardoso
L. F. Kozachenko
O. Vasicek
P. Grassberger
Peter Grassberger
R. L. Somorjai
S. Amari
S. E. Stein
Sergey A. Astakhov
T. M. Cover
Publication venue: 'American Physical Society (APS)'
Publication date: 01/01/2004
Field of study

We propose to use precise estimators of mutual information (MI) to find least dependent components in a linearly mixed signal. On the one hand this seems to lead to better blind source separation than with any other presently available algorithm. On the other hand it has the advantage, compared to other implementations of `independent' component analysis (ICA) some of which are based on crude approximations for MI, that the numerical values of the MI can be used for: (i) estimating residual dependencies between the output components; (ii) estimating the reliability of the output, by comparing the pairwise MIs with those of re-mixed components; (iii) clustering the output according to the residual interdependencies. For the MI estimator we use a recently proposed k-nearest neighbor based algorithm. For time sequences we combine this with delay embedding, in order to take into account non-trivial time correlations. After several tests with artificial data, we apply the resulting MILCA (Mutual Information based Least dependent Component Analysis) algorithm to a real-world dataset, the ECG of a pregnant woman. The software implementation of the MILCA algorithm is freely available at http://www.fz-juelich.de/nic/cs/softwareComment: 18 pages, 20 figures, Phys. Rev. E (in press

arXiv.org e-Print Archive

CiteSeerX

Crossref

Juelich Shared Electronic Resources

CERN Document Server

Improving the performance of translation wavelet transform using BMICA

Author: Li Yan
Walters-Williams Janett
Publication venue: LJS Publishing
Publication date: 01/09/2011
Field of study

Research has shown Wavelet Transform to be one of the best methods for denoising biosignals. Translation-Invariant form of this method has been found to be the best performance. In this paper however we utilize this method and merger with our newly created Independent Component Analysis method – BMICA. Different EEG signals are used to verify the method within the MATLAB environment. Results are then compared with those of the actual Translation-Invariant algorithm and evaluated using the performance measures Mean Square Error (MSE), Peak Signal to Noise Ratio (PSNR), Signal to Distortion Ratio (SDR), and Signal to Interference Ratio (SIR). Experiments revealed that the BMICA Translation-Invariant Wavelet Transform out performed in all four measures. This indicates that it performed superior to the basic Translation- Invariant Wavelet Transform algorithm producing cleaner EEG signals which can influence diagnosis as well as clinical studies of the brain

University of Southern Queensland ePrints

Estimating Mutual Information

Author: A. B. Tsybakov
A. Hyvärinen
A. Renyi
A. Ziehe
Alexander Kraskov
B. van Es
B. W. Silverman
E. S. Dudewicz
G. A. Darbellay
Harald Stögbauer
J. C. Correa
J.-F. Cardoso
J.-F. Cardoso
L. F. Kozachenko
O. Vasicek
Peter Grassberger
R. L. Dobrushin
R. L. Somorjai
R. Steuer
R. Wieczorkowski
T. M. Cover
W. H. Press
Publication venue: 'American Physical Society (APS)'
Publication date: 28/05/2003
Field of study

We present two classes of improved estimators for mutual information

M(X,Y)

, from samples of random points distributed according to some joint probability density

\mu(x,y)

. In contrast to conventional estimators based on binnings, they are based on entropy estimates from

k

-nearest neighbour distances. This means that they are data efficient (with

k=1

we resolve structures down to the smallest possible scales), adaptive (the resolution is higher where data are more numerous), and have minimal bias. Indeed, the bias of the underlying entropy estimates is mainly due to non-uniformity of the density at the smallest resolved scale, giving typically systematic errors which scale as functions of

k/N

for

N

points. Numerically, we find that both families become {\it exact} for independent distributions, i.e. the estimator

\hat M(X,Y)

vanishes (up to statistical fluctuations) if

\mu(x,y) = \mu(x) \mu(y)

. This holds for all tested marginal distributions and for all dimensions of

x

and

y

. In addition, we give estimators for redundancies between more than 2 random variables. We compare our algorithms in detail with existing algorithms. Finally, we demonstrate the usefulness of our estimators for assessing the actual independence of components obtained from independent component analysis (ICA), for improving ICA, and for estimating the reliability of blind source separation.Comment: 16 pages, including 18 figure

arXiv.org e-Print Archive

Crossref

Juelich Shared Electronic Resources

On accuracy of PDF divergence estimators and their applicability to representative data sampling

Author: Bogdan Gabrys
Budka
Cardoso
Cardoso
Cichocki
Dhillon
Duda
Fukunaga
Jenssen
Jenssen
Kapur
Katarzyna Musial
Kullback
Kullback
Kuncheva
Le Cam
MacKay
Marcin Budka
Moreno
Ojala
Parzen
Principe
Ripley
Sheather
Silverman
Stone
Turlach
Publication venue: 'MDPI AG'
Publication date: 01/01/2011
Field of study

Generalisation error estimation is an important issue in machine learning. Cross-validation traditionally used for this purpose requires building multiple models and repeating the whole procedure many times in order to produce reliable error estimates. It is however possible to accurately estimate the error using only a single model, if the training and test data are chosen appropriately. This paper investigates the possibility of using various probability density function divergence measures for the purpose of representative data sampling. As it turned out, the first difficulty one needs to deal with is estimation of the divergence itself. In contrast to other publications on this subject, the experimental results provided in this study show that in many cases it is not possible unless samples consisting of thousands of instances are used. Exhaustive experiments on the divergence guided representative data sampling have been performed using 26 publicly available benchmark datasets and 70 PDF divergence estimators, and their results have been analysed and discussed

Multidisciplinary Digital Publishing Institute

CiteSeerX

Crossref

Directory of Open Access Journals

Bournemouth University Research Online

King's Research Portal