
    On some entropy functionals derived from Rényi information divergence

    We consider the maximum entropy problems associated with the Rényi Q-entropy, subject to two kinds of constraints on expected values. The constraints considered are a constraint on the standard expectation, and a constraint on the generalized expectation as encountered in nonextensive statistics. The optimum maximum entropy probability distributions, which can exhibit a power-law behaviour, are derived and characterized. The Rényi entropy of the optimum distributions can be viewed as a function of the constraint. This defines two families of entropy functionals in the space of possible expected values. General properties of these functionals, including nonnegativity, minimum, and convexity, are documented. Their relationships as well as numerical aspects are also discussed. Finally, we work out some specific cases for the reference measure Q(x) and recover in a limit case some well-known entropies.
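
    To make the setup concrete, here is the textbook form of this kind of problem (a hedged sketch under standard Tsallis/Rényi maximum-entropy assumptions, not necessarily the exact formulation or notation of the paper): maximizing the Rényi entropy of order q under a normalization and a mean constraint,

\[
\max_{p}\ \frac{1}{1-q}\log\!\int p(x)^{q}\,dx
\quad\text{s.t.}\quad \int p(x)\,dx = 1,\qquad \int x\,p(x)\,dx = m,
\]

    yields stationary distributions of q-exponential (power-law) form,

\[
p^{*}(x)\ \propto\ \bigl[\,1-(1-q)\,\beta\,(x-m)\,\bigr]_{+}^{\,1/(1-q)},
\]

    with the Lagrange parameter β fixed by the constraint; this is the power-law behaviour referred to above.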

    Direct Estimation of Information Divergence Using Nearest Neighbor Ratios

    We propose a direct estimation method for Rényi and f-divergence measures based on a new graph-theoretical interpretation. Suppose that we are given two sample sets X and Y, respectively with N and M samples, where η := M/N is a constant. Considering the k-nearest neighbor (k-NN) graph of Y in the joint data set (X, Y), we show that the average powered ratio of the number of X points to the number of Y points among all k-NN points is proportional to the Rényi divergence of the X and Y densities. A similar method can also be used to estimate f-divergence measures. We derive bias and variance rates, and show that for the class of γ-Hölder smooth functions, the estimator achieves the MSE rate of O(N^{-2γ/(γ+d)}). Furthermore, by using a weighted ensemble estimation technique, for density functions with continuous and bounded derivatives of up to order d, and some extra conditions at the support set boundary, we derive an ensemble estimator that achieves the parametric MSE rate of O(1/N). Our estimators are more computationally tractable than other competing estimators, which makes them appealing in many practical applications. Comment: 2017 IEEE International Symposium on Information Theory (ISIT).
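
    A rough sketch of the neighbor-counting statistic described above is shown below; the k-NN graph construction follows the abstract, but the powering, normalization, and bias correction are simplified placeholders rather than the paper's exact estimator.

```python
# Rough sketch of the k-NN ratio statistic described above.
# The exact normalization, powering, and bias correction used in the
# paper are NOT reproduced here; this only illustrates the graph
# construction step (k-NN of the Y points inside the pooled sample).
import numpy as np
from scipy.spatial import cKDTree

def knn_ratio_statistic(X, Y, k=5, alpha=0.5):
    """Average powered ratio of X-neighbors to Y-neighbors around each Y point."""
    N, M = len(X), len(Y)
    eta = M / N
    Z = np.vstack([X, Y])                     # pooled sample (X first, then Y)
    labels = np.r_[np.zeros(N), np.ones(M)]   # 0 = X point, 1 = Y point
    tree = cKDTree(Z)
    # query k+1 neighbors for each Y point and drop the point itself
    _, idx = tree.query(Y, k=k + 1)
    idx = idx[:, 1:]
    ratios = []
    for neighbors in idx:
        n_x = np.sum(labels[neighbors] == 0)  # neighbors coming from X
        n_y = k - n_x                         # neighbors coming from Y
        ratios.append((eta * n_x / max(n_y, 1)) ** alpha)  # guard against n_y = 0
    return float(np.mean(ratios))
```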

    Finite-Sample Analysis of Fixed-k Nearest Neighbor Density Functional Estimators

    We provide finite-sample analysis of a general framework for using k-nearest neighbor statistics to estimate functionals of a nonparametric continuous probability density, including entropies and divergences. Rather than plugging a consistent density estimate (which requires k → ∞ as the sample size n → ∞) into the functional of interest, the estimators we consider fix k and perform a bias correction. This is more efficient computationally, and, as we show in certain cases, statistically, leading to faster convergence rates. Our framework unifies several previous estimators, for most of which ours are the first finite-sample guarantees. Comment: 16 pages, 0 figures.
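
    A classical instance of this fixed-k idea is the Kozachenko-Leonenko differential entropy estimator, sketched below: k stays fixed and the digamma terms supply the bias correction. This is offered only as an illustration of the kind of estimator the framework analyzes, not as the paper's general construction.

```python
# Kozachenko-Leonenko differential entropy estimator: a fixed-k nearest
# neighbor statistic with a digamma bias correction, shown as one
# well-known member of the family of estimators analyzed above.
import numpy as np
from scipy.spatial import cKDTree
from scipy.special import digamma, gammaln

def kl_entropy(X, k=3):
    n, d = X.shape
    tree = cKDTree(X)
    # distance to the k-th nearest neighbor (excluding the point itself)
    dist, _ = tree.query(X, k=k + 1)
    eps = dist[:, -1]
    # log volume of the unit d-ball: pi^{d/2} / Gamma(d/2 + 1)
    log_c_d = (d / 2) * np.log(np.pi) - gammaln(d / 2 + 1)
    return digamma(n) - digamma(k) + log_c_d + d * np.mean(np.log(eps))
```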

    Information Theoretic Structure Learning with Confidence

    Information-theoretic measures (e.g. the Kullback-Leibler divergence and Shannon mutual information) have been used for exploring possibly nonlinear multivariate dependencies in high dimensions. If these dependencies are assumed to follow a Markov factor graph model, this exploration process is called structure discovery. For discrete-valued samples, estimates of the information divergence over the parametric class of multinomial models lead to structure discovery methods whose mean squared error achieves parametric convergence rates as the sample size grows. However, a naive application of this method to continuous nonparametric multivariate models converges much more slowly. In this paper we introduce a new method for nonparametric structure discovery that uses weighted ensemble divergence estimators that achieve parametric convergence rates and obey an asymptotic central limit theorem that facilitates hypothesis testing and other types of statistical validation. Comment: 10 pages, 3 figures.
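
    As a hedged illustration of how such a central limit theorem enables statistical validation, the sketch below runs a one-sided z-test on a dependence estimate before keeping an edge; the estimate and its standard error are placeholders, not the paper's estimator.

```python
# Sketch only: treat an asymptotically normal mutual-information (or
# divergence) estimate as Gaussian around its mean and test
# H0: "no dependence" against a positive alternative.  How mi_hat and
# se_hat are obtained is outside this sketch.
from scipy.stats import norm

def keep_edge(mi_hat, se_hat, level=0.05):
    """One-sided z-test of H0: MI = 0 against MI > 0."""
    z = mi_hat / se_hat
    p_value = 1.0 - norm.cdf(z)
    return p_value < level
```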

    Ensemble estimation of multivariate f-divergence

    f-divergence estimation is an important problem in the fields of information theory, machine learning, and statistics. While several divergence estimators exist, relatively few of their convergence rates are known. We derive the MSE convergence rate for a density plug-in estimator of f-divergence. Then, by applying the theory of optimally weighted ensemble estimation, we derive a divergence estimator with a convergence rate of O(1/T) that is simple to implement and performs well in high dimensions. We validate our theoretical results with experiments. Comment: 14 pages, 6 figures; a condensed version of this paper was accepted to ISIT 2014. Version 2: moved the proofs of the theorems from the main body to appendices at the end.
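
    The weighting step behind "optimally weighted ensemble estimation" can be pictured as a small linear-algebra problem, sketched below under illustrative assumptions: the bias basis functions h^{j/d} are stand-ins, and the paper's exact constraint set is not reproduced.

```python
# Illustrative sketch: combine plug-in estimates computed at several
# bandwidths with weights that (i) sum to one, (ii) cancel the leading
# bias terms, and (iii) have small norm so the variance is not inflated.
# The bias basis h**(j/d) is an assumption made for illustration.
import numpy as np

def ensemble_weights(bandwidths, d, n_bias_terms):
    L = len(bandwidths)
    A = np.vstack([np.ones(L)] +
                  [np.asarray(bandwidths) ** (j / d) for j in range(1, n_bias_terms + 1)])
    b = np.zeros(n_bias_terms + 1)
    b[0] = 1.0                      # weights sum to one; bias rows forced to zero
    return np.linalg.pinv(A) @ b    # minimum-norm solution keeps the variance small

def ensemble_estimate(point_estimates, weights):
    return float(np.dot(weights, point_estimates))
```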

    A simple probabilistic construction yielding generalized entropies and divergences, escort distributions and q-Gaussians

    We give a simple probabilistic description of a transition between two states which leads to a generalized escort distribution. When the parameter of the distribution varies, it defines a parametric curve that we call an escort-path. The Rényi divergence appears as a natural by-product of the setting. We study the dynamics of the Fisher information on this path, and show in particular that the thermodynamic divergence is proportional to Jeffreys' divergence. Next, we consider the problem of inferring a distribution on the escort-path, subject to generalized moment constraints. We show that our setting naturally induces a rationale for the minimization of the Rényi information divergence. Then, we derive the optimum distribution as a generalized q-Gaussian distribution.
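
    For orientation, the standard definitions behind these objects (a sketch of the usual conventions; the paper's generalized escort-path may differ in detail): the escort distribution of order q attached to a density p, and the q-Gaussian family, are

\[
P_q(x) \;=\; \frac{p(x)^{q}}{\displaystyle\int p(u)^{q}\,du},
\qquad
G_q(x) \;\propto\; \bigl[\,1-(1-q)\,\beta x^{2}\,\bigr]_{+}^{\,1/(1-q)},
\]

    which reduce to p itself and to the Gaussian, respectively, as q → 1.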

    Ensemble Estimation of Information Divergence

    Recent work has focused on the problem of nonparametric estimation of information divergence functionals between two continuous random variables. Many existing approaches require either restrictive assumptions about the density support set or difficult calculations at the support set boundary, which must be known a priori. The mean squared error (MSE) convergence rate of a leave-one-out kernel density plug-in divergence functional estimator is derived for general bounded density support sets, where knowledge of the support boundary, and therefore boundary correction, is not required. The theory of optimally weighted ensemble estimation is generalized to derive a divergence estimator that achieves the parametric rate when the densities are sufficiently smooth. Guidelines for tuning parameter selection and the asymptotic distribution of this estimator are provided. Based on the theory, an empirical estimator of Rényi-α divergence is proposed that greatly outperforms the standard kernel density plug-in estimator in terms of mean squared error, especially in high dimensions. The estimator is shown to be robust to the choice of tuning parameters. We show extensive simulation results that verify the theoretical results of our paper. Finally, we apply the proposed estimator to estimate bounds on the Bayes error rate of a cell classification problem.
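
    The baseline being improved upon can be sketched as follows: a naive Gaussian-kernel plug-in for D_α(f||g) = (1/(α−1)) log ∫ f^α g^{1−α}, with a fixed bandwidth and a leave-one-out evaluation at the g-samples. The boundary handling and ensemble weighting described above are deliberately omitted; bandwidth and α are illustrative choices.

```python
# Naive kernel density "plug-in" estimate of the Renyi-alpha divergence,
# using the identity  integral f^a g^{1-a} = E_{Y~g}[(f(Y)/g(Y))^a]
# and a leave-one-out estimate of g at its own samples.
import numpy as np

def gaussian_kde_at(points, data, h):
    """Evaluate a Gaussian KDE built on `data` at `points`."""
    d = data.shape[1]
    sq = np.sum((points[:, None, :] - data[None, :, :]) ** 2, axis=-1)
    kernel = np.exp(-sq / (2 * h ** 2)) / ((2 * np.pi * h ** 2) ** (d / 2))
    return kernel.mean(axis=1)

def renyi_divergence_plugin(X, Y, alpha=0.8, h=0.3):
    M, d = Y.shape
    f_at_Y = gaussian_kde_at(Y, X, h)          # density of X, evaluated at Y samples
    # leave-one-out density of Y at each Y_j: average kernel over the other M-1 points
    sq = np.sum((Y[:, None, :] - Y[None, :, :]) ** 2, axis=-1)
    kernel = np.exp(-sq / (2 * h ** 2)) / ((2 * np.pi * h ** 2) ** (d / 2))
    g_at_Y = (kernel.sum(axis=1) - kernel.diagonal()) / (M - 1)
    return np.log(np.mean((f_at_Y / g_at_Y) ** alpha)) / (alpha - 1)
```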

    A family of generalized quantum entropies: definition and properties

    We present a quantum version of the generalized (h, φ)-entropies, introduced by Salicrú et al. for the study of classical probability distributions. We establish their basic properties and show that already known quantum entropies such as the von Neumann entropy, and quantum versions of the Rényi, Tsallis, and unified entropies, constitute particular classes of the present general quantum Salicrú form. We show that majorization plays a key role in explaining most of their common features. We give a characterization of the quantum (h, φ)-entropies under the action of quantum operations and study their properties for composite systems. We apply these generalized entropies to the problem of detection of quantum entanglement and introduce a discussion on possible generalized conditional entropies as well.
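
    A minimal numerical sketch of the Salicrú form H_{(h,φ)}(ρ) = h(Tr φ(ρ)), evaluated through the eigenvalues of a density matrix, with the usual (h, φ) choices that recover the von Neumann, Rényi, and Tsallis entropies (shown for illustration; the notation is assumed, not taken from the paper):

```python
# Evaluate H_{(h,phi)}(rho) = h( Tr phi(rho) ) via the eigenvalues of rho.
# The (h, phi) pairs below are the standard classical choices recovering
# the von Neumann, Renyi and Tsallis entropies as special cases.
import numpy as np

def h_phi_entropy(rho, h, phi):
    evals = np.linalg.eigvalsh(rho)
    evals = evals[evals > 1e-12]          # drop numerically zero eigenvalues
    return h(np.sum(phi(evals)))

rho = np.diag([0.5, 0.3, 0.2])            # toy density matrix

von_neumann = h_phi_entropy(rho, h=lambda x: x, phi=lambda p: -p * np.log(p))
alpha = 2.0
renyi = h_phi_entropy(rho, h=lambda x: np.log(x) / (1 - alpha), phi=lambda p: p ** alpha)
q = 2.0
tsallis = h_phi_entropy(rho, h=lambda x: (1 - x) / (q - 1), phi=lambda p: p ** q)
```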

    Postquantum Brègman relative entropies and nonlinear resource theories

    We introduce the family of postquantum Brègman relative entropies, based on nonlinear embeddings into reflexive Banach spaces (with examples given by reflexive noncommutative Orlicz spaces over semi-finite W*-algebras, nonassociative L_p spaces over semi-finite JBW-algebras, and noncommutative L_p spaces over arbitrary W*-algebras). This allows us to define a class of geometric categories for nonlinear postquantum inference theory (providing an extension of Chencov's approach to foundations of statistical inference), with constrained maximisations of Brègman relative entropies as morphisms and nonlinear images of closed convex sets as objects. Further generalisation to a framework for nonlinear convex operational theories is developed using a larger class of morphisms, determined by Brègman nonexpansive operations (which provide a well-behaved family of Mielnik's nonlinear transmitters). As an application, we derive a range of nonlinear postquantum resource theories determined in terms of this class of operations. Comment: v2: several corrections and improvements, including an extension to the postquantum (generally) and JBW-algebraic (specifically) cases, a section on nonlinear resource theories, and a more informative paper title.
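
    For reference, the classical (finite-dimensional, differentiable) special case that these constructions generalize is the Brègman divergence of a convex functional f,

\[
D_{f}(x,y) \;=\; f(x)-f(y)-\langle \nabla f(y),\,x-y\rangle ,
\]

    which, for f(ρ) = tr(ρ log ρ) on density operators ρ, σ, reduces to the Umegaki relative entropy tr ρ(log ρ − log σ); the paper replaces this gradient pairing by nonlinear embeddings into reflexive Banach spaces, which this sketch does not capture.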