Search CORE

856 research outputs found

Minimum Description Length Induction, Bayesianism, and Kolmogorov Complexity

Author: Li Ming
Vitanyi Paul
Publication venue
Publication date: 01/01/1998
Field of study

The relationship between the Bayesian approach and the minimum description length approach is established. We sharpen and clarify the general modeling principles MDL and MML, abstracted as the ideal MDL principle and defined from Bayes's rule by means of Kolmogorov complexity. The basic condition under which the ideal principle should be applied is encapsulated as the Fundamental Inequality, which in broad terms states that the principle is valid when the data are random, relative to every contemplated hypothesis and also these hypotheses are random relative to the (universal) prior. Basically, the ideal principle states that the prior probability associated with the hypothesis should be given by the algorithmic universal probability, and the sum of the log universal probability of the model plus the log of the probability of the data given the model should be minimized. If we restrict the model class to the finite sets then application of the ideal principle turns into Kolmogorov's minimal sufficient statistic. In general we show that data compression is almost always the best strategy, both in hypothesis identification and prediction.Comment: 35 pages, Latex. Submitted IEEE Trans. Inform. Theor

arXiv.org e-Print Archive

CiteSeerX

CWI's Institutional Repository

Implementation of the Combined--Nonlinear Condensation Transformation

Author: Abe
Abe
Aksenov
Bailey
Bailey
Bailey
Bailey
Bailey
Baratella
Bateman
Blaisdell
Boersma
Bromwich
Champernowne
Daniel
Darwin
Dempsey
Dempsey
Denisov
Doray
Drake
Fisher
Forrey
Gamow
Gautschi
Gerhard Soff
Gnedenko
Good
Haywood
Herdan
Homeier
Jens Becher
Jentschura
Jentschura
Jentschura
Jentschura
Jentschura
Jentschura
Jentschura
Johnson
Karlin
Knopp
Konopka
Kulasekera
Levin
Macdonald
Mandelbrot
Mantegna
Martindale
Michael A. Savageau
Mohr
Olver
Pachucki
Peter J. Mohr
Sergej V. Aksenov
Sidi
Simon
Truesdell
Tsallis
Tsallis
Ulrich D. Jentschura
van Wijngaarden
Weniger
Weniger
Williams
Wolfram
Yule
Zipf
Zörnig
Publication venue: 'Elsevier BV'
Publication date: 01/01/2002
Field of study

We discuss several applications of the recently proposed combined nonlinear-condensation transformation (CNCT) for the evaluation of slowly convergent, nonalternating series. These include certain statistical distributions which are of importance in linguistics, statistical-mechanics theory, and biophysics (statistical analysis of DNA sequences). We also discuss applications of the transformation in experimental mathematics, and we briefly expand on further applications in theoretical physics. Finally, we discuss a related Mathematica program for the computation of Lerch's transcendent.Comment: 23 pages, 1 table, 1 figure (Comput. Phys. Commun., in press

arXiv.org e-Print Archive

CiteSeerX

Crossref

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

Applying MDL to Learning Best Model Granularity

Author: Gao Qiong
Li Ming
Vitanyi Paul
Publication venue
Publication date: 01/01/2000
Field of study

The Minimum Description Length (MDL) principle is solidly based on a provably ideal method of inference using Kolmogorov complexity. We test how the theory behaves in practice on a general problem in model selection: that of learning the best model granularity. The performance of a model depends critically on the granularity, for example the choice of precision of the parameters. Too high precision generally involves modeling of accidental noise and too low precision may lead to confusion of models that should be distinguished. This precision is often determined ad hoc. In MDL the best model is the one that most compresses a two-part code of the data set: this embodies ``Occam's Razor.'' In two quite different experimental settings the theoretical value determined using MDL coincides with the best value found experimentally. In the first experiment the task is to recognize isolated handwritten characters in one subject's handwriting, irrespective of size and orientation. Based on a new modification of elastic matching, using multiple prototypes per character, the optimal prediction rate is predicted for the learned parameter (length of sampling interval) considered most likely by MDL, which is shown to coincide with the best value found experimentally. In the second experiment the task is to model a robot arm with two degrees of freedom using a three layer feed-forward neural network where we need to determine the number of nodes in the hidden layer giving best modeling performance. The optimal model (the one that extrapolizes best on unseen examples) is predicted for the number of nodes in the hidden layer considered most likely by MDL, which again is found to coincide with the best value found experimentally.Comment: LaTeX, 32 pages, 5 figures. Artificial Intelligence journal, To appea

arXiv.org e-Print Archive

Elsevier - Publisher Connector

CWI's Institutional Repository

CERN Document Server

International Migration, Integration and Social Cohesion online publications