6 research outputs found

Equations of States in Statistical Learning for a Nonparametrizable and Regular Case

Author: Watanabe Sumio
Publication venue: 'Institute of Electronics, Information and Communications Engineers (IEICE)'
Publication date: 02/06/2009

Many learning machines that have hierarchical structure or hidden variables are now being used in information science, artificial intelligence, and bioinformatics. However, several learning machines used in such fields are singular rather than regular statistical models, so their generalization performance remains unknown. To overcome this problem, in previous papers we proved new equations in statistical learning by which the Bayes generalization loss can be estimated from the Bayes training loss and the functional variance, on the condition that the true distribution is a singularity contained in the learning machine. In this paper, we prove that the same equations hold even if the true distribution is not contained in the parametric model. We also prove that, in a regular case, the proposed equations are asymptotically equivalent to the Takeuchi information criterion. Therefore, the proposed equations are always applicable without any condition on the unknown true distribution.

arXiv.org e-Print Archive

Crossref

A Bayesian information criterion for singular models

Author: Drton Mathias, Plummer Martyn
Publication date: 23/03/2016

We consider approximate Bayesian model choice for model selection problems that involve models whose Fisher-information matrices may fail to be invertible along other competing submodels. Such singular models do not obey the regularity conditions underlying the derivation of Schwarz's Bayesian information criterion (BIC), and the penalty structure in BIC generally does not reflect the frequentist large-sample behavior of their marginal likelihood. While large-sample theory for the marginal likelihood of singular models has been developed recently, the resulting approximations depend on the true parameter value and lead to a paradox of circular reasoning. Guided by examples such as determining the number of components of mixture models, the number of factors in latent factor models, or the rank in reduced-rank regression, we propose a resolution to this paradox and give a practical extension of BIC for singular model selection problems.

arXiv.org e-Print Archive

CiteSeerX

Crossref

Warwick Research Archives Portal Repository

Learning coefficients of layered models when the true distribution mismatches the singularities

Author: Watanabe Sumio (渡邊澄夫)
Publication date: 30/11/2006

Institutional Repositories DataBase (IRDB)

Learning Coefficients of Layered Models When the True Distribution Mismatches the Singularities

Author: Efron B., Shun-ichi Amari, Sumio Watanabe, Watanabe S.
Publication venue: 'MIT Press - Journals'

Crossref