9 research outputs found

    Accurate Computation of the Relative Entropy Between Stochastic Regular Grammars

    Works dealing with grammatical inference of stochastic grammars often evaluate the relative entropy between the model and the true grammar by means of large test sets generated with the true distribution. In this paper, an iterative procedure to compute the relative entropy between two stochastic deterministic regular grammars is proposed. (Résumé, translated from French: Work on the inference of stochastic grammars evaluates the relative entropy between the model and the true grammar using large test sets generated with the true distribution. In this article, we propose an iterative procedure for computing the relative entropy between two grammars.)

    1 Introduction. Stochastic models have been widely used in computer science, especially in tasks dealing with noisy data or random sources, such as pattern recognition and natural language modeling. A stochastic model predicts a probability distribution for the events in the class under consideration, and one of the mo..
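    The baseline that the abstract describes (estimating relative entropy from a large test set sampled from the true distribution) can be sketched as follows. This is an illustrative Monte Carlo estimate, not the paper's iterative procedure; the one-state grammar (emit `a` with probability p, stop with probability 1 - p), the sample size, and the closed-form reference are all assumptions chosen so the estimate can be checked exactly.

    ```python
    import math
    import random

    def sample_length(p, rng):
        # Sample a string length from a one-state stochastic regular grammar:
        # keep emitting 'a' with probability p, stop with probability 1 - p.
        n = 0
        while rng.random() < p:
            n += 1
        return n

    def log_prob(n, p):
        # log P(a^n) = n*log(p) + log(1 - p) under the one-state grammar
        return n * math.log(p) + math.log(1 - p)

    def mc_relative_entropy(p, q, samples=200_000, seed=0):
        # Monte Carlo estimate of D(P || Q): sample from the true grammar P,
        # average the log-likelihood ratio against the model grammar Q.
        rng = random.Random(seed)
        total = 0.0
        for _ in range(samples):
            n = sample_length(p, rng)
            total += log_prob(n, p) - log_prob(n, q)
        return total / samples

    def exact_relative_entropy(p, q):
        # Closed form for this grammar: E[n]*log(p/q) + log((1-p)/(1-q)),
        # with E[n] = p / (1 - p) for the geometric length distribution.
        return (p / (1 - p)) * math.log(p / q) + math.log((1 - p) / (1 - q))

    est = mc_relative_entropy(0.6, 0.4)
    ref = exact_relative_entropy(0.6, 0.4)
    ```

    Even with 200,000 samples the estimate carries sampling noise, which is exactly the shortcoming the paper's exact iterative computation is meant to avoid.
    
    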


    In Language and Information Technologies

    With the rising amount of available multilingual text data, computational linguistics faces both an opportunity and a challenge. This text can enrich the domains of NLP applications and improve their performance. Traditional supervised learning for this kind of data would require annotating part of the text to induce natural language structure; for such large amounts of rich text, that annotation task can be daunting and expensive. Unsupervised learning of natural language structure can remove the need for such annotation. Natural language structure can be modeled with probabilistic grammars, generative statistical models that are well suited to compositional and sequential structure. Probabilistic grammars are widely used in natural language processing, but also in other fields such as computer vision, computational biology, and cognitive science. This dissertation presents a theoretical and an empirical analysis of learning these widely used grammars in the unsupervised setting. We analyze computational properties involved in estimation of probabilisti
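    The abstract describes probabilistic grammars as generative statistical models: each nonterminal expands by one of its rules, chosen according to the rule's probability, until only terminals remain. A minimal generative sketch, with an entirely hypothetical toy grammar (the rules and weights below are illustrative, not from the dissertation):

    ```python
    import random

    # Toy probabilistic grammar: each nonterminal maps to (right-hand side, probability)
    # pairs; the probabilities for each nonterminal sum to 1. Lowercase symbols are terminals.
    RULES = {
        "S":  [(["NP", "VP"], 1.0)],
        "NP": [(["det", "noun"], 0.7), (["noun"], 0.3)],
        "VP": [(["verb", "NP"], 0.6), (["verb"], 0.4)],
    }

    def generate(symbol, rng):
        # Terminals are emitted as-is; nonterminals pick a rule by its probability
        # and recursively expand each symbol on the chosen right-hand side.
        if symbol not in RULES:
            return [symbol]
        expansions = [rhs for rhs, _ in RULES[symbol]]
        weights = [w for _, w in RULES[symbol]]
        rhs = rng.choices(expansions, weights=weights)[0]
        out = []
        for s in rhs:
            out.extend(generate(s, rng))
        return out

    rng = random.Random(1)
    sent = generate("S", rng)
    ```

    Sampling many such derivations defines a probability distribution over strings; unsupervised learning, as studied in the dissertation, works in the opposite direction, estimating the rule probabilities (and possibly the rules) from unannotated text.
    
    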