Search CORE

3 research outputs found

Developing Corpora for Statistical Graphical Language Models

Author: Keyes Laura
O'Sullivan Andrew
Winstanley Adam C.
Publication venue
Publication date: 01/01/2006
Field of study

In this work Statistical Graphical Language Models (SGLMs), a technique adapted from Statistical Language Models (SLMs), are applied to the task of graphical object recognition. SLMs are used in Natural Language Processing for tasks such as Speech Recognition and Information Retrieval. SGLMs view graphical objects as belonging to graphical languages and use this view to compute probabilistic distributions of graphical objects within graphical documents. SGLMs such as N-grams require large corpora of training data, which consist of graphical objects in contextual use (real world graphical documents). Constructing corpora is an important stage in developing the models and many issues need to be addressed. This paper discusses the development of graphical corpora and presents approaches to some of the problems encountered

MURAL - Maynooth University Research Archive Library

Maynooth University ePrints and eTheses Archive

NUI Maynooth Eprint Archive

Developing Corpora for Statistical Graphical Language Models

Author: Keyes Laura
O'Sullivan Andrew
Winstanley Adam C.
Publication venue
Publication date: 01/01/2006
Field of study

Developing Corpora for Statistical Graphical Language Models

Author: Keyes Laura
O'Sullivan Andrew
Winstanley Adam C.
Publication venue
Publication date: 01/01/2006
Field of study

Maynooth University ePrints and eTheses Archive