Learning Language from a Large (Unannotated) Corpus

Goertzel, Ben; Vepstas, Linas

Authors: Ben Goertzel, Linas Vepstas
Publication date: 14 January 2014

Abstract

A novel approach to the fully automated, unsupervised extraction of dependency grammars and associated syntax-to-semantic-relationship mappings from large text corpora is described. The suggested approach builds on the authors' prior work with the Link Grammar, RelEx and OpenCog systems, as well as on a number of prior papers and approaches from the statistical language learning literature. If successful, this approach would enable the mining of all the information needed to power a natural language comprehension and generation system, directly from a large, unannotated corpus.

Comment: 29 pages, 5 figures, research proposal

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.754.8...

Last time updated on 30/10/2017