Search CORE

345,406 research outputs found

Recommended from our members

Parsing with parallelism : a spreading-activation model of inference processing during text understanding

Author: Eiselt Kurt P.
Granger Richard H.
Holbrook Jennifer K.
Publication venue: eScholarship, University of California
Publication date: 01/01/1984
Field of study

The past decade of reseatch in Natural Language Processing has universally recognized that, since natural language input is almost always ambiguous with respect to its pragmatic implications, its syntactic parse, and even its lexical analysis (i.e., choice of correct word-sense for an ambiguous word), processing natural language input requires decisions about word meanings, syntactic structure, and pragmatic inferences. The lexical, syntactic, and pragmatic levels of inferencing are not as disparate as they have often been treated in both psychological and artificial intelligence research. In fact, these three levels of analysis interact to form a joint interpretation of text.ATLAST (A Three-level Language Analysis SysTem) is an implemented integration of human language understanding at the lexical, the syntactic, and the pragmatic levels. For psychological validity, ATLAST is based on results of experiments with human subjects. The ATLAST model uses a new architecture which was developed to incorporate three features: spreading activation memory, two-stage syntax, and parallel processing of syntax and semantics. It is also a new framework within which to interpret and tackle unsolved problems through implementation and experimentation

eScholarship - University of California

Automatic Identification of AltLexes using Monolingual Parallel Corpora

Author: Davoodi Elnaz
Kosseim Leila
Publication venue
Publication date: 11/08/2017
Field of study

The automatic identification of discourse relations is still a challenging task in natural language processing. Discourse connectives, such as "since" or "but", are the most informative cues to identify explicit relations; however discourse parsers typically use a closed inventory of such connectives. As a result, discourse relations signaled by markers outside these inventories (i.e. AltLexes) are not detected as effectively. In this paper, we propose a novel method to leverage parallel corpora in text simplification and lexical resources to automatically identify alternative lexicalizations that signal discourse relation. When applied to the Simple Wikipedia and Newsela corpora along with WordNet and the PPDB, the method allowed the automatic discovery of 91 AltLexes.Comment: 6 pages, Proceedings of Recent Advances in Natural Language Processing (RANLP 2017

arXiv.org e-Print Archive

Crossref

Unsupervised generation of parallel treebanks through sub-tree alignment

Author: Zhechev Ventsislav
Publication venue: Charles University, Prague
Publication date: 01/01/2009
Field of study

The need for syntactically annotated data for use in natural language processing has increased dramatically in recent years. This is true especially for parallel treebanks, of which very few exist. The ones that exist are mainly hand-crafted and too small for reliable use in data-oriented applications. In this paper we introduce an open-source system for fast and robust automatic generation of parallel treebanks. We expect the opening of the presented platform to the scientific community to help boost research in the field of data-oriented machine translation and lead to advancements in other fields where parallel treebanks can be employed

CiteSeerX

DCU Online Research Access Service

Evaluation of the NLP Components of the OVIS2 Spoken Dialogue System

Author: Bonnema Remko
Bouma Gosse
Sima'an Khalil
van Noord Gertjan
van Zanten Gert Veldhuijzen
Publication venue
Publication date: 01/01/1999
Field of study

The NWO Priority Programme Language and Speech Technology is a 5-year research programme aiming at the development of spoken language information systems. In the Programme, two alternative natural language processing (NLP) modules are developed in parallel: a grammar-based (conventional, rule-based) module and a data-oriented (memory-based, stochastic, DOP) module. In order to compare the NLP modules, a formal evaluation has been carried out three years after the start of the Programme. This paper describes the evaluation procedure and the evaluation results. The grammar-based component performs much better than the data-oriented one in this comparison.Comment: Proceedings of CLIN 9

arXiv.org e-Print Archive

CiteSeerX

Natural Language Processing of Large Parallel Corpora

Author: Varga Dániel
Publication venue
Publication date: 01/01/2012
Field of study

ELTE Digital Institutional Repository (EDIT)