Search CORE

10 research outputs found

Learning Features that Predict Cue Usage

Author: Di Eugenio Barbara
Moore Johanna D.
Paolucci Massimo
Publication venue
Publication date: 01/01/1997
Field of study

Our goal is to identify the features that predict the occurrence and placement of discourse cues in tutorial explanations in order to aid in the automatic generation of explanations. Previous attempts to devise rules for text generation were based on intuition or small numbers of constructed examples. We apply a machine learning program, C4.5, to induce decision trees for cue occurrence and placement from a corpus of data coded for a variety of features previously thought to affect cue usage. Our experiments enable us to identify the features with most predictive power, and show that machine learning can be used to induce decision trees useful for text generation.Comment: 10 pages, 2 Postscript figures, uses aclap.sty, psfig.te

arXiv.org e-Print Archive

CiteSeerX

Crossref

Edinburgh Research Explorer

Some Challenges of Advanced Question-Answering: an Experiment with How-to Questions

Author: Saint-Dizier Patrick
Publication venue: De La Salle University - Dasmarinas
Publication date: 01/01/2008
Field of study

PACLIC / The University of the Philippines Visayas Cebu College Cebu City, Philippines / November 20-22, 200

CiteSeerX

Waseda University Repository

The language of explanation dedicated

Author: Bourse Sara
Saint-Dizier Patrick
Publication venue: 'Universitat de Lleida'
Publication date: 01/01/2011
Field of study

Revistes Catalanes amb Accés Obert

Discourse structure analysis for requirement mining

Author: Kang Juyeon
Saint Dizier Patrick
Publication venue: 'Research Institute for Knowledge Content Development & Technology'
Publication date: 01/01/2013
Field of study

International audienceIn this work, we first introduce two main approaches to writing requirements and then propose a method based on Natural Language Processing to improve requirement authoring and the overall coherence, cohesion and organization of requirement documents. We investigate the structure of requirement kernels, and then the discourse structure associated with those kernels. This will then enable the system to accurately extract requirements and their related contexts from texts (called requirement mining). Finally, we relate a first experimentation on requirement mining based on texts from seven companies. An evaluation that compares those results with manually annotated corpora of documents is given to conclude

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

Directory of Open Access Journals

Open Archive Toulouse Archive Ouverte

Improving discourse structure identification

Author: Guidry Jamie Allison
Publication venue: LSU Digital Commons
Publication date: 01/01/2012
Field of study

Rhetorical Structure Theory (Mann et al. 1988), a popular approach for analyzing discourse coherence, suggests that coherent text can be placed into a hierarchical organization of clauses. Identification of a text’s rhetorical structure through automatic discourse analysis is a crucial element for many of today’s Natural Language Processing tasks, but no sufficient tool is available. The current state-of -the-art discourse parser, SPADE (Soricut et al. 2003), is limited to parsing discourse within a single sentence. HILDA (Hernault et al. 2010) extends the parsing abilities of SPADE to the document level, but with a decrease in performance. This study achieved document-level discourse parsing without sacrificing performance. Provided text was already segmented into elementary discourse units, the task of discourse parsing was separated into three steps: structuring, nuclearity labeling, and relation labeling. An algorithm was developed for classifying relation existence, nuclearity, and relation label that improved upon previous methods. New features were explored for all three steps to maintain state-of-the-art performance when parsing at the document-level

Louisiana State University

Customizing RST for the automatic production of technical manuals

Author: Roesner D.
Ulm Univ. (Germany). Forschungsinstitut fuer Anwendungsorientierte Wissensverarbeitung (FAW)
Publication venue
Publication date: 01/01/1991
Field of study

SIGLEAvailable from TIB Hannover: RO 9630(91028) / FIZ - Fachinformationszzentrum Karlsruhe / TIB - Technische InformationsbibliothekDEGerman

OpenGrey Repository

Natural language generation in the LOLITA system an engineering approach

Author: Smith Mark H.
Publication venue
Publication date: 01/01/1995
Field of study

Natural Language Generation (NLG) is the automatic generation of Natural Language (NL) by computer in order to meet communicative goals. One aim of NL processing (NLP) is to allow more natural communication with a computer and, since communication is a two-way process, a NL system should be able to produce as well as interpret NL text. This research concerns the design and implementation of a NLG module for the LOLITA system. LOLITA (Large scale, Object-based, Linguistic Interactor, Translator and Analyser) is a general purpose base NLP system which performs core NLP tasks and upon which prototype NL applications have been built. As part of this encompassing project, this research shares some of its properties and methodological assumptions: the LOLITA generator has been built following Natural Language Engineering principles uses LOLITA's SemNet representation as input and is implemented in the functional programming language Haskell. As in other generation systems the adopted solution utilises a two component architecture. However, in order to avoid problems which occur at the interface between traditional planning and realisation modules (known as the generation gap) the distribution of tasks between the planner and plan-realiser is different: the plan-realiser, in the absence of detailed planning instructions, must perform some tasks (such as the selection and ordering of content) which are more traditionally performed by a planner. This work largely concerns the development of the plan- realiser and its interface with the planner. Another aspect of the solution is the use of Abstract Transformations which act on the SemNet input before realisation leading to an increased ability for creating paraphrases. The research has lead to a practical working solution which has greatly increased the power of the LOLITA system. The research also investigates how NLG systems can be evaluated and the advantages and disadvantages of using a functional language for the generation task

Durham e-Theses