A Formal Model for Information Selection in Multi-Sentence Text Extraction

Filatova, Elena; Hatzivassiloglou, Vasileios

A Formal Model for Information Selection in Multi-Sentence Text Extraction

Authors: Elena Filatova
Vasileios Hatzivassiloglou
Publication date: 1 January 2004
Publisher: 'Columbia University Libraries/Information Services'
Doi

Abstract

Selecting important information while accounting for repetitions is a hard task for both summarization and question answering. We propose a formal model that represents a collection of documents in a two-dimensional space of textual and conceptual units with an associated mapping between these two dimensions. This representation is then used to describe the task of selecting textual units for a summary or answer as a formal optimization task. We provide approximation algorithms and empirically validate the performance of the proposed model when used with two very different sets of features, words and atomic events

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Crossref

Last time updated on 20/07/2021

Sustaining member

Columbia University Academic Commons

oai:academiccommons.columbia.e...

Last time updated on 02/10/2018