Location of Repository

GLEANS: A Generator of Logical Extracts and Abstracts for Nice Summaries

By Hal Daum, III, Abdessamad Echihabi, Daniel Marcu, Dragos Stefan Munteanu and Radu Soricu

Abstract

We describe GLEANS, a summarization system that uses four novel techniques for summarizating document collections. (i) GLEANS first maps all documents in a collection into a canonical, database-like representation that makes explicit the main entities and relations in a document collection. (ii) GLEANS also classifies each document collection into one of four categories: collections about a single person, single event, multiple event, and natural disaster. (iii) For each type of document collection, GLEANS generates a short headline using a set of predefined templates. For multi-event documents, GLEANS also generates from scratch, using predefined templates, the first two sentences in the abstract. (iv) The rest of the summary is then generated by extracting from the database sentences that conform to a set of predefined schemas and by presenting them in an order that reflects coherence constraints specific to each collection category

Year: 2002
OAI identifier: oai:CiteSeerX.psu:10.1.1.19.7357
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • http://www.isi.edu/~marcu/pape... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.