Location of Repository

Description of the UAM system for generating very short summaries at DUC-2003

By Enrique Alfonseca, José María and Guirao Antonio Moreno-sandoval

Abstract

This paper describes the techniques used for producing very short summaries (around 75 bytes) of single documents. As in last year’s version, the processing has been divided into two separate steps: firstly, a sentence extractor selects the most relevant sentences from the document; and, next, portions of those sentences are put together in order to produce the final headline. The main novelty is the way in which the text chunks have been weighted, and completed with keywords and noun phrases obtained from the documents. Our runs are ranked in the 12 th and 13 th positions with respect to unigram recall (ROUGE-1), but failed to identify bigrams, trigrams and four-grams as accurately as most of the other systems.

Year: 2011
OAI identifier: oai:CiteSeerX.psu:10.1.1.184.5492
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • http://duc.nist.gov/pubs/2004p... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.