Skip to main content
Article thumbnail
Location of Repository

A Comparison of Automatic Summarizers of Texts in Brazilian Portuguese

By Lucia H.M. Rino, Thiago A.S. Pardo, Carlos N. Silla Jr, Celso A.A. Kaestner and Michael Pombo

Abstract

Automatic Summarization (AS) in Brazil has only recently become a significant research topic. When compared to other languages initiatives, such a delay can be explained by the lack of specific resources, such as expressive lexicons and corpora that could provide adequate foundations for deep or shallow approaches on AS. Taking advantage of having commonalities with respect to resources and a corpus of texts and summaries written in Brazilian Portuguese, two NLP research groups have decided to start a common task to assess and compare their AS systems. In the experiment five distinct extractive AS systems have been assessed. Some of them incorporate techniques that have been already used to summarize texts in English; others propose novel approaches to AS. Two baseline systems have also been considered. An overall performance comparison has been carried out, and its outcomes are discussed in this paper

Topics: QA76
Publisher: Springer
Year: 2004
OAI identifier: oai:kar.kent.ac.uk:24120

Suggested articles

Citations

  1. (2000). A.A.: Document clustering and text summarization. In: doi
  2. (2000). A.C.: The Building of an Electronic thesaurus for Brazilian Portuguese (in Portuguese). In:
  3. (1999). Advances in Automatic Text Summarization. doi
  4. (2001). Automatic Summarization. doi
  5. (2002). C.A.A.: Automatic text summarization using a machine learning approach. In: doi
  6. (2002). Contribution to the study of automatic text summarization techniques (in Portuguese).
  7. (2003). DiZer: An automatic discourse analysis proposal to brazilian portuguese (in Portuguese). In: doi
  8. (2003). E.: Evaluation challenges in large-scale document summarization. In: doi
  9. (1998). E.: Machine learning of generic and user-focused summarization. In: doi
  10. (2001). Evaluation of a stemming algorithm for the Portuguese language (in Portuguese). In:
  11. (1995). F.: A trainable document summarizer. In: doi
  12. (2002). Generating indicative-informative summaries with sumUM. doi
  13. (2003). GistSumm: A summarization tool based on a new extractive method. In: doi
  14. (2003). H.R.: How much information. Retrieved from http://www.sims.berkeley.edu/how-much-info-2003 on [01/19/2004]
  15. (2003). L.H.M.: TeMário: A corpus for automatic text summarization (in Portuguese).
  16. (2002). M.G.V.: Curupira: a functional parser for Portuguese (in Portuguese). doi
  17. (2003). M.G.V.: NeuralSumm: A connexionist approach to automatic text summarization (in Portuguese). In:
  18. (1969). New methods in automatic extracting. doi
  19. (2000). O.N.: Combining classifiers to improve part of speech tagging: A case study for Brazilian Portuguese. In:
  20. (1982). Self organized formation of topologically correct feature maps. doi
  21. (2002). Summarizing scientific articles: Experiments with relevance and rhetorical status. doi
  22. (2003). Supor: an environment for exploration of extractive methods for automatic text summarization for portuguese (in Portuguese). Master’s thesis, Departamento de Computação,
  23. (1988). Term-weighting approaches in automatic text retrieval. doi
  24. (1958). The automatic creation of literature abstracts. doi
  25. (1996). The building of a Brazilian Portuguese lexicon for supporting automatic grammar checking (in Portuguese).
  26. (1999). Using Lexical Chains for Text Summarization. In: Advances in Automatic Text Summarization. doi

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.