4 research outputs found

    Some Reflections on the Task of Content Determination in the Context of Multi-Document Summarization of Evolving Events

    Despite its importance, the task of summarizing evolving events has received little attention from researchers in the field of multi-document summarization. In a previous paper (Afantenos et al. 2007) we presented a methodology for the automatic summarization of documents, emitted by multiple sources, that describe the evolution of an event. At the heart of this methodology lies the identification of similarities and differences between the documents along two axes: the synchronic and the diachronic. This is achieved by introducing the notion of Synchronic and Diachronic Relations. These relations connect the messages found in the documents, resulting in a graph which we call a grid. Although the creation of the grid completes the Document Planning phase of a typical NLG architecture, the number of messages contained in a grid can be very large, thus exceeding the required compression rate. In this paper we provide some initial thoughts on a probabilistic model which can be applied at the Content Determination stage and which tries to alleviate this problem. Comment: 5 pages, 2 figures

    A Proposal for Multi-Document Automatic Summarization Using Semantic-Discourse Models

    Automatic text summarizers are computational systems that aim to select the most important information in a text in order to produce a shorter version, called a summary. This article presents a proposal for multi-document automatic summarization based on semantic-discourse models, with the goal of producing more informative and coherent summaries. The summarization strategies will be applied to Portuguese-language news texts.

    A Study of the Application of Rhetorical Structure Theory (RST) to Multi-Document Summarization

    Rhetorical Structure Theory (RST) has been applied in different areas, such as single-document summarization, with promising results. In this paper, we study the impact of RST on multi-document summarization and discuss how the task may benefit from it. In particular, two methods are proposed: one based on predefined rules and one based on statistical learning. Results show that RST may contribute to producing more informative summaries. Funding: FAPESP, CAPES.