Search CORE

3 research outputs found

Creating a Reference Data Set for the Summarization of Discussion Forum Threads

Author: Antal van den Bosch
AS Tigelaar
B Richards
Emiel Krahmer
G Erkan
I Mani
Iris Hendrickx
JR Landis
M Mitray
Sander Wubben
Suzan Verberne
U Hahn
V Gupta
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

In this paper we address extractive summarization of long threads in online discussion fora. We present an elaborate user evaluation study to determine human preferences in forum summarization and to create a reference data set. We showed long threads to ten different raters and asked them to create a summary by selecting the posts that they considered to be the most important for the thread. We study the agreement between human raters on the summarization task, and we show how multiple reference summaries can be combined to develop a successful model for automatic summarization. We found that although the inter-rater agreement for the summarization task was slight to fair, the automatic summarizer obtained reasonable results in terms of precision, recall, and ROUGE. Moreover, when human raters were asked to choose between the summary created by another human and the summary created by our model in a blind side-by-side comparison, they judged the model's summary equal to or better than the human summary in over half of the cases. This shows that even for a summarization task with low inter-rater agreement, a model can be trained that generates sensible summaries. In addition, we investigated the potential for personalized summarization. However, the results for the three raters involved in this experiment were inconclusive. We release the reference summaries as a publicly available dataset

Crossref

Leiden University Scholary Publications

Tilburg University Repository

Creating a reference data set for the summarization of discussion forum threads

Author: Antal van den Bosch
AS Tigelaar
B Richards
Emiel Krahmer
G Erkan
I Mani
Iris Hendrickx
JR Landis
M Mitray
Sander Wubben
Suzan Verberne
U Hahn
V Gupta
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Argument discovery via crowdsourcing

Author: A McAfee
A McCallum
AJ Freeley
Chi Thang Duong
E Cabrio
F Wolf
FR Kschischang
Hongzhi Yin
J Gao
K Krippendorff
Karl Aberer
M Banko
M Mitray
MA Hearst
Matthias Weidlich
P Besnard
PN Tan
Quoc Viet Hung Nguyen
T Yuan
Thanh Tam Nguyen
Xiaofang Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref