Hierarchical Catalogue Generation for Literature Review: A Benchmark

Feng, Xiachong; Feng, Xiaocheng; Qin, Bing; Wu, Yingsheng; Zhu, Kun

Hierarchical Catalogue Generation for Literature Review: A Benchmark

Authors: Xiachong Feng
Xiaocheng Feng
Bing Qin
Yingsheng Wu
Kun Zhu
Publication date: 10 April 2023
Publisher

Abstract

Multi-document scientific summarization can extract and organize important information from an abundant collection of papers, arousing widespread attention recently. However, existing efforts focus on producing lengthy overviews lacking a clear and logical hierarchy. To alleviate this problem, we present an atomic and challenging task named Hierarchical Catalogue Generation for Literature Review (HiCatGLR), which aims to generate a hierarchical catalogue for a review paper given various references. We carefully construct a novel English Hierarchical Catalogues of Literature Reviews Dataset (HiCaD) with 13.8k literature review catalogues and 120k reference papers, where we benchmark diverse experiments via the end-to-end and pipeline methods. To accurately assess the model performance, we design evaluation metrics for similarity to ground truth from semantics and structure. Besides, our extensive analyses verify the high quality of our dataset and the effectiveness of our evaluation metrics. Furthermore, we discuss potential directions for this task to motivate future research

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2304.03512

Last time updated on 14/04/2023