From Clozing to Comprehending: Retrofitting Pre-trained Language Model
  to Pre-trained Machine Reader

Bing, Lidong; Lam, Wai; Li, Xin; Si, Luo; Xu, Weiwen; Zhang, Wenxuan; Zhou, Meng

From Clozing to Comprehending: Retrofitting Pre-trained Language Model to Pre-trained Machine Reader

Authors: Lidong Bing
Wai Lam
Xin Li
Luo Si
Weiwen Xu
Wenxuan Zhang
Meng Zhou
Publication date: 9 December 2022
Publisher

Abstract

We present Pre-trained Machine Reader (PMR), a novel method to retrofit Pre-trained Language Models (PLMs) into Machine Reading Comprehension (MRC) models without acquiring labeled data. PMR is capable of resolving the discrepancy between model pre-training and downstream fine-tuning of existing PLMs, and provides a unified solver for tackling various extraction tasks. To achieve this, we construct a large volume of general-purpose and high-quality MRC-style training data with the help of Wikipedia hyperlinks and design a Wiki Anchor Extraction task to guide the MRC-style pre-training process. Although conceptually simple, PMR is particularly effective in solving extraction tasks including Extractive Question Answering and Named Entity Recognition, where it shows tremendous improvements over previous approaches especially under low-resource settings. Moreover, viewing sequence classification task as a special case of extraction task in our MRC formulation, PMR is even capable to extract high-quality rationales to explain the classification process, providing more explainability of the predictions

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2212.04755

Last time updated on 08/01/2023