Self-supervised pretraining has made few-shot learning possible for many NLP
tasks. However, pretraining objectives are not typically adapted specifically
for in-context few-shot learning. In this paper, we propose to use
self-supervision in an intermediate training stage between pretraining and
downstream few-shot usage, with the goal of teaching the model to perform
in-context few-shot learning. We propose and evaluate four self-supervised
objectives on two benchmarks. We find that the intermediate self-supervision
stage produces models that outperform strong baselines. An ablation study shows
that several factors affect downstream performance, such as the amount of
training data and the diversity of the self-supervised objectives.
We also find that human-annotated cross-task supervision and self-supervision are complementary.
Qualitative analysis suggests that models trained with self-supervision are
better at following task requirements.