MoCoUTRL: a momentum contrastive framework for unsupervised text representation learning

Ao Zou; Dawei Jin; Feiyan Sun; Gang Chen; Wenning Hao

MoCoUTRL: a momentum contrastive framework for unsupervised text representation learning

Authors: Ao Zou
Dawei Jin
Feiyan Sun
Gang Chen
Wenning Hao
Publication date: 1 December 2023
Publisher: Taylor & Francis Group
Doi

Abstract

This paper presents MoCoUTRL: a Momentum Contrastive Framework for Unsupervised Text Representation Learning. This model improves two aspects of recently popular contrastive learning algorithms in natural language processing (NLP). Firstly, MoCoUTRL employs multi-granularity semantic contrastive learning objectives, enabling a more comprehensive understanding of the semantic features of samples. Secondly, MoCoUTRL uses a dynamic dictionary to act as the approximately ground-truth representation for each token, providing the pseudo labels for token-level contrastive learning. The MoCoUTRL can extend the use of pre-trained language models (PLM) and even large-scale language models (LLM) into a plug-and-play semantic feature extractor that can fuel multiple downstream tasks. Experimental results on several publicly available datasets and further theoretical analysis validate the effectiveness and interpretability of the proposed method in this paper

Similar works

Full text

Available Versions

Directory of Open Access Journals

oai:doaj.org/article:8d9dceb6c...

Last time updated on 06/10/2023