Generating Efficient Training Data via LLM-based Attribute Manipulation

Peng, Letian; Shang, Jingbo; Zhang, Yuwei

Generating Efficient Training Data via LLM-based Attribute Manipulation

Authors: Letian Peng
Jingbo Shang
Yuwei Zhang
Publication date: 13 July 2023
Publisher

Abstract

In this paper, we propose a novel method, Chain-of-Thoughts Attribute Manipulation (CoTAM), to guide few-shot learning by carefully crafted data from Large Language Models (LLMs). The main idea is to create data with changes only in the attribute targeted by the task. Inspired by facial attribute manipulation, our approach generates label-switched data by leveraging LLMs to manipulate task-specific attributes and reconstruct new sentences in a controlled manner. Instead of conventional latent representation controlling, we implement chain-of-thoughts decomposition and reconstruction to adapt the procedure to LLMs. Extensive results on text classification and other tasks verify the advantage of CoTAM over other LLM-based text generation methods with the same number of training examples. Analysis visualizes the attribute manipulation effectiveness of CoTAM and presents the potential of LLM-guided learning with even less supervision

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2307.07099

Last time updated on 20/07/2023