An Exploration of In-Context Learning for Speech Language Model

Chang, Kai-Wei; Hsu, Ming-Hao; Lee, Hung-yi; Li, Shang-Wen

An Exploration of In-Context Learning for Speech Language Model

Authors: Kai-Wei Chang
Ming-Hao Hsu
Hung-yi Lee
Shang-Wen Li
Publication date: 19 October 2023
Publisher

Abstract

Ever since the development of GPT-3 in the natural language processing (NLP) field, in-context learning (ICL) has played an important role in utilizing large language models (LLMs). By presenting the LM utterance-label demonstrations at the input, the LM can accomplish few-shot learning without relying on gradient descent or requiring explicit modification of its parameters. This enables the LM to learn and adapt in a black-box manner. Despite the success of ICL in NLP, little work is exploring the possibility of ICL in speech processing. This study proposes the first exploration of ICL with a speech LM without text supervision. We first show that the current speech LM does not have the ICL capability. With the proposed warmup training, the speech LM can, therefore, perform ICL on unseen tasks. In this work, we verify the feasibility of ICL for speech LM on speech classification tasks.Comment: The first two authors contributed equall

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2310.12477

Last time updated on 06/01/2024