TSST: A Benchmark and Evaluation Models for Text Speech-Style Transfer

Gao, Yang; Li, Jiawei; Li, Yinghao; Sun, Huashan; Wu, Yixiao; Yang, Yizhe

TSST: A Benchmark and Evaluation Models for Text Speech-Style Transfer

Authors: Yang Gao
Jiawei Li
Yinghao Li
Huashan Sun
Yixiao Wu
Yizhe Yang
Publication date: 14 November 2023
Publisher

Abstract

Text style is highly abstract, as it encompasses various aspects of a speaker's characteristics, habits, logical thinking, and the content they express. However, previous text-style transfer tasks have primarily focused on data-driven approaches, lacking in-depth analysis and research from the perspectives of linguistics and cognitive science. In this paper, we introduce a novel task called Text Speech-Style Transfer (TSST). The main objective is to further explore topics related to human cognition, such as personality and emotion, based on the capabilities of existing LLMs. Considering the objective of our task and the distinctive characteristics of oral speech in real-life scenarios, we trained multi-dimension (i.e. filler words, vividness, interactivity, emotionality) evaluation models for the TSST and validated their correlation with human assessments. We thoroughly analyze the performance of several large language models (LLMs) and identify areas where further improvement is needed. Moreover, driven by our evaluation models, we have released a new corpus that improves the capabilities of LLMs in generating text with speech-style characteristics. In summary, we present the TSST task, a new benchmark for style transfer and emphasizing human-oriented evaluation, exploring and advancing the performance of current LLMs.Comment: Working in progres

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2311.08389

Last time updated on 10/02/2024