Search CORE

1 research outputs found

The economic trade-offs of large language models: A case study

Author: Bradley Joseph
Christian Gwen
Fomitchov Pavel
Howell Kristen
Kehat Gitit
Marzulla Julianne
Rolston Leanne
Selfridge Ethan
Tredup Jadin
Zimmerman Ilana
Publication venue
Publication date: 08/06/2023
Field of study

Contacting customer service via chat is a common practice. Because employing customer service agents is expensive, many companies are turning to NLP that assists human agents by auto-generating responses that can be used directly or with modifications. Large Language Models (LLMs) are a natural fit for this use case; however, their efficacy must be balanced with the cost of training and serving them. This paper assesses the practical cost and impact of LLMs for the enterprise as a function of the usefulness of the responses that they generate. We present a cost framework for evaluating an NLP model's utility for this use case and apply it to a single brand as a case study in the context of an existing agent assistance product. We compare three strategies for specializing an LLM - prompt engineering, fine-tuning, and knowledge distillation - using feedback from the brand's customer service agents. We find that the usability of a model's responses can make up for a large difference in inference cost for our case study brand, and we extrapolate our findings to the broader enterprise space.Comment: Paper to be published at the Association for Computational Linguistics in the Industry Track 202

arXiv.org e-Print Archive