Low-rank Adaptation of Large Language Model Rescoring for
  Parameter-Efficient Speech Recognition

Bulyko, Ivan; Chen, I-Fan; Dinh, Tuan; Filimonov, Denis; Gandhe, Ankur; Ghosh, Shalini; Gourav, Aditya; Gu, Yile; Kolehmainen, Jari; Liu, Yi-Chieh; Luo, Qi; Rastow, Ariya; Ren, Roger; Ryu, Sungho; Shivakumar, Prashanth G.; Stolcke, Andreas; Yang, Chao-Han Huck; Yu, Yu

Publication date: 10 October 2023
Abstract

We propose a neural language modeling system based on low-rank adaptation (LoRA) for rescoring speech recognition output. Although pretrained language models (LMs) such as BERT have shown superior performance in second-pass rescoring, the high computational cost of scaling up the pretraining stage and of adapting the pretrained models to specific domains limits their practical use in rescoring. Here we present a method based on low-rank decomposition that trains a rescoring BERT model and adapts it to new domains using only a fraction (0.08%) of the pretrained parameters. The inserted low-rank matrices are optimized with a discriminative training objective together with a correlation-based regularization loss. The proposed low-rank adaptation Rescore-BERT (LoRB) architecture is evaluated on LibriSpeech and internal datasets, with training times reduced by factors between 3.6 and 5.4.

Comment: Accepted to IEEE ASRU 2023. Internal Review Approved. Revised 2nd version with Andreas and Huck. The first version was on Sep 29th. 8 pages.
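The low-rank decomposition mentioned in the abstract can be illustrated with a minimal sketch of a generic LoRA-style linear layer. This is not the authors' implementation: the class name `LoRALinear`, the `alpha / rank` scaling, the initialization, and the toy dimensions are all assumptions made for illustration, and the abstract does not say which BERT matrices are adapted or what rank is used. The sketch only shows the general idea: the pretrained weight stays frozen while two small matrices are trained, so the trainable parameter count is a small fraction of the frozen one.

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


class LoRALinear:
    """Hypothetical LoRA-style layer: frozen W0 plus a trainable low-rank update B @ A."""

    def __init__(self, d_in, d_out, rank, alpha=1.0, seed=0):
        rng = random.Random(seed)
        # Frozen "pretrained" weight, shape d_out x d_in (random stand-in here).
        self.w0 = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(d_out)]
        # Trainable low-rank factors: A is rank x d_in, B is d_out x rank.
        self.a = [[rng.gauss(0, 0.02) for _ in range(d_in)] for _ in range(rank)]
        # B starts at zero so the adapted layer initially equals the frozen one.
        self.b = [[0.0] * rank for _ in range(d_out)]
        self.scale = alpha / rank  # assumed scaling convention

    def effective_weight(self):
        """Return W0 + scale * (B @ A)."""
        delta = matmul(self.b, self.a)
        return [
            [w + self.scale * d for w, d in zip(w_row, d_row)]
            for w_row, d_row in zip(self.w0, delta)
        ]

    def forward(self, x):
        """Apply the adapted weight to a single input vector of length d_in."""
        return [sum(w * xi for w, xi in zip(row, x)) for row in self.effective_weight()]

    def trainable_fraction(self):
        """Trainable (A and B) parameters relative to the frozen W0 parameters."""
        frozen = len(self.w0) * len(self.w0[0])
        trainable = len(self.a) * len(self.a[0]) + len(self.b) * len(self.b[0])
        return trainable / frozen


layer = LoRALinear(d_in=16, d_out=16, rank=2)
print(layer.trainable_fraction())
```

In this toy setting the ratio is large because the matrices are tiny. For a d x k weight the ratio is r(d + k) / (d k), which shrinks quickly as d and k grow relative to the rank r. The overall fraction for a full model also depends on how many of its weight matrices receive an update.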

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2309.15223

Last time updated on 14/12/2023