Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding

Alexandros Lazaridis; Milos Cernak; Petr Motlicek; Philip N Garner

Stress and Accent Transmission In HMM-Based Syllable-Context Very Low Bit Rate Speech Coding

Authors: Alexandros Lazaridis
Milos Cernak
Petr Motlicek
Philip N Garner
Publication date: 11 April 2020
Publisher

Abstract

Abstract In this paper, we propose a solution to reconstruct stress and accent contextual factors at the receiver of a very low bitrate speech codec built on recognition/synthesis architecture. In speech synthesis, accent and stress symbols are predicted from the text, which is not available at the receiver side of the speech codec. Therefore, speech signal-based symbols, generated as syllable-level log average F0 and energy acoustic measures, quantized using a scalar quantization, are used instead of accentual and stress symbols for HMM-based speech synthesis. Results from incremental real-time speech synthesis confirmed, that a combination of F0 and energy signal-based symbols can replace their counterparts of text-based binary accent and stress symbols developed for text-to-speech systems. The estimated transmission bit-rate overhead is about 14 bits/second per acoustic measure

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.1052....

Last time updated on 07/12/2020