This work deals with a linear prediction and cepstral synthesis of speech signal in the TTS (Text-to-Speech) systems with the opportunity of modeling the prosody. The work contains a description of speech signal in acoustic and phonetic plane, the principle of speech production and the way we can figure the speech signal in time and frequency domain. Next, there is the TTS block structure mentioned, whereas each block has its own detailed description. In the work, the modeling of prosody using the three most important suprasegmental features (fundamental tone, continuation and speech intensity) is also described. At the end of this work, there is a design and realization of universal Czech TTS system which is based on the speech synthesis in frequency domain. This system is implemented in program MATLAB
To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.