Search CORE

1,648 research outputs found

Speech Synthesis Based on Hidden Markov Models

Author: Nankaku Y.
Oura K.
Toda T.
Tokuda K.
Yamagishi J.
Zen H.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/05/2013
Field of study

Edinburgh Research Explorer

Voice Conversion

Author: Elina Helander
Hanna Silén
Jani Nurminen
Moncef Gabbouj
Victor Popa
Publication venue: 'IntechOpen'
Publication date: 14/03/2012
Field of study

IntechOpen

Adapting Prosody in a Text-to-Speech System

Author: Caglayan Erdem
Janez Stergar
Publication venue: 'IntechOpen'
Publication date: 02/11/2010
Field of study

IntechOpen

Cross-Lingual Neural Network Speech Synthesis Based on Multiple Embeddings

Author: Delić Vlado D.
Nosek Tijana V.
Obradović Radovan J.
Pekar Darko J.
Sečujski Milan S.
Suzić Siniša B.
Publication venue: 'Universidad Internacional de La Rioja'
Publication date: 11/05/2022
Field of study

The paper presents a novel architecture and method for speech synthesis in multiple languages, in voices of multiple speakers and in multiple speaking styles, even in cases when speech from a particular speaker in the target language was not present in the training data. The method is based on the application of neural network embedding to combinations of speaker and style IDs, but also to phones in particular phonetic contexts, without any prior linguistic knowledge on their phonetic properties. This enables the network not only to efficiently capture similarities and differences between speakers and speaking styles, but to establish appropriate relationships between phones belonging to different languages, and ultimately to produce synthetic speech in the voice of a certain speaker in a language that he/she has never spoken. The validity of the proposed approach has been confirmed through experiments with models trained on speech corpora of American English and Mexican Spanish. It has also been shown that the proposed approach supports the use of neural vocoders, i.e. that they are able to produce synthesized speech of good quality even in languages that they were not trained on

Re-UNIR

EMG-to-Speech: Direct Generation of Speech from Facial Electromyographic Signals

Author: Janke Matthias
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 01/01/2016
Field of study

The general objective of this work is the design, implementation, improvement and evaluation of a system that uses surface electromyographic (EMG) signals and directly synthesizes an audible speech output: EMG-to-speech

KITopen

Multilingual and Multimodal Corpus-Based Text-to-Speech System - PLATTOS -

Author: Izidor Mlakar
Matej Rojc
Publication venue: 'IntechOpen'
Publication date: 21/06/2011
Field of study

IntechOpen

Digital library of University of Maribor

Speaker Clustering for Multilingual Synthesis

Author: Black Alan W.
Schultz Tanja
Publication venue
Publication date: 18/06/2008
Field of study

KITopen