Search CORE

3 research outputs found

Speaker-Dependent BiLSTM-Based Phrasing

Author: D Tihelka
I Read
J Hirschberg
J Matoušek
J Vít
M Jůzová
M Jůzová
N Hojo
P Taylor
P Taylor
S Bird
S Hochreiter
Z Hanzlíček
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Phrase boundary detection is an important part of text-to-speech systems since it ensures more natural speech synthesis outputs. However, the problem of phrasing is ambiguous, especially per speaker and per style. This is the reason why this paper focuses on speaker-dependent phrasing for the purposes of speech synthesis, using a neural network model with a speaker code. We also describe results of a listening test focused on incorrectly detected breaks because it turned out that some mistakes could be actually fine, not wrong

Crossref

University of West Bohemia Digital Library

DSpace at University of West Bohemia

Hodnocení kvality syntetické řeči pomocí spojité detekce emočních stavů založené na GMM

Author: AC Rencher
C Busso
D Tihelka
D Tihelka
DA Reynolds
E Jokinen
G Biagetti
J Přibil
J Přibil
J Vít
M Jůzová
M Pal
N Adiga
S Achanta
Y Saito
Y Zhao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Příspěvek popisuje systém pro automatické hodnocení kvality syntetické řeči založený na spojité detekci emočních stavů během mluvené věty pomocí klasifikace s využitím modelu gaussovských směsí (GMM). Konečné hodnocení se provádí statistickou analýzou výsledků rozdílů emočních stavů mezi větami původních mužských nebo ženských hlasů a řečí syntetizovanými různými metodami s různými parametry, přístupy k manipulaci s prozodií atd. Základní experimenty potvrzují funkčnost vyvinutého systému, který poskytuje výsledky srovnatelné s výsledky získanými standardní metodou využívající poslechový test. Další výzkumy ukázaly, že počet směsí, typy řečových příznaků a řečové databáze používané pro tvorbu a výcvik GMM mají relativně velký vliv na spojitou detekci emočního stylu a konečné hodnocení kvality testované syntetické řeči.The paper describes a system for automatic evaluation of synthetic speech quality based on continuous detection of emotional states throughout the spoken sentence using a Gaussian mixture model (GMM) classification. The final evaluation decision is made by statistical analysis of the results of emotional class differences between the sentences of original male or female voices and the speech synthesized by various methods with different parameters, approaches to prosody manipulation, etc. The basic experiments confirm the functionality of the developed system producing results comparable with those obtained by the standard listening test method. Additional investigations have shown that a number of mixtures, types of speech features, and a speech database used for creation and training of GMMs have a relatively great influence on continuous emotional style detection and the final quality evaluation of the tested synthetic speech

Crossref

University of West Bohemia Digital Library

DSpace at University of West Bohemia

Distinctive behaviour of live biopsy-derived carcinoma cells unveiled using coherence-controlled holographic microscopy

Author: A Gandalovičová
A Krizova
A Partin
A Sinha
A Tzur
AA Friedman
Břetislav Gál
CJ Di Como
D Palm
DJ Liebertz
E Watanabe
G Popescu
GA Dunn
H Burstein
H Li
J Balvan
J Balvan
J Collakova
Jana Čolláková
K Nylander
M Brozova
M Jinushi
M Kalia
M Re
M Zhao
Marta Nekulová
Miroslav Veselý
NR Khan
O Kepp
P Veselý
Pavel Veselý
R Barer
R Chmelik
R Kalluri
Radim Chmelík
SRS Rangan
T Lippert
T Slabý
T Zhao
Thomas Abraham
TM Svitkina
TT Dang
V Dotsch
Veronika Jůzová
Y Su
Publication venue: 'Public Library of Science (PLoS)'
Publication date
Field of study

Crossref