Search CORE

9 research outputs found

The Cerevoice Blizzard Entry 2007: Are Small Database Errors Worse than Compression Artifacts?

Author: Andersson J. Sebastian
Aylett Matthew P.
Badino Leonardo
Pidcock Christopher J.
Publication venue
Publication date: 01/01/2007
Field of study

In commercial systems the memory footprint of unit selection systems is often a key issue. This is especially true for PDAs and other embedded devices. In this years Blizzard entry CereProc R○gave itself the criteria that the full database system entered would have a smaller memory footprint than either of the two smaller database entries. This was accomplished by applying speex speech compression to the full database entry. In turn a set of small database techniques used to improve the quality of small database systems in last years entry were extended. Finally, for all systems, two quality control methods were applied to the underlying database to improve the lexicon and transcription match to the underlying data. Results suggest that mild audio quality artifacts introduced by lossy compression have almost as much impact on MOS perceived quality as concatenation errors introduced by sparse data in the smaller systems with bulked diphones. Index Terms: speech synthesis, unit selection. 1

CiteSeerX

Edinburgh Research Explorer

The CereProc Blizzard Entry 2009: Some dumb algorithms that don't work

Author: Aylett Matthew
Pidcock Christopher J.
Publication venue
Publication date: 01/01/2009
Field of study

Within unit selection systems there is a constant tension between data sparsity and quality. This limits the control possible in a unit selection system. The RP data used in Blizzard this year and last year is expressive and spoken in a spirited manner. Last years entry focused on maintaining expressiveness, this year we focused on two simple algorithms to restrain and control this prosodic variation. 1) Variable width valley floor pruning on duration and pitch (Applied to the full database entry EH1), 2) Bulking of data with average HTS data (Applied to small database entry EH2). Results for both techniques were disappointing. The full database system achieved an MOS of around 2 (compared to 4 for a similar system attempting to emphasise variation in 2008), while the small database entry achieved an MOS of also 2 (compared to 3 for a similar system, but with a difference voice, entered in 2007). Index Terms: speech synthesis, unit selection. 1

CiteSeerX

Edinburgh Research Explorer

The Cerevoice Blizzard Entry 2006: A Prototype Small Database Unit Selection Engine

Author: Aylett Matthew P.
Fraser Mark E.
Pidcock Christopher J.
Publication venue
Publication date: 01/01/2006
Field of study

Edinburgh Research Explorer

Expressive speech synthesis: synthesising ambiguity

Author: Aylett Matthew P.
Pidcock Christopher J.
Potard Blaise
Publication venue
Publication date: 01/01/2013
Field of study

Previous work in HCI has shown that ambiguity, normally avoided in interaction design, can contribute to a user’s engagement by increasing interest and uncertainty. In this work, we create and evaluate synthetic utterances where there is a conflict between text content, and the emotion in the voice. We show that: 1) text content measurably alters the negative/positive perception of a spoken utterance, 2) changes in voice quality also produce this effect, 3) when the voice quality and text content are conflicting the result is a synthesised ambiguous utterance. Results were analysed using an evaluation/activation space. Whereas the effect of text content was restricted to the negative/positive dimension (valence), voice quality also had a significant effect on how active or passive the utterance was perceived (activation). Index Terms: speech synthesis, unit selection, expressive speech synthesis, emotion, prosody

CiteSeerX

Edinburgh Research Explorer

Proper Name Splicing in Computer Games with TTS

Author: Aylett Matthew P.
Pidcock Christopher J.
Potard Blaise
Publication venue
Publication date: 01/01/2012
Field of study

Building high quality synthesis systems with open domain vocabulary and a small audio database is a challenging problem, even when the targeted application is well constrained. Monophone unit concatenation (as opposed to diphone) is an approach that can compensate for the poor unit coverage that a small database implies. However, joining at phone boundaries is a delicate task that requires accurate targeting. In this paper, we present an automatically trained targeting system based on the parametric synthesiser HTS, and compare it to a concatenative monophone system and a baseline concatenative diphone system. We apply a novel evaluation methodology which includes a qualitative component, and allows for fast incremental development of synthesis systems. Preliminary results show that although the hybrid system performed significantly more poorly on out of database items, it is less affected by segmentation errors than the monophone system. Index Terms: hybrid speech synthesis, unit selection, evaluation of TTS system

CiteSeerX

Edinburgh Research Explorer

Phylogenetic systematics of Butyrivibrio and Pseudobutyrivibrio genomes illustrate vast taxonomic diversity, open genomes and an abundance of carbohydrate-active enzyme family isoforms

Author: Courtney Stephen
Creevey Christopher J
Godoy Santos Fernanda
Huws Sharon
Pidcock Sara
Skvortsov Timofey
Sui-Ting Karen
Publication venue: 'Microbiology Society'
Publication date: 04/10/2021
Field of study

Queen's University Belfast Research Portal

Genome sequencing and the rumen microbiome

Author: Creevey Christopher J.
Friedersdorff Jessica C. A.
Hart Elizabeth H.
Pidcock Sara E.
Rubino Francesco
Thomas Benjamin J.
Publication venue: 'Informa UK Limited'
Publication date: 23/06/2020
Field of study

Queen's University Belfast Research Portal

Crossref