Search CORE

374 research outputs found

Estimating underlying articulatory targets of Thai vowels by using deep learning based on generating synthetic samples from a 3D vocal tract model and data augmentation

Author: Birkholz P
Lapthawan T
Prom-On S
Xu Y
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 12/04/2022
Field of study

Representation learning is one of the fundamental issues in modeling articulatory-based speech synthesis using target-driven models. This paper proposes a computational strategy for learning underlying articulatory targets from a 3D articulatory speech synthesis model using a bi-directional long short-term memory recurrent neural network based on a small set of representative seed samples. From a seeding set, a larger training set was generated that provided richer contextual variations for the model to learn. The deep learning model for acoustic-to-target mapping was then trained to model the inverse relation of the articulation process. This method allows the trained model to map the given acoustic data onto the articulatory target parameters which can then be used to identify the distribution based on linguistic contexts. The model was evaluated based on its effectiveness in mapping acoustics to articulation, and the perceptual accuracy of speech reproduced from the estimated articulation. The results indicate that the model can accurately imitate speech with a high degree of phonemic precision

UCL Discovery

Early intravenous magnesium sulfate and its impact on cerebral vasospasm as well as delayed cerebral ischemia in aneurysmal subarachnoid hemorrhage: a retrospective matched case-control analysis

Author: Birkholz Torsten
Buchfelder Michael
Dörfler Arnd
Feulner Julian
Sommer Björn
Weidinger Cornelia S.
Publication venue
Publication date: 01/01/2024
Field of study

Introduction Magnesium sulfate (MgSO4) is a potential neuroprotective agent for patients with aneurysmal subarachnoid hemorrhage (aSAH). We analysed the effect of early application of intraoperative intravenous magnesium sulfate (MgSO4) and compared the rate of cerebral vasospasm (CV), delayed cerebral ischemia (DCI) and neurological outcome in two patient cohorts. Material and methods A retrospective matched-pair analysis from patients of a single center in Germany was performed without (group A) and with (group B) MgSO4 application <24 hrs after diagnosis. Pairs were matched according to the known risk factors for DCI and CV (age, Fisher grade, smoking, severity of SAH). Incidence of CV, DCI, and neurological outcome using the modified Rankin Scale (mRS) 3 and 12 months after SAH were recorded. Results 196 patients fulfilled the inclusion criteria. After risk stratification, 48 patients were included in the final analysis (mean±SD age 54.2±8.1yrs, 30 f, 18 m) and were assigned to group A (n=24) or group B (n=24). CV occurred less frequently in group B (33%) than in A (46%). Likewise, DCI was present in 13% (group B) as compared to 42% (group A). After 12 months, 22 patients of group B had a favorable functional outcome (mRS 0 to 3) compared to 15 of group A. Conclusions In this study, the incidence of CV and DCI was lower in patients receiving intravenous MgSO4 within 24 hrs after aneurysmal SAH onset. Favorable functional outcome was more likely in the magnesium group after 12 months of follow-up

OPUS Augsburg

Model-based exploration of linking between vowel articulatory space and acoustic space

Author: Birkholz P
Gerazov B
Krug PK
Prom-On S
van Niekerk D
Xu A
Xu Y
Publication venue: 'The International Fiscal Association of Korea'
Publication date: 03/09/2021
Field of study

While the acoustic vowel space has been extensively studied in previous research, little is known about the high-dimensional articulatory space of vowels. The articulatory imaging techniques are limited to tracking only a few key articulators, leaving the rest of the articulators unmonitored. In the present study, we attempted to develop a detailed articulatory space obtained by training a 3D articulatory synthesizer to learn eleven British English vowels. An analysis-by-synthesis strategy was used to acoustically optimize vocal tract parameters that represent twenty articulatory dimensions. The results show that tongue height and retraction, larynx location and lip roundness are the most perceptually distinctive articulatory dimensions. Yet, even for these dimensions, there is a fair amount of articulatory overlap between vowels, unlike the fine-grained acoustic space. This method opens up the possibility of using modelling to investigate the link between speech production and perception

UCL Discovery

Modell einer Frauenstimme für die artikulatorische Sprachsynthese mit VocalTractLab Studientexte zur Sprachkommunikation

Author: Birkholz P.
Drechsel S.
Frahm J.
Gao Y.
Publication venue
Publication date: 01/01/2019
Field of study

Für das artikulatorische Sprachsynthesesystem VocalTractLab, das inder veröffentlichten Version auf dem geometrischen Modell eines männlichen Vokaltrakts basiert, wird das Modell für eine Frauenstimme vorgestellt. Anhand von MRT-Aufnahmen, Kieferabdrücken und Sprachaufnahmen einer ausgebildeten Sprecherinwurden die anatomischen Parameter für den weiblichen Vokaltrakt bestimmt und dieZielformen der Einzellaute sowie der glottalen Gesten angepasst. Die Sprachsynthese direkt aus Text oder einer phonetischen Transkription erfolgt mit VocalTractLab derzeit noch nicht automatisch. Die Schritte zur Erstellung von gestischen Partiturenwerden beschrieben und die Ergebnisse einer ersten Hörerbefragung zur Qualität dersynthetischen Frauenstimme präsentiert

MPG.PuRe

Evoc-Learn - High quality simulation of early vocal learning

Author: Birkholz P
Gerazov B
Halliday LF
Krug PK
Prom-On S
van Niekerk DR
Xu A
Xu Y
Publication venue: International Speech Communication Association (ISCA)
Publication date: 22/09/2022
Field of study

Evoc-Learn is a system for simulating early vocal learning of spoken language in ways that can overcome some of the major bottlenecks in vocal learning. The system consists of VocalTractLab, a geometrical three-dimensional vocal tract model for simulating aeroacoustics and articulatory dynamics, a coarticulation model for controlling the temporal dynamics of articulation, and a sensory feedback system for guiding the learning process. We will demonstrate each component of Evoc-Learn and show how they work together to simulate the learning of highly intelligible speech

UCL Discovery

Influence of the substrate-induced strain and irradiation disorder on the Peierls transition in TTF-TCNQ microdomains

Author: Anastasia Cmyrev
Beloborodov I S
Birkholz M
Drouin D Couture A R Gauvin R Hovington P Horny P Demers H
Grüner G
Heiko Reith
Huth M
Michael Huth
Peierls R E
Phanindra Sai T
Porrati F
Roland Sachser
Shklovskii B I
Solovyeva V
Vita Solovyeva
Zaitsev-Zotov S V
Publication venue: 'IOP Publishing'
Publication date: 01/01/2011
Field of study

The influence of the combined effects of substrate-induced strain, finite size and electron irradiation-induced defects have been studied on individual micron-sized domains of the organic charge transfer compound tetrathiafulvalene-tetracyanoquinodimethane (TTF-TCNQ) by temperature-dependent conductivity and current-voltage measurements. The individual domains have been isolated by focused ion beam etching and electrically contacted by focused ion and electron beam induced deposition of metallic contacts. The temperature-dependent conductivity follows a variable range hopping behavior which shows a crossover of the exponent as the Peierls transition is approached. The low temperature behavior is analyzed within the segmented rod model of Fogler, Teber and Shklowskii, as originally developed for a charge-ordered quasi one-dimensional electron crystal. The results are compared with data obtained on as-grown and electron irradiated epitaxial TTF-TCNQ thin films of the two-domain type

arXiv.org e-Print Archive

Crossref

Syddansk Universitets Forskerportal