374 research outputs found
Estimating underlying articulatory targets of Thai vowels by using deep learning based on generating synthetic samples from a 3D vocal tract model and data augmentation
Representation learning is one of the fundamental issues in modeling articulatory-based speech synthesis using target-driven models. This paper proposes a computational strategy for learning underlying articulatory targets from a 3D articulatory speech synthesis model using a bi-directional long short-term memory recurrent neural network based on a small set of representative seed samples. From a seeding set, a larger training set was generated that provided richer contextual variations for the model to learn. The deep learning model for acoustic-to-target mapping was then trained to model the inverse relation of the articulation process. This method allows the trained model to map the given acoustic data onto the articulatory target parameters which can then be used to identify the distribution based on linguistic contexts. The model was evaluated based on its effectiveness in mapping acoustics to articulation, and the perceptual accuracy of speech reproduced from the estimated articulation. The results indicate that the model can accurately imitate speech with a high degree of phonemic precision
Early intravenous magnesium sulfate and its impact on cerebral vasospasm as well as delayed cerebral ischemia in aneurysmal subarachnoid hemorrhage: a retrospective matched case-control analysis
Introduction
Magnesium sulfate (MgSO4) is a potential neuroprotective agent for patients with aneurysmal subarachnoid hemorrhage (aSAH). We analysed the effect of early application of intraoperative intravenous magnesium sulfate (MgSO4) and compared the rate of cerebral vasospasm (CV), delayed cerebral ischemia (DCI) and neurological outcome in two patient cohorts.
Material and methods
A retrospective matched-pair analysis from patients of a single center in Germany was performed without (group A) and with (group B) MgSO4 application <24 hrs after diagnosis. Pairs were matched according to the known risk factors for DCI and CV (age, Fisher grade, smoking, severity of SAH). Incidence of CV, DCI, and neurological outcome using the modified Rankin Scale (mRS) 3 and 12 months after SAH were recorded.
Results
196 patients fulfilled the inclusion criteria. After risk stratification, 48 patients were included in the final analysis (mean±SD age 54.2±8.1yrs, 30 f, 18 m) and were assigned to group A (n=24) or group B (n=24). CV occurred less frequently in group B (33%) than in A (46%). Likewise, DCI was present in 13% (group B) as compared to 42% (group A). After 12 months, 22 patients of group B had a favorable functional outcome (mRS 0 to 3) compared to 15 of group A.
Conclusions
In this study, the incidence of CV and DCI was lower in patients receiving intravenous MgSO4 within 24 hrs after aneurysmal SAH onset. Favorable functional outcome was more likely in the magnesium group after 12 months of follow-up
Model-based exploration of linking between vowel articulatory space and acoustic space
While the acoustic vowel space has been extensively studied in previous research, little is known about the high-dimensional articulatory space of vowels. The articulatory imaging techniques are limited to tracking only a few key articulators, leaving the rest of the articulators unmonitored. In the present study, we attempted to develop a detailed articulatory space obtained by training a 3D articulatory synthesizer to learn eleven British English vowels. An analysis-by-synthesis strategy was used to acoustically optimize vocal tract parameters that represent twenty articulatory dimensions. The results show that tongue height and retraction, larynx location and lip roundness are the most perceptually distinctive articulatory dimensions. Yet, even for these dimensions, there is a fair amount of articulatory overlap between vowels, unlike the fine-grained acoustic space. This method opens up the possibility of using modelling to investigate the link between speech production and perception
Modell einer Frauenstimme für die artikulatorische Sprachsynthese mit VocalTractLab Studientexte zur Sprachkommunikation
Für das artikulatorische Sprachsynthesesystem VocalTractLab, das inder veröffentlichten Version auf dem geometrischen Modell eines männlichen Vokaltrakts basiert, wird das Modell für eine Frauenstimme vorgestellt. Anhand von MRT-Aufnahmen, Kieferabdrücken und Sprachaufnahmen einer ausgebildeten Sprecherinwurden die anatomischen Parameter für den weiblichen Vokaltrakt bestimmt und dieZielformen der Einzellaute sowie der glottalen Gesten angepasst. Die Sprachsynthese direkt aus Text oder einer phonetischen Transkription erfolgt mit VocalTractLab derzeit noch nicht automatisch. Die Schritte zur Erstellung von gestischen Partiturenwerden beschrieben und die Ergebnisse einer ersten Hörerbefragung zur Qualität dersynthetischen Frauenstimme präsentiert
Evoc-Learn - High quality simulation of early vocal learning
Evoc-Learn is a system for simulating early vocal learning of spoken language in ways that can overcome some of the major bottlenecks in vocal learning. The system consists of VocalTractLab, a geometrical three-dimensional vocal tract model for simulating aeroacoustics and articulatory dynamics, a coarticulation model for controlling the temporal dynamics of articulation, and a sensory feedback system for guiding the learning process. We will demonstrate each component of Evoc-Learn and show how they work together to simulate the learning of highly intelligible speech
Influence of the substrate-induced strain and irradiation disorder on the Peierls transition in TTF-TCNQ microdomains
The influence of the combined effects of substrate-induced strain, finite
size and electron irradiation-induced defects have been studied on individual
micron-sized domains of the organic charge transfer compound
tetrathiafulvalene-tetracyanoquinodimethane (TTF-TCNQ) by temperature-dependent
conductivity and current-voltage measurements. The individual domains have been
isolated by focused ion beam etching and electrically contacted by focused ion
and electron beam induced deposition of metallic contacts. The
temperature-dependent conductivity follows a variable range hopping behavior
which shows a crossover of the exponent as the Peierls transition is
approached. The low temperature behavior is analyzed within the segmented rod
model of Fogler, Teber and Shklowskii, as originally developed for a
charge-ordered quasi one-dimensional electron crystal. The results are compared
with data obtained on as-grown and electron irradiated epitaxial TTF-TCNQ thin
films of the two-domain type
Ligand-induced type II interleukin-4 receptor dimers are sustained by rapid re-association within plasma membrane microcompartments
Biological and Soft Matter Physic
- …