90 research outputs found
A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images
Real-time magnetic resonance imaging (RT-MRI) of human speech production is
enabling significant advances in speech science, linguistics, bio-inspired
speech technology development, and clinical applications. Easy access to RT-MRI
is however limited, and comprehensive datasets with broad access are needed to
catalyze research across numerous domains. The imaging of the rapidly moving
articulators and dynamic airway shaping during speech demands high
spatio-temporal resolution and robust reconstruction methods. Further, while
reconstructed images have been published, to-date there is no open dataset
providing raw multi-coil RT-MRI data from an optimized speech production
experimental setup. Such datasets could enable new and improved methods for
dynamic image reconstruction, artifact correction, feature extraction, and
direct extraction of linguistically-relevant biomarkers. The present dataset
offers a unique corpus of 2D sagittal-view RT-MRI videos along with
synchronized audio for 75 subjects performing linguistically motivated speech
tasks, alongside the corresponding first-ever public domain raw RT-MRI data.
The dataset also includes 3D volumetric vocal tract MRI during sustained speech
sounds and high-resolution static anatomical T2-weighted upper airway MRI for
each subject.Comment: 27 pages, 6 figures, 5 tables, submitted to Nature Scientific Dat
Influences on articulatory timing in consonant sequences
This paper presents an experimental study of articulatory timing in English consonant sequences using electropalatography (EPG). Ef fects of consonantal place, manner, and syllable structure on reduction and temporal overlap are considered. The results evidence reduction in coda position and show that stops are more subject to coda reduction than are fricatives. Coronal consonants are shown to be more overlapped by a following velar stop than a velar stop is by a following coronal. Stops are more overlapped by a following consonant than are fricatives. Finally, an onset cluster is shown to be less overlapped and less variable in its timing than coda clusters and heterosyllabic sequences. These findings support the claim that inter-gestural coordination is variable and af fected by linguistic factors—both gestural and prosodic. ÷ 1996 Academic Press Limited
Discovering speech, words, and mind/ Byrd
viii, 308 hal.: ill, tab.; 25 cm
- …