Inter-speaker speech variability assessment using statistical deformable models from 3.0 Tesla magnetic resonance images

Author: Freitas Diamantino R. S.
Tavares João Manuel R. S.
Vasconcelos Maria J. M.
Ventura Sandra Moreira Rua
Publication venue: 'SAGE Publications'
Publication date: 01/01/2011

The morphological and dynamic characterisation of the vocal tract during speech production has been gaining greater attention, motivated by the latest improvements in magnetic resonance (MR) imaging; namely, the use of higher magnetic fields, such as 3.0 Tesla. In this work, the automatic study of the vocal tract from 3.0 Tesla MR images was assessed through the application of statistical deformable models. The primary goal was the analysis of the shape of the vocal tract during the articulation of European Portuguese sounds, followed by the evaluation of the results concerning the automatic segmentation, i.e. identification of the vocal tract in new MR images. As far as speech production is concerned, this is the first attempt to automatically characterise and reconstruct the vocal tract shape from 3.0 Tesla MR images by using deformable models; particularly, by using active shape and active appearance models. The achieved results clearly show the adequacy and advantage of the automatic analysis of 3.0 Tesla MR images with these deformable models in order to extract the vocal tract shape and assess the involved articulatory movements. These achievements are needed, for example, for a better knowledge of speech production, mainly in patients suffering from articulatory disorders, and to build enhanced speech synthesizer models.

Repositório Científico do Instituto Politécnico do Porto

Analysis of fuzzy clustering and a generic fuzzy rule-based image segmentation technique

Author: Dooley Laurence S.
Karmakar Gour C.
Publication date: 01/06/2001

Many fuzzy clustering based techniques, when applied to image segmentation, do not incorporate the spatial relationships of the pixels, while fuzzy rule-based image segmentation techniques are generally application dependent. Also, for most of these techniques, the structure of the membership functions is predefined and the parameters have to be either automatically or manually derived. This paper addresses some of these issues by introducing a new generic fuzzy rule-based image segmentation (GFRIS) technique, which is application independent and can also incorporate the spatial relationships of the pixels. A qualitative comparison is presented between the segmentation results obtained using this method and the popular fuzzy c-means (FCM) and possibilistic c-means (PCM) algorithms using an empirical discrepancy method. The results demonstrate that this approach exhibits significant improvements over these popular fuzzy clustering algorithms for a wide range of differing image types.

Open Research Online (The Open University)
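
The abstract above notes that clustering-based segmentation such as fuzzy c-means (FCM) does not use the spatial relationships of the pixels. As an illustration only, here is a minimal sketch of the textbook FCM updates on a flat list of pixel intensities; it is not the GFRIS technique and not the authors' code. The function name `fuzzy_c_means`, the even initialisation of the centres, the fuzzifier `m = 2.0`, and the sample intensities are all assumptions made for this example.

```python
def fuzzy_c_means(values, n_clusters=2, m=2.0, n_iter=50, eps=1e-9):
    """Textbook FCM on a flat list of pixel intensities (no spatial term)."""
    lo, hi = min(values), max(values)
    # Assumed initialisation: centres spread evenly over the intensity range.
    centres = [lo + (hi - lo) * (k + 0.5) / n_clusters for k in range(n_clusters)]
    memberships = []
    for _ in range(n_iter):
        # Membership update: u_ik is proportional to d_ik^(-2/(m-1)),
        # normalised so each pixel's memberships sum to 1.
        memberships = []
        for x in values:
            weights = [(abs(x - c) + eps) ** (-2.0 / (m - 1.0)) for c in centres]
            total = sum(weights)
            memberships.append([w / total for w in weights])
        # Centre update: mean of the intensities weighted by u_ik^m.
        centres = [
            sum((u[k] ** m) * x for u, x in zip(memberships, values))
            / sum(u[k] ** m for u in memberships)
            for k in range(n_clusters)
        ]
    return centres, memberships


# Hypothetical intensities: three dark pixels and three bright pixels.
pixels = [10, 12, 11, 200, 205, 198]
centres, memberships = fuzzy_c_means(pixels)
```

Each membership depends only on the distance between a pixel's intensity and the cluster centres, so reordering the pixels (that is, discarding where they sit in the image) leaves the centres unchanged apart from floating-point rounding. That is the limitation the abstract refers to.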

Magnetic resonance imaging of the vocal tract: techniques and applications

Author: Diamantino Freitas
João Manuel R. S. Tavares
Sandra M. Rua Ventura
Publication date: 01/01/2009

Magnetic resonance (MR) imaging has been used to analyse and evaluate the vocal tract shape through different techniques and with promising results in several fields. Our purpose is to demonstrate the relevance of MR and image processing for the vocal tract study. The extraction of contours of the air cavities allowed the set-up of a number of 3D reconstruction image stacks by means of the combination of orthogonally oriented sets of slices for each articulatory gesture, as a new approach to solve the expected spatial undersampling of the imaging process. As a result, these models give improved information for the visualization of morphologic and anatomical aspects and are useful for partial measurements of the vocal tract shape in different situations. Potential use can be found in medical and therapeutic applications as well as in acoustic articulatory speech modelling.

Repositório Aberto da Universidade do Porto

Deep-Learning-Based Methods for Automatic Articulator and Levator Veli Palatini Segmentation and Motion Quantification in Magnetic Resonance Images of the Vocal Tract

Author: Ruthven Matthieu
Publication date: 01/10/2023

King's Research Portal

Magnetic resonance imaging of the vocal tract: techniques and applications

Author: Freitas Diamantino Rui
Tavares João Manuel
Ventura Sandra Moreira Rua
Publication venue: Institute for Systems and Technologies of Information, Control and Communication Press
Publication date: 17/11/2013

Magnetic resonance (MR) imaging has been used to analyse and evaluate the vocal tract shape through different techniques and with promising results in several fields. Our purpose is to demonstrate the relevance of MR and image processing for the vocal tract study. The extraction of contours of the air cavities allowed the set-up of a number of 3D reconstruction image stacks by means of the combination of orthogonally oriented sets of slices for each articulatory gesture, as a new approach to solve the expected spatial undersampling of the imaging process. As a result, these models give improved information for the visualization of morphologic and anatomical aspects and are useful for partial measurements of the vocal tract shape in different situations. Potential use can be found in medical and therapeutic applications as well as in acoustic articulatory speech modelling.

Repositório Científico do Instituto Politécnico do Porto

Magnetic resonance imaging of the brain and vocal tract: Applications to the study of speech production and language learning

Author: Carolyn McGettigan
Daniel Carey
Publication venue: 'Elsevier BV'
Publication date: 01/04/2017

The human vocal system is highly plastic, allowing for the flexible expression of language, mood and intentions. However, this plasticity is not stable throughout the life span, and it is well documented that adult learners encounter greater difficulty than children in acquiring the sounds of foreign languages. Researchers have used magnetic resonance imaging (MRI) to interrogate the neural substrates of vocal imitation and learning, and the correlates of individual differences in phonetic “talent”. In parallel, a growing body of work using MR technology to directly image the vocal tract in real time during speech has offered primarily descriptive accounts of phonetic variation within and across languages. In this paper, we review the contribution of neural MRI to our understanding of vocal learning, and give an overview of vocal tract imaging and its potential to inform the field. We propose methods by which our understanding of speech production and learning could be advanced through the combined measurement of articulation and brain activity using MRI. Specifically, we describe a novel paradigm, developed in our laboratory, that uses both MRI techniques to map directly, for the first time, between neural, articulatory and acoustic data in the investigation of vocalisation. This non-invasive, multimodal imaging method could be used to track central and peripheral correlates of spoken language learning and of speech recovery in clinical settings, as well as provide insights into potential sites for targeted neural interventions.

Crossref

Royal Holloway - Pure

UCL Discovery

A generic fuzzy rule based technique for image segmentation

Author: Dooley L. S.
Karmakar G. C.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/05/2001

Many fuzzy clustering based techniques do not incorporate the spatial relationships of the pixels, while all fuzzy rule based image segmentation techniques tend to be very much application dependent. In most techniques, the structure of the membership functions is predefined and their parameters are either automatically or manually determined. This paper addresses the aforementioned problems by introducing a general fuzzy rule based image segmentation technique, which is application independent and can also incorporate the spatial relationships of the pixels. It also proposes the automatic defining of the structure of the membership functions. A qualitative comparison is made between the segmentation results using this method and the popular fuzzy c-means (FCM) applied to two types of images: light intensity (LI) and an X-ray of the human vocal tract. The results clearly show that this method exhibits significant improvements over FCM for both types of image.

Open Research Online (The Open University)

Analyzing speech in both time and space: generalized additive mixed models can uncover systematic patterns of variation in vocal tract shape in real-time MRI

Author: Carignan Christopher
Frahm Jens
Harrington Jonathan
Hoole Phil
Joseph Arun
Kunay Esther
Pouplier Marianne
Voit Dirk
Publication venue: 'Ubiquity Press, Ltd.'
Publication date: 01/01/2020

We present a method of using generalized additive mixed models (GAMMs) to analyze midsagittal vocal tract data obtained from real-time magnetic resonance imaging (rt-MRI) video of speech production. Applied to rt-MRI data, GAMMs allow for observation of factor effects on vocal tract shape throughout two key dimensions: time (vocal tract change over the temporal course of a speech segment) and space (location of change within the vocal tract). Examples of this method are provided for rt-MRI data collected at a temporal resolution of 20 ms and a spatial resolution of 1.41 mm, for 36 native speakers of German. The rt-MRI data were quantified as 28-point semi-polar-grid aperture functions. Three test cases are provided as a way of observing vocal tract differences between: (1) /aː/ and /iː/, (2) /aː/ and /aɪ/, and (3) accentuated and unstressed /aː/. The results for each GAMM are independently validated using functional linear mixed models (FLMMs) constructed from data obtained at 20% and 80% of the vowel interval. In each case, the two methods yield similar results. In light of the method similarities, we propose that GAMMs are a robust, powerful, and interpretable method of simultaneously analyzing both temporal and spatial effects in rt-MRI video of speech

Directory of Open Access Journals

UCL Discovery

Western Sydney ResearchDirect

MPG.PuRe