5,622 research outputs found

    Vocal tract morphometry by magnetic resonance imaging: simulation of pathological articulatory patterns

    Get PDF
    Introduction – The shape or morphologic analysis of anatomical structures, such as the vocal tract can be performed from two-dimensional (2D) or volumetric acquisitions (3D) of magnetic resonance imaging (MRI). This imaging technique has had an increasing use in the study of speech production. Objectives – To determine a method to perform the morphometric analysis of the vocal tract from magnetic resonance imaging; to present anatomical patterns during the normal speech production of some vowels and two pathological articulatory disorders in simulated context. Methods – The image data was collected from 2D (Turbo Spin Echo) and 3D (Flash Gradient Echo) acquisitions of MRI of four subjects during the production of three vowels; in addition, two articulatory disorders were assessed using this imaging protocol. The morphology of the vocal tract was extracted using manual and automatic techniques of image processing and analysis. Results – Based on five articulatory measurements, it was possible to study the entire vocal tract during vowel production, including the position and shape of the articulators involved. Based on these measurements, it was possible to identify the strategies most commonly adopted in the production of each sound, including the articulatory posture and the modification of each measure for the subjects under study. Concerning the voices of the different speakers, the variability in the assessed volumes of the vocal tract for each sound was found, and in particular, the increased vocal tract volume in the articulatory disorder – the sigmatism. Conclusion – MRI is a promising technique for speech production studies, safe, non-invasive and that provides reliable information concerning the morphometric analysis of the vocal tract

    Singing synthesis and the Vocal Tract Organ

    Get PDF
    Vocal synthesis has been the subject of investigation since the late 18th century when von Kempelen produced his mechanical ‘speaking machine’. The advert of electronics has enabled a number of different methods of voice synthesis to be realized in practice. Recently with the advent of 3-D printing and magnetic resonance imaging of human vocal tracts, it has been possible to create synthetic vocal sounds that combine both mechanical (3-D printed tracts) and electronic (synthesized larynx sound source) to enable the effects of various parts of the vocal tract on the acoustic output to be investigated. Given that the 3-D tracts look rather like organ pipes, the author (an organist) has developed a new musical instrument based on this technology, which is called the Vocal Tract Organ. This paper reviews voice synthesis techniques and describes the structure and operation of the Vocal Tract Organ

    Morfometria do trato vocal por ressonância magnética: simulação de padrões patológicos articulatórios

    Get PDF
    Introdução - A análise da forma ou morfometria de estruturas anatómicas, como o trato vocal, pode ser efetuada a partir de imagens bidimensionais (2D) como de aquisições volumétricas (3D) de ressonância magnética (RM). Esta técnica de imagem tem vindo a ter uma utilização crescente no estudo da produção da fala. Objetivos - Demonstrar como pode ser efetuada a morfometria do trato vocal a partir da imagem por ressonância magnética e ainda apresentar padrões anatómicos normais durante a produção das vogais [i a u] e dois padrões articulatórios patológicos em contexto simulado. Métodos - As imagens consideradas foram recolhidas a partir de aquisições 2D (Turbo Spin-eco) e 3D (Flash Gradiente-Eco) de RM em quatro sujeitos durante a produção das vogais em estudo; adicionalmente procedeu-se à avaliação de duas perturbações articulatórias usando o mesmo protocolo de RM. A morfometria do trato vocal foi extraída com recurso a técnicas manuais (para extração de cinco medidas articulatórias) e automáticas (para determinação de volumes) de processamento e análise de imagem. Resultados - Foi possível analisar todo o trato vocal, incluindo a posição e a forma dos articuladores, tendo por base cinco medidas descritivas do posicionamento destes órgãos durante a produção das vogais. A determinação destas medições permitiu identificar quais as estratégias mais comummente adotadas na produção de cada som, nomeadamente a postura articulatória e a variação de cada medida para cada um dos sujeitos em estudo. No contexto de voz falada intersujeitos, foi notória a variabilidade nos volumes estimados do trato vocal para cada som e, em especial, o aumento do volume do trato vocal na perturbação articulatória de sigmatismo. Conclusão - A imagem por RM é, sem dúvida, uma técnica promissora no estudo da fala, inócua, não-invasiva e que fornece informação fiável da morfometria do trato vocal.Introduction - The shape or morphologic analysis of anatomical structures, such as the vocal tract can be performed from two-dimensional (2D) or volumetric acquisitions (3D) of magnetic resonance imaging (MRI). This imaging technique has had an increasing use in the study of speech production. Objectives - To determine a method to perform the morphometric analysis of the vocal tract from magnetic resonance imaging; to present anatomical patterns during the normal speech production of some vowels and two pathological articulatory disorders in simulated context. Methods - The image data was collected from 2D (Turbo Spin Echo) and 3D (Flash Gradient Echo) acquisitions of MRI of four subjects during the production of three vowels; in addition, two articulatory disorders were assessed using this imaging protocol. The morphology of the vocal tract was extracted using manual and automatic techniques of image processing and analysis. Results - Based on five articulatory measurements, it was possible to study the entire vocal tract during vowel production, including the position and shape of the articulators involved. Based on these measurements, it was possible to identify the strategies most commonly adopted in the production of each sound, including the articulatory posture and the modification of each measure for the subjects under study. Concerning the voices of the different speakers, the variability in the assessed volumes of the vocal tract for each sound was found, and in particular, the increased vocal tract volume in the articulatory disorder - the sigmatism. Conclusion - MRI is a promising technique for speech production studies, safe, non-invasive and that provides reliable information concerning the morphometric analysis of the vocal tract

    Towards Automatic Speech Identification from Vocal Tract Shape Dynamics in Real-time MRI

    Full text link
    Vocal tract configurations play a vital role in generating distinguishable speech sounds, by modulating the airflow and creating different resonant cavities in speech production. They contain abundant information that can be utilized to better understand the underlying speech production mechanism. As a step towards automatic mapping of vocal tract shape geometry to acoustics, this paper employs effective video action recognition techniques, like Long-term Recurrent Convolutional Networks (LRCN) models, to identify different vowel-consonant-vowel (VCV) sequences from dynamic shaping of the vocal tract. Such a model typically combines a CNN based deep hierarchical visual feature extractor with Recurrent Networks, that ideally makes the network spatio-temporally deep enough to learn the sequential dynamics of a short video clip for video classification tasks. We use a database consisting of 2D real-time MRI of vocal tract shaping during VCV utterances by 17 speakers. The comparative performances of this class of algorithms under various parameter settings and for various classification tasks are discussed. Interestingly, the results show a marked difference in the model performance in the context of speech classification with respect to generic sequence or video classification tasks.Comment: To appear in the INTERSPEECH 2018 Proceeding

    Using Active Shape Modeling Based on MRI to Study Morphologic and Pitch-Related Functional Changes Affecting Vocal Structures and the Airway

    Get PDF
    Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.Peer reviewedPostprin

    Singing synthesis with an evolved physical model

    Get PDF
    A two-dimensional physical model of the human vocal tract is described. Such a system promises increased realism and control in the synthesis. of both speech and singing. However, the parameters describing the shape of the vocal tract while in use are not easily obtained, even using medical imaging techniques, so instead a genetic algorithm (GA) is applied to the model to find an appropriate configuration. Realistic sounds are produced by this method. Analysis of these, and the reliability of the technique (convergence properties) is provided

    Magnetic resonance imaging of the brain and vocal tract:Applications to the study of speech production and language learning

    Get PDF
    The human vocal system is highly plastic, allowing for the flexible expression of language, mood and intentions. However, this plasticity is not stable throughout the life span, and it is well documented that adult learners encounter greater difficulty than children in acquiring the sounds of foreign languages. Researchers have used magnetic resonance imaging (MRI) to interrogate the neural substrates of vocal imitation and learning, and the correlates of individual differences in phonetic “talent”. In parallel, a growing body of work using MR technology to directly image the vocal tract in real time during speech has offered primarily descriptive accounts of phonetic variation within and across languages. In this paper, we review the contribution of neural MRI to our understanding of vocal learning, and give an overview of vocal tract imaging and its potential to inform the field. We propose methods by which our understanding of speech production and learning could be advanced through the combined measurement of articulation and brain activity using MRI – specifically, we describe a novel paradigm, developed in our laboratory, that uses both MRI techniques to for the first time map directly between neural, articulatory and acoustic data in the investigation of vocalisation. This non-invasive, multimodal imaging method could be used to track central and peripheral correlates of spoken language learning, and speech recovery in clinical settings, as well as provide insights into potential sites for targeted neural interventions

    Magnetic resonance imaging of the vocal tract: techniques and applications

    Get PDF
    Magnetic resonance (MR) imaging has been used to analyse and evaluate the vocal tract shape through different techniques and with promising results in several fields. Our purpose is to demonstrate the relevance of MR and image processing for the vocal tract study. The extraction of contours of the air cavities allowed the set-up of a number of 3D reconstruction image stacks by means of the combination of orthogonally oriented sets of slices for each articulatory gesture, as a new approach to solve the expected spatial under sampling of the imaging process. In result these models give improved information for the visualization of morphologic and anatomical aspects and are useful for partial measurements of the vocal tract shape in different situations. Potential use can be found in Medical and therapeutic applications as well as in acoustic articulatory speech modelling
    corecore