1,872 research outputs found

    A multilinear tongue model derived from speech related MRI data of the human vocal tract

    We present a multilinear statistical model of the human tongue that captures anatomical and tongue pose related shape variations separately. The model is derived from 3D magnetic resonance imaging data of 11 speakers sustaining speech related vocal tract configurations. The shapes are extracted with a minimally supervised method that builds on an image segmentation approach and a template fitting technique. The method also uses image denoising to deal with possibly corrupt data, palate surface reconstruction to handle contacts between tongue and palate, and a bootstrap strategy to refine the obtained shapes. Our evaluation concludes that limiting the degrees of freedom for the anatomical and speech related variations to 5 and 4, respectively, produces a model that can reliably register unknown data while avoiding overfitting effects. Furthermore, we show that the model can be used to generate a plausible tongue animation by tracking sparse motion capture data.
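
    As a rough illustration of what "multilinear" means here, the sketch below evaluates a generic two-mode shape model: a mean shape plus a core tensor contracted with one weight vector per mode. The function name, the array layout, and the vertex count are assumptions made for illustration; only the 5 anatomical and 4 pose degrees of freedom come from the abstract, and this is not the authors' implementation.

```python
import numpy as np

def reconstruct_tongue(mean_shape, core, anatomy_w, pose_w):
    """Evaluate a two-mode multilinear shape model (hypothetical layout).

    mean_shape: (3 * n_vertices,) stacked xyz coordinates of the mean mesh
    core:       (3 * n_vertices, n_anatomy, n_pose) core tensor
    anatomy_w:  (n_anatomy,) speaker-specific weights
    pose_w:     (n_pose,) tongue-pose weights
    """
    # Contract the anatomy mode (i) and the pose mode (j) of the core tensor.
    offset = np.einsum("vij,i,j->v", core, anatomy_w, pose_w)
    return mean_shape + offset

# Toy example with random data; 100 vertices is an arbitrary choice.
rng = np.random.default_rng(0)
n_vertices, n_anatomy, n_pose = 100, 5, 4
mean_shape = rng.standard_normal(3 * n_vertices)
core = rng.standard_normal((3 * n_vertices, n_anatomy, n_pose))
shape = reconstruct_tongue(mean_shape, core,
                           rng.standard_normal(n_anatomy),
                           rng.standard_normal(n_pose))
```

    Because the offset is linear in each weight vector separately, the pose weights can be varied for a fixed speaker (or the reverse), which is the separation of anatomy and pose that the abstract describes.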

    A stabilized finite element method for the mixed wave equation in an ALE framework with application to diphthong production

    The archived file is not the final published version of the article. © (2016) S. Hirzel Verlag/European Acoustics Association. Readers must contact the publisher for reprint or permission to use the material in any form. The definitive publisher-authenticated version is available online at http://www.ingentaconnect.com/contentone/dav/aaua/2016/00000102/00000001/art00012
    Working with the wave equation in mixed rather than irreducible form allows one to account directly for both the acoustic pressure field and the acoustic particle velocity field. Indeed, this becomes the natural option in many problems, such as those involving waves propagating in moving domains, because the equations can easily be set in an arbitrary Lagrangian-Eulerian (ALE) frame of reference. Yet a standard Galerkin finite element method (FEM) solution has to satisfy an inf-sup compatibility constraint, which prevents the use of equal interpolations for the approximated acoustic pressure and velocity fields. This work proposes a subgrid scale stabilization strategy to circumvent this condition and thus facilitate code implementation. As a possible application, we address the generation of diphthongs in voice production.
    Peer reviewed. Postprint (author's final draft).
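
    For orientation, the mixed and irreducible forms mentioned in the abstract can be written as follows for a fixed (non-moving) domain. The notation is an assumption, not taken from the article: $p$ is the acoustic pressure, $\mathbf{u}$ the acoustic particle velocity, $\rho_0$ the mean density, $c_0$ the speed of sound, and $Q$, $\mathbf{f}$ generic source terms. The additional mesh-velocity terms of the ALE formulation are not shown.

```latex
% Mixed form: two first-order equations for pressure and velocity
\begin{align}
  \frac{1}{\rho_0 c_0^2}\,\partial_t p + \nabla\cdot\mathbf{u} &= Q, \\
  \rho_0\,\partial_t \mathbf{u} + \nabla p &= \mathbf{f}.
\end{align}
% Irreducible form: differentiate the first equation in time and use the
% second to eliminate \partial_t \mathbf{u}
\begin{equation}
  \frac{1}{c_0^2}\,\partial_{tt} p - \nabla^2 p
    = \rho_0\,\partial_t Q - \nabla\cdot\mathbf{f}.
\end{equation}
```

    The irreducible form involves only $p$, while the mixed form keeps $\mathbf{u}$ as an unknown, which is why a Galerkin discretization of the mixed form is subject to the inf-sup condition on the pressure and velocity interpolations.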

    Registration and statistical analysis of the tongue shape during speech production

    This thesis analyzes the shape of the human tongue during speech production. First, a semi-supervised approach is derived for estimating the tongue shape from volumetric magnetic resonance imaging data of the human vocal tract. The results of this extraction are used to derive parametric tongue models. Next, a framework is presented for registering sparse motion capture data of the tongue by means of such a model. This method makes it possible to generate full three-dimensional animations of the tongue. Finally, a multimodal and statistical text-to-speech system is developed that can synthesize audio and synchronized tongue motion from text.
    Funding: German Research Foundation.

    Accelerated partial separable model using dimension-reduced optimization technique for ultra-fast cardiac MRI

    Objective. Imaging dynamic objects with high temporal resolution is challenging in magnetic resonance imaging (MRI). The partial separable (PS) model was proposed to improve imaging quality by reducing the degrees of freedom of the inverse problem. However, the PS model still suffers from long acquisition times and even longer reconstruction times. The main objective of this study is to accelerate the PS model, shortening the time required for acquisition and reconstruction while maintaining good image quality. Approach. We propose to fully exploit the dimension reduction property of the PS model by implementing the optimization algorithm in the subspace. We optimize the data consistency term and use a Tikhonov regularization term based on the Frobenius norm of the temporal difference. The proposed dimension-reduced optimization technique was validated in free-running cardiac MRI. We performed both retrospective experiments on a public dataset and prospective experiments on in-vivo data. The proposed method was compared with four competing algorithms based on the PS model and two non-PS methods. Main results. The proposed method is robust to shortened acquisition times and suboptimal hyper-parameter settings, and achieves superior image quality over all competing algorithms. It is 20-fold faster than the widely accepted PS+Sparse method, enabling image reconstruction to finish in just a few seconds. Significance. The accelerated PS model has the potential to save considerable time in clinical dynamic MRI examinations and is promising for real-time MRI applications.
    Comment: 23 pages, 11 figures. Accepted as manuscript on Physics in Medicine & Biology.
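
    The idea of optimizing in the subspace can be illustrated with a deliberately simplified sketch. It assumes a fully sampled, possibly noisy Casorati matrix Y (voxels by frames) and a known temporal basis V with L rows, and it solves a Tikhonov-regularized least-squares problem for the spatial coefficients U in closed form. The function name, the variable names, and the fully sampled setting are assumptions made for illustration; the method in the abstract works with undersampled k-space data and is not reproduced here.

```python
import numpy as np

def fit_spatial_coefficients(Y, V, lam):
    """Minimize ||U V - Y||_F^2 + lam * ||U V D||_F^2 over U.

    Y:   (N, T) Casorati matrix, one row per voxel, one column per frame
    V:   (L, T) temporal basis with L << T
    lam: non-negative weight of the temporal-difference penalty
    Returns U with shape (N, L).
    """
    T = V.shape[1]
    # Temporal finite-difference operator: (x @ D)[t] = x[t + 1] - x[t].
    D = np.diff(np.eye(T), axis=1)              # shape (T, T - 1)
    VD = V @ D                                  # shape (L, T - 1)
    # Normal equations U @ G = Y @ V^H involve only an L x L matrix G,
    # so the cost of the solve does not grow with the number of frames.
    G = V @ V.conj().T + lam * (VD @ VD.conj().T)
    rhs = Y @ V.conj().T                        # shape (N, L)
    # Solve U @ G = rhs via the transposed system G^T @ U^T = rhs^T.
    return np.linalg.solve(G.T, rhs.T).T
```

    In this sketch a temporal basis could be taken, for example, from the leading right singular vectors of Y (`np.linalg.svd(Y, full_matrices=False)[2][:L]`). Larger values of `lam` give smoother time courses at the cost of data fidelity.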

    Magnetic Resonance Imaging of the Paediatric Respiratory Tract


    Combined brain language connectivity and intraoperative neurophysiologic techniques in awake craniotomy for eloquent-area brain tumor resection

    Speech processing can be disturbed by primary brain tumors (PBT). Improved presurgical planning techniques decrease the neurological morbidity associated with tumor resection during awake craniotomy. The aims of this work were: 1. To perform Diffusion Kurtosis Imaging based tractography (DKI-tract) in the detection of brain tracts involved in language; 2. To investigate which factors contribute to functional magnetic resonance imaging (fMRI) maps in predicting eloquent language regional reorganization; 3. To determine the technical aspects of accelerometric (ACC) recording of speech during surgery. DKI-tracts were streamlined using a 1.5T magnetic resonance scanner. The number of tracts and the fiber pathways were compared between DKI and standard Diffusion Tensor Imaging (DTI) in healthy subjects (HS) and PBT patients. fMRI data were acquired using task-specific and resting-state paradigms during language and motor tasks. After testing the influence of intraoperative fMRI on the number of direct cortical stimulation (DCS) stimuli, graph-theory measures were extracted and analyzed. For speech recording, ACC signals were recorded after evaluating neck positions and filter bandwidths. To test this method, language disturbances were recorded in patients with dysphonia and after applying DCS in the inferior frontal gyrus. In HS, by contrast, reaction time was recorded during speech execution. DKI-tract showed an increased number of arcuate fascicle tracts in PBT patients, and fewer spurious tracts were identified with DKI-tract. Intraoperative fMRI combined with DCS required a similar number of stimuli to DCS alone. Increased local centrality accompanied ipsilateral and contralateral language reorganization. ACC recordings showed minor artifact contamination when the sensor was placed at the suprasternal notch and a 20-200 Hz filter bandwidth was used. Patients with dysphonia showed decreased amplitude and frequency in comparison with HS. ACC detected an additional 11% of disturbances after DCS, and a shortening of latency in the presence of a loud stimulus during speech execution. This work improved current knowledge on presurgical planning techniques based on structural and functional neuroimaging connectivity of the brain and on speech recording.

    The Digital Fish Library: Using MRI to Digitize, Database, and Document the Morphological Diversity of Fish

    Museum fish collections possess a wealth of anatomical and morphological data that are essential for documenting and understanding biodiversity. Obtaining access to specimens for research, however, is not always practical and frequently conflicts with the need to maintain the physical integrity of specimens and the collection as a whole. Non-invasive three-dimensional (3D) digital imaging therefore plays a critical role in digitizing these specimens for anatomical and morphological analysis, and it provides an efficient means of storing and sharing the imaging data online. Here we describe the development of the Digital Fish Library (DFL, http://www.digitalfishlibrary.org), an online digital archive of high-resolution, high-contrast magnetic resonance imaging (MRI) scans of the soft tissue anatomy of an array of fishes preserved in the Marine Vertebrate Collection of Scripps Institution of Oceanography. We have imaged and uploaded MRI data for over 300 marine and freshwater species, developed a data archival and retrieval system with a web-based image analysis and visualization tool, and integrated these into the public DFL website to disseminate data and associated metadata freely over the web. We show that MRI is a rapid and powerful method for accurately depicting the in-situ soft-tissue anatomy of preserved fishes in sufficient detail for large-scale comparative digital morphology. However, these 3D volumetric data require a sophisticated computational and archival infrastructure in order to be broadly accessible to researchers and educators.