Search CORE

1,872 research outputs found

A multilinear tongue model derived from speech related MRI data of the human vocal tract

Author: Alexander Hewer
Allen
Ananthakrishnan
Badin
Badin
Badin
Baer
Beautemps
Bijar
Blandin
Blanz
Bolkart
Botsch
Brunner
Buchaillard
Buchaillard
Burdumy
De Silva
Demolin
Dryden
Elie
Engwall
Engwall
Engwall
Eryildirim
Fang
Foldvik
Fu
Fuchs
Geng
Harandi
Harandi
Harshman
Harshman
Hewer
Hewer
Honda
Hoole
Hoole
Ingmar Steiner
International Phonetic Association
Jackson
Johnson
Kaburagi
Kiers
Kim
Korin Richmond
Kröger
Ladefoged
Ladefoged
Le Maguer
Lee
Li
Lingala
Lingala
Liu
McGurk
Mermelstein
Narayanan
Narayanan
Narayanan
Niebergall
Otsu
Peng
Raeesy
Richmond
Rodrigues
Rosset
Rudy
Scott
Serrurier
Shadle
Stefanie Wuhrer
Steiner
Stone
Stone
Stone
Styner
Tiede
Toutios
Tucker
Valdés Vargas
Valdés Vargas
Weickert
Weirich
Weirich
Woo
Woo
Wu
Yunusova
Zheng
Publication venue: 'Elsevier BV'
Publication date: 21/02/2018
Field of study

We present a multilinear statistical model of the human tongue that captures anatomical and tongue pose related shape variations separately. The model is derived from 3D magnetic resonance imaging data of 11 speakers sustaining speech related vocal tract configurations. The extraction is performed by using a minimally supervised method that uses as basis an image segmentation approach and a template fitting technique. Furthermore, it uses image denoising to deal with possibly corrupt data, palate surface information reconstruction to handle palatal tongue contacts, and a bootstrap strategy to refine the obtained shapes. Our evaluation concludes that limiting the degrees of freedom for the anatomical and speech related variations to 5 and 4, respectively, produces a model that can reliably register unknown data while avoiding overfitting effects. Furthermore, we show that it can be used to generate a plausible tongue animation by tracking sparse motion capture data

arXiv.org e-Print Archive

Crossref

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Edinburgh Research Explorer

A stabilized finite element method for the mixed wave equation in an ALE framework with application to diphthong production

Author: Arnela Marc
Codina Ramon
Espinoza Román Héctor Gabriel
Guasch Fortuny Oriol
Publication venue: 'S. Hirzel Verlag'
Publication date: 01/01/2016
Field of study

The archived file is not the final published version of the article. © (2016) S. Hirzel Verlag/European Acoustics Association The definitive publisher-authenticated version is available online at http://www.ingentaconnect.com/contentone/dav/aaua/2016/00000102/00000001/art00012 Readers must contact the publisher for reprint or permission to use the material in any form.Working with the wave equation in mixed rather than irreducible form allows one to directly account for both, the acoustic pressure field and the acoustic particle velocity field. Indeed, this becomes the natural option in many problems, such as those involving waves propagating in moving domains, because the equations can easily be set in an arbitrary Lagrangian-Eulerian (ALE) frame of reference. Yet, when attempting a standard Galerkin finite element solution (FEM) for them, it turns out that an inf-sup compatibility constraint has to be satisfied, which prevents from using equal interpolations for the approximated acoustic pressure and velocity fields. In this work it is proposed to resort to a subgrid scale stabilization strategy to circumvent this condition and thus facilitate code implementation. As a possible application, we address the generation of diphthongs in voice production.Peer ReviewedPostprint (author's final draft

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC

Scipedia

Registration and statistical analysis of the tongue shape during speech production

Author: Hewer Alexander
Publication venue: 'Walter de Gruyter GmbH'
Publication date: 01/01/2019
Field of study

This thesis analyzes the human tongue shape during speech production. First, a semi-supervised approach is derived for estimating the tongue shape from volumetric magnetic resonance imaging data of the human vocal tract. Results of this extraction are used to derive parametric tongue models. Next, a framework is presented for registering sparse motion capture data of the tongue by means of such a model. This method allows to generate full three-dimensional animations of the tongue. Finally, a multimodal and statistical text-to-speech system is developed that is able to synthesize audio and synchronized tongue motion from text.Diese Dissertation beschäftigt sich mit der Analyse der menschlichen Zungenform während der Sprachproduktion. Zunächst wird ein semi-überwachtes Verfahren vorgestellt, mit dessen Hilfe sich Zungenformen von volumetrischen Magnetresonanztomographie- Aufnahmen des menschlichen Vokaltrakts schätzen lassen. Die Ergebnisse dieses Extraktionsverfahrens werden genutzt, um ein parametrisches Zungenmodell zu konstruieren. Danach wird eine Methode hergeleitet, die ein solches Modell nutzt, um spärliche Bewegungsaufnahmen der Zunge zu registrieren. Dieser Ansatz erlaubt es, dreidimensionale Animationen der Zunge zu erstellen. Zuletzt wird ein multimodales und statistisches Text-to-Speech-System entwickelt, das in der Lage ist, Audio und die dazu synchrone Zungenbewegung zu synthetisieren.German Research Foundatio

Universaar

Acronym

Accelerated partial separable model using dimension-reduced optimization technique for ultra-fast cardiac MRI

Author: Fu Mingzhu
Li Rui
Li Zhongsen
Liu Chuyu
Sun Aiqi
Wang Shuai
Wei Haining
Publication venue
Publication date: 01/04/2023
Field of study

Objective. Imaging dynamic object with high temporal resolution is challenging in magnetic resonance imaging (MRI). Partial separable (PS) model was proposed to improve the imaging quality by reducing the degrees of freedom of the inverse problem. However, PS model still suffers from long acquisition time and even longer reconstruction time. The main objective of this study is to accelerate the PS model, shorten the time required for acquisition and reconstruction, and maintain good image quality simultaneously. Approach. We proposed to fully exploit the dimension reduction property of the PS model, which means implementing the optimization algorithm in subspace. We optimized the data consistency term, and used a Tikhonov regularization term based on the Frobenius norm of temporal difference. The proposed dimension-reduced optimization technique was validated in free-running cardiac MRI. We have performed both retrospective experiments on public dataset and prospective experiments on in-vivo data. The proposed method was compared with four competing algorithms based on PS model, and two non-PS model methods. Main results. The proposed method has robust performance against shortened acquisition time or suboptimal hyper-parameter settings, and achieves superior image quality over all other competing algorithms. The proposed method is 20-fold faster than the widely accepted PS+Sparse method, enabling image reconstruction to be finished in just a few seconds. Significance. Accelerated PS model has the potential to save much time for clinical dynamic MRI examination, and is promising for real-time MRI applications.Comment: 23 pages, 11 figures. Accepted as manuscript on Physics in Medicine & Biolog

arXiv.org e-Print Archive

Magnetic Resonance Imaging of the Paediatric Respiratory Tract

Author: Elders Bernadette
Publication venue: Erasmus University Rotterdam (EUR)
Publication date: 10/05/2022
Field of study

EUR Research Repository

Magnetic Resonance Imaging of the Paediatric Respiratory Tract

Author: Elders Bernadette
Publication venue: Erasmus University Rotterdam (EUR)
Publication date: 10/05/2022
Field of study

EUR Research Repository

Combined brain language connectivity and intraoperative neurophysiologic techniques in awake craniotomy for eloquent-area brain tumor resection

Author: Leote Joao
Publication venue
Publication date: 01/12/2019
Field of study

Speech processing can be disturbed by primary brain tumors (PBT). Improvement of presurgical planning techniques decrease neurological morbidity associated to tumor resection during awake craniotomy. The aims of this work were: 1. To perform Diffusion Kurtosis Imaging based tractography (DKI-tract) in the detection of brain tracts involved in language; 2. To investigate which factors contribute to functional magnetic resonance imaging (fMRI) maps in predicting eloquent language regional reorganization; 3. To determine the technical aspects of accelerometric (ACC) recording of speech during surgery. DKI-tracts were streamlined using a 1.5T magnetic resonance scanner. Number of tracts and fiber pathways were compared between DKI and standard Diffusion Tensor Imaging (DTI) in healthy subjects (HS) and PBT patients. fMRI data were acquired using task-specific and resting-state paradigms during language and motor tasks. After testing intraoperative fMRI’s influence on direct cortical stimulation (DCS) number of stimuli, graph-theory measures were extracted and analyzed. Regarding speech recording, ACC signals were recorded after evaluating neck positions and filter bandwidths. To test this method, language disturbances were recorded in patients with dysphonia and after applying DCS in the inferior frontal gyrus. In contrast, HS reaction time was recorded during speech execution. DKI-tract showed increased number of arcuate fascicle tracts in PBT patients. Lower spurious tracts were identified with DKI-tract. Intraoperative fMRI and DCS showed similar stimuli in comparison with DCS alone. Increased local centrality accompanied language ipsilateral and contralateral reorganization. ACC recordings showed minor artifact contamination when placed at the suprasternal notch using a 20-200 Hz filter bandwidth. Patients with dysphonia showed decreased amplitude and frequency in comparison with HS. ACC detected an additional 11% disturbances after DCS, and a shortening of latency within the presence of a loud stimuli during speech execution. This work improved current knowledge on presurgical planning techniques based on brain structural and functional neuroimaging connectivity, and speech recordingA função linguística do ser humano pode ser afetada pela presença de tumores cerebrais (TC) A melhoria de técnicas de planeamento pré-cirurgico diminui a morbilidade neurológica iatrogénica associada ao seu tratamento cirúrgico. O objetivo deste trabalho é: 1. Testar a fiabilidade da tractografia estimada por difusor de kurtose (tract-DKI), dos feixes cerebrais envolvidos na linguagem 2. Identificar os fatores que contribuem para o mapeamento linguagem por ressonância magnética funcional (RMf) na predição da neuroplasticidade. 3. Identificar aspetos técnicos do registo da linguagem por accelerometria (ACC). A DKI-tract foi estimada após realização de RM cerebral com 1.5T. O número e percurso das fibras foi avaliado. A RMf foi adquirida durante realização de tarefas linguísticas, motoras, e em repouso. Foi testada influência dos mapas de ativação calculados por RMf, no número de estímulos realizados durante a estimulação direta cortical (EDC) intraoperatória. Medidas de conectividade foram extraídas de regiões cerebrais. A posição e filtragem de sinal ACC foram estudadas após vocalização de palavras. O sinal ACC obtido em voluntários foi comparado com doentes disfónicos, após estimulação do giro inferior frontal, e após a adição de um estímulo sonoro perturbador durante vocalização. A tract-DKI estimou um elevado número de fascículos do feixe arcuato com menos falsos negativos. Os mapas linguísticos de RMf intraoperatória, não influenciou a EDC. Medidas de centralidade aumentaram após neuroplasticidade ipsilateral e contralateral. A posição supraesternal e a filtragem de sinal ACC entre 20-200Hz demonstrou menor ruido de contaminação. Este método identificou diminuição de frequência e amplitude em doentes com disfonia, 11% de erros linguísticos adicionais após estimulação e diminuição do tempo de latência quando presente o sinal sonoro perturbador. Este trabalho promoveu a utilização de novas técnicas no planeamento pré-cirúrgico do doente com tumor cerebral e alterações da linguagem através do estudo de conectividade estrutural, funcional e registo da linguagem

Universidade de Lisboa: Repositório.UL

The Digital Fish Library: Using MRI to Digitize, Database, and Document the Morphological Diversity of Fish

Author: A Petiet
A Prieto-Marquez
A Van der Linden
A Ziegler
A Ziegler
A Ziegler
A Ziegler
A Ziegler
AE Petiet
AG Hart
Allyson H. Doan
AM Heemskerk
Andrew Iwaniuk
AR Kherlopian
AV Suarez
B Chanet
B Chanet
B Dogdas
B Driehuys
BD Metscher
BL Rogers
C Bock
C Clabaut
CA Sepulveda
CN Perry
D Houle
DI Thickman
E Veliyulin
F Liste
FL Bookstein
G De Groof
G Giribet
G Waller
GJ Strijkers
GNH Waller
Gregory T. Baxter
H Lauridsen
H Pena
H. J. Walker
HC Peng
HJ Weinmann
HK Mok
HK Mok
HP Schultze
I Plyusnin
J Dinley
J Herberholz
J Sharpe
JD Newman
JE Winston
JFP Ullmann
JFP Ullmann
JFP Ullmann
JG Forbes
JM Tyszka
JM Tyszka
JO Cleary
JR Corfield
JR Mathiassen
JS Nelson
JS Sparks
K Seltman
Kara E. Yopak
KC Blits
KE Yopak
KE Yopak
KF Liem
KP Nott
KP Nott
Kristen M. Gledhill
KW Fishbein
L Fishelson
L Shen
Lawrence R. Frank
M Dhenain
MA Bernstein
MA Hayat
Matthew W. Peterson
ME Dickinson
MW Westneat
Ning Kang
O Betz
P Chakrabarty
P Johnston
PA Hastings
PA Yushkevich
PD Thacker
PE Thelwall
Philip A. Hastings
PJ Basser
PT Callaghan
R Cloutier
R Winterbottom
Rachel M. Berquist
RE Jacobs
RJ Bryson-Richardson
RL Pyle
RM Runcie
RW Prager
RW Scotland
SJ Blackband
SW Ruffins
SX Vasquez
T Rowe
T Rowe
T Walter
TM Kalat
TM Shepherd
U Richtarski
V Cnudde
WJAJ Smeets
WR Taylor
Y Jiang
Publication venue: Public Library of Science
Publication date: 06/04/2012
Field of study

Museum fish collections possess a wealth of anatomical and morphological data that are essential for documenting and understanding biodiversity. Obtaining access to specimens for research, however, is not always practical and frequently conflicts with the need to maintain the physical integrity of specimens and the collection as a whole. Non-invasive three-dimensional (3D) digital imaging therefore serves a critical role in facilitating the digitization of these specimens for anatomical and morphological analysis as well as facilitating an efficient method for online storage and sharing of this imaging data. Here we describe the development of the Digital Fish Library (DFL, http://www.digitalfishlibrary.org), an online digital archive of high-resolution, high-contrast, magnetic resonance imaging (MRI) scans of the soft tissue anatomy of an array of fishes preserved in the Marine Vertebrate Collection of Scripps Institution of Oceanography. We have imaged and uploaded MRI data for over 300 marine and freshwater species, developed a data archival and retrieval system with a web-based image analysis and visualization tool, and integrated these into the public DFL website to disseminate data and associated metadata freely over the web. We show that MRI is a rapid and powerful method for accurately depicting the in-situ soft-tissue anatomy of preserved fishes in sufficient detail for large-scale comparative digital morphology. However these 3D volumetric data require a sophisticated computational and archival infrastructure in order to be broadly accessible to researchers and educators

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Sound pressure distribution in a long, narrow hallway: Measurements versus results from a computer model with scattering from surface roughness and diffraction

Author: Christensen Claus Lynge
Rathsam Jonathan
Rindel Jens Holger
Wang Lily M.
Publication venue: 'Acoustical Society of America (ASA)'
Publication date: 01/01/2005
Field of study

Online Research Database In Technology