Effects of Palatal Expansion on Speech Production
Introduction: Rapid palatal expanders (RPEs) are a commonly used orthodontic adjunct for the treatment of posterior crossbites. RPEs are cemented to bilateral posterior teeth across the palate and thus may interfere with proper tongue movement and linguopalatal contact. The purpose of this study was to identify what specific role RPEs have on speech sound production for the child and early adolescent orthodontic patient. Materials and Methods: RPEs were treatment-planned for patients seeking orthodontics at Marquette University. Speech recordings were made using a phonetically balanced reading passage ("The Caterpillar") at 3 time points: 1) before RPE placement; 2) immediately after cementation; and 3) 10-14 days post appliance delivery. Measures of vocal tract resonance (formant center frequencies) were obtained for vowels, and measures of noise distribution (spectral moments) were obtained for consonants. A two-way repeated-measures ANOVA was used along with post-hoc tests for statistical analysis. Results: For the vowel /i/, the first formant increased and the second formant decreased, indicating a more inferior and posterior tongue position. For /e/, only the second formant decreased, indicating a more posterior tongue position. The formants did not return to baseline within the two-week study period. For the fricatives /s/, //, /t/, and /k/, a significant shift from high to low frequencies indicated distortion upon appliance placement. Of these, only /t/ fully returned to baseline during the study period. Conclusion: Numerous phonemes were distorted upon RPE placement, indicating altered speech sound production. For most phonemes, speech takes longer than two weeks to return to baseline, if it returns at all. Clinically, the results of this study will help with pre-treatment and interdisciplinary counseling for orthodontic patients receiving palatal expanders.
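The formant measures at the heart of this study can be illustrated with a small sketch. The snippet below estimates formant center frequencies from the roots of a linear-prediction polynomial, one standard way to obtain them; the function names, the LPC order, and the synthetic two-resonance test signal are assumptions for illustration, not the study's actual analysis pipeline.

```python
import numpy as np
from scipy.linalg import solve_toeplitz
from scipy.signal import lfilter

def lpc_coefficients(x, order):
    """Autocorrelation-method LPC: solve the Yule-Walker normal equations."""
    x = x - np.mean(x)
    r = np.array([np.dot(x[:len(x) - k], x[k:]) for k in range(order + 1)])
    a = solve_toeplitz((r[:-1], r[:-1]), r[1:])
    return np.concatenate(([1.0], -a))  # A(z) = 1 - sum_k a_k z^{-k}

def formants(signal, fs, order=10):
    """Estimate formant center frequencies from LPC polynomial roots."""
    a = lpc_coefficients(signal, order)
    roots = np.roots(a)
    # Keep one root of each conjugate pair, and only narrow-bandwidth poles.
    roots = roots[(np.imag(roots) > 0) & (np.abs(roots) > 0.9)]
    freqs = np.angle(roots) * fs / (2 * np.pi)  # pole angle -> frequency in Hz
    return np.sort(freqs[freqs > 90])           # drop near-DC poles

# Synthetic vowel-like test: a 100 Hz pulse train through two resonators
# with nominal resonances at 300 Hz and 2200 Hz (roughly /i/-like F1, F2).
fs = 8000
t = np.arange(0, 0.5, 1 / fs)
src = (np.sin(2 * np.pi * 100 * t) > 0.99).astype(float)
for f_c, bw in [(300, 60), (2200, 90)]:
    r = np.exp(-np.pi * bw / fs)
    theta = 2 * np.pi * f_c / fs
    src = lfilter([1.0], [1.0, -2 * r * np.cos(theta), r * r], src)
print(formants(src, fs))  # estimates should include values near 300 and 2200 Hz
```

The abstract's finding for /i/ (F1 up, F2 down after appliance placement) corresponds directly to shifts in the first two values such a procedure returns.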
Color and texture associations in voice-induced synesthesia
Voice-induced synesthesia, a form of synesthesia in which synesthetic perceptions are induced by the sounds of people's voices, appears to be relatively rare and has not been systematically studied. In this study we investigated the synesthetic color and visual texture perceptions experienced in response to different types of "voice quality" (e.g., nasal, whisper, falsetto). Experiences of three different groups (self-reported voice synesthetes, phoneticians, and controls) were compared using both qualitative and quantitative analysis in a study conducted online. Whilst, in the qualitative analysis, synesthetes used more color and texture terms to describe voices than either phoneticians or controls, only weak differences, and many similarities, between groups were found in the quantitative analysis. Notable consistent results between groups were the matching of higher speech fundamental frequencies with lighter and redder colors, the matching of "whispery" voices with smoke-like textures, and the matching of "harsh" and "creaky" voices with textures resembling dry cracked soil. These data are discussed in the light of current thinking about definitions and categorizations of synesthesia, especially in cases where individuals apparently have a range of different synesthetic inducers.
Testing the assumptions of linear prediction analysis in normal vowels
This paper develops an improved surrogate data test to show experimental evidence, for all the simple vowels of US English and for both male and female speakers, that Gaussian linear prediction analysis, a ubiquitous technique in current speech technologies, cannot extract all the dynamical structure of real speech time series. The test provides robust evidence undermining the validity of these linear techniques, supporting the assumptions of dynamical nonlinearity and/or non-Gaussianity common to more recent, more complex efforts at dynamical modelling of speech time series. However, an additional finding is that the classical assumptions cannot be ruled out entirely, and plausible evidence is given to explain the success of the linear Gaussian theory as a weak approximation to the true, nonlinear/non-Gaussian dynamics. This supports the use of appropriate hybrid linear/nonlinear/non-Gaussian modelling. With a calibrated calculation of the test statistic and a particular choice of experimental protocol, some of the known systematic problems of the method of surrogate data testing are circumvented, obtaining results that support the conclusions to a high level of significance.
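The surrogate data logic can be sketched as follows, assuming the common Fourier-transform surrogate and a third-order time-reversal asymmetry statistic. This is only the textbook version of the idea, applied to a logistic map rather than speech; the paper's improved test and calibrated statistic are more elaborate.

```python
import numpy as np

rng = np.random.default_rng(0)

def ft_surrogate(x, rng):
    """Fourier-transform surrogate: same power spectrum, randomized phases.
    Nonlinear/non-Gaussian structure in x is destroyed; linear correlations survive."""
    X = np.fft.rfft(x)
    phases = rng.uniform(0, 2 * np.pi, len(X))
    phases[0] = 0.0                      # keep the DC component real
    if len(x) % 2 == 0:
        phases[-1] = 0.0                 # keep the Nyquist component real
    return np.fft.irfft(np.abs(X) * np.exp(1j * phases), n=len(x))

def time_asymmetry(x):
    """Third-order time-reversal asymmetry; approximately 0 for linear Gaussian data."""
    d = np.diff(x)
    return np.mean(d ** 3) / np.mean(d ** 2) ** 1.5

# Test series: the logistic map, a deterministic nonlinear system.
n = 2048
x = np.empty(n)
x[0] = 0.4
for t in range(n - 1):
    x[t + 1] = 4.0 * x[t] * (1.0 - x[t])

stat = abs(time_asymmetry(x))
surr = [abs(time_asymmetry(ft_surrogate(x, rng))) for _ in range(99)]
# Rank test: if the observed statistic exceeds all 99 surrogate values,
# the linear-Gaussian null hypothesis is rejected at p < 0.01 (one-sided).
print(stat > max(surr))
```

For a genuinely linear Gaussian series, the observed statistic would fall somewhere within the surrogate distribution and the null would not be rejected; that contrast is what the test exploits.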
Objective dysphonia quantification in vocal fold paralysis: comparing nonlinear with classical measures
Clinical acoustic voice recording analysis is usually performed using classical perturbation measures including jitter, shimmer and noise-to-harmonic ratios. However, restrictive mathematical limitations of these measures prevent analysis for severely dysphonic voices. Previous studies of alternative nonlinear random measures addressed wide varieties of vocal pathologies. Here, we analyze a single vocal pathology cohort, testing the performance of these alternative measures alongside classical measures.

We present pre- and post-operative voice analysis in unilateral vocal fold paralysis (UVFP) patients undergoing standard medialisation thyroplasty surgery, and in healthy controls, using jitter, shimmer and noise-to-harmonic ratio (NHR) alongside the nonlinear measures recurrence period density entropy (RPDE), detrended fluctuation analysis (DFA) and correlation dimension. Systematizing the preparative editing of the recordings, we found that the novel measures were more stable, and hence more reliable, than the classical measures on healthy controls.

RPDE and jitter are sensitive to improvements pre- to post-operation. Shimmer, NHR and DFA showed no significant change (p > 0.05). All measures detect statistically significant and clinically important differences between controls and patients, both treated and untreated (p < 0.001, AUC > 0.7). Pre- to post-operation, GRBAS ratings show statistically significant and clinically important improvement in overall dysphonia grade (G) (AUC = 0.946, p < 0.001).

Re-calculating AUCs from other study data, we compare these results in terms of clinical importance. We conclude that, when preparative editing is systematized, nonlinear random measures may be useful tools for monitoring UVFP treatment effectiveness, and there may be applications for other forms of dysphonia.
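As an illustration of the classical perturbation measures named above, here are the standard "local" definitions of jitter and shimmer applied to pre-extracted cycle periods and peak amplitudes. Cycle extraction itself is not shown, and the function names and synthetic data are assumptions; the study's own measurement software is not specified in the abstract.

```python
import numpy as np

def jitter_local(periods):
    """Local jitter (%): mean absolute consecutive-period difference
    relative to the mean period."""
    p = np.asarray(periods, dtype=float)
    return 100.0 * np.mean(np.abs(np.diff(p))) / np.mean(p)

def shimmer_local(amplitudes):
    """Local shimmer (%): the same form, applied to cycle peak amplitudes."""
    a = np.asarray(amplitudes, dtype=float)
    return 100.0 * np.mean(np.abs(np.diff(a))) / np.mean(a)

# Synthetic cycle data: 10 ms glottal periods with small random perturbation.
rng = np.random.default_rng(1)
periods = 0.010 + rng.normal(0.0, 0.0001, 200)   # seconds
amps = 1.0 + rng.normal(0.0, 0.03, 200)          # arbitrary units

print(jitter_local(periods), shimmer_local(amps))
```

These measures presuppose reliable cycle marking, which is exactly what breaks down in severely dysphonic voices; the nonlinear measures studied here (RPDE, DFA, correlation dimension) avoid that requirement.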

The evolution of rhythmic cognition: New perspectives and technologies in comparative research
Music is a pervasive phenomenon in human culture, and musical rhythm is virtually present in all musical traditions. Research on the evolution and cognitive underpinnings of rhythm can benefit from a number of approaches. We outline key concepts and definitions, allowing fine-grained analysis of rhythmic cognition in experimental studies. We advocate comparative animal research as a useful approach to answer questions about human music cognition and review experimental evidence from different species. Finally, we suggest future directions for research on the cognitive basis of rhythm. Apart from research in semi-natural setups, possibly allowed by "drum set for chimpanzees" prototypes presented here for the first time, mathematical modeling and systematic use of circular statistics may allow promising advances.
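As a sketch of the circular statistics advocated above, the snippet below computes the mean resultant length of tap phases relative to a beat period, together with the first-order Rayleigh test approximation p ≈ exp(-n R^2) for phase uniformity. The tap onsets are invented for illustration, and more accurate Rayleigh approximations exist for small samples.

```python
import numpy as np

def mean_resultant_length(phases):
    """Length R of the mean resultant vector of phase angles (radians).
    R near 1 means tight phase alignment; R near 0 means uniform spread."""
    z = np.exp(1j * np.asarray(phases))
    return float(np.abs(np.mean(z)))

def rayleigh_p(phases):
    """First-order Rayleigh test approximation: p ~ exp(-n * R^2).
    A small p rejects the hypothesis of uniformly spread phases."""
    n = len(phases)
    R = mean_resultant_length(phases)
    return float(np.exp(-n * R ** 2))

# Tap onsets (seconds) expressed as phases relative to a 500 ms beat period.
onsets = np.array([0.012, 0.505, 1.002, 1.498, 2.011, 2.497])
phases = 2 * np.pi * (onsets % 0.5) / 0.5
print(mean_resultant_length(phases), rayleigh_p(phases))
```

Because phases wrap around the circle, ordinary means and variances are misleading for such data; the resultant-vector formulation handles the wrap-around correctly, which is why circular statistics are the natural tool for rhythmic entrainment data.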
Articulating: the neural mechanisms of speech production
Speech production is a highly complex sensorimotor task involving tightly coordinated processing across large expanses of the cerebral cortex. Historically, the study of the neural underpinnings of speech suffered from the lack of an animal model. The development of non-invasive structural and functional neuroimaging techniques in the late 20th century has dramatically improved our understanding of the speech network. Techniques for measuring regional cerebral blood flow have illuminated the neural regions involved in various aspects of speech, including feedforward and feedback control mechanisms. In parallel, we have designed, experimentally tested, and refined a neural network model detailing the neural computations performed by specific neuroanatomical regions during speech. Computer simulations of the model account for a wide range of experimental findings, including data on articulatory kinematics and brain activity during normal and perturbed speech. Furthermore, the model is being used to investigate a wide range of communication disorders. R01 DC002852 - NIDCD NIH HHS; R01 DC007683 - NIDCD NIH HHS; R01 DC016270 - NIDCD NIH HHS. Accepted manuscript.
Constricted channel flow with different cross-section shapes
Pressure-driven steady flow through a uniform circular channel containing a constricted portion is a common problem in physiological flows, such as those underlying human speech sound production. The influence of the constriction's cross-section shape (circle, ellipse, circular sector) on the flow within and downstream from the constriction is experimentally quantified. An analytical boundary layer flow model is proposed which takes into account the hydraulic diameter of the cross-section shape. Comparison of the model outcome with experimental and three-dimensional numerically simulated flow data shows that the pressure distribution within the constriction can be modeled accurately, so that the model is of interest for analytical models of fluid-structure interaction without the assumption of two-dimensional flow.
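The hydraulic diameter that the model relies on has the standard definition below; the abstract does not give the model equations themselves, so only the definition, with the circular and elliptical cross-sections worked out as examples, is shown.

```latex
\[
D_h = \frac{4A}{P},
\qquad
\text{circle of radius } a:\quad
D_h = \frac{4\pi a^2}{2\pi a} = 2a,
\]
\[
\text{ellipse with semi-axes } a, b:\quad
D_h \approx \frac{4\pi a b}{\pi\left[\,3(a+b) - \sqrt{(3a+b)(a+3b)}\,\right]},
\]
```

where \(A\) is the cross-sectional area and \(P\) the wetted perimeter; the elliptical case uses Ramanujan's perimeter approximation and reduces to \(2a\) when \(b = a\), recovering the circular result.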
Magnetic resonance imaging of the brain and vocal tract: Applications to the study of speech production and language learning
The human vocal system is highly plastic, allowing for the flexible expression of language, mood and intentions. However, this plasticity is not stable throughout the life span, and it is well documented that adult learners encounter greater difficulty than children in acquiring the sounds of foreign languages. Researchers have used magnetic resonance imaging (MRI) to interrogate the neural substrates of vocal imitation and learning, and the correlates of individual differences in phonetic "talent". In parallel, a growing body of work using MR technology to directly image the vocal tract in real time during speech has offered primarily descriptive accounts of phonetic variation within and across languages. In this paper, we review the contribution of neural MRI to our understanding of vocal learning, and give an overview of vocal tract imaging and its potential to inform the field. We propose methods by which our understanding of speech production and learning could be advanced through the combined measurement of articulation and brain activity using MRI: specifically, we describe a novel paradigm, developed in our laboratory, that uses both MRI techniques to map directly, for the first time, between neural, articulatory and acoustic data in the investigation of vocalisation. This non-invasive, multimodal imaging method could be used to track central and peripheral correlates of spoken language learning, and speech recovery in clinical settings, as well as provide insights into potential sites for targeted neural interventions.