
    Enkinaesthetic polyphony: the underpinning for first-order languaging

    We contest two claims: (1) that language, understood as the processing of abstract symbolic forms, is an instrument of cognition and rational thought, and (2) that conventional notions of turn-taking, exchange structure, and move analysis are satisfactory as a basis for theorizing communication between living, feeling agents. We offer an enkinaesthetic theory describing the reciprocal affective neuro-muscular dynamical flows and tensions of co-agential dialogical sense-making relations. This “enkinaesthetic dialogue” is characterised by preconceptual, experientially recursive temporal dynamics that form the deep extended melodies of relationships in time. An understanding of how those relationships work, when we understand and are ourselves understood, and when communication falters and conflict arises, will depend on a grasp of our enkinaesthetic intersubjectivity.

    A multispeaker dataset of raw and reconstructed speech production real-time MRI video and 3D volumetric images

    Real-time magnetic resonance imaging (RT-MRI) of human speech production is enabling significant advances in speech science, linguistics, bio-inspired speech technology development, and clinical applications. Easy access to RT-MRI is, however, limited, and comprehensive datasets with broad access are needed to catalyze research across numerous domains. The imaging of the rapidly moving articulators and dynamic airway shaping during speech demands high spatio-temporal resolution and robust reconstruction methods. Further, while reconstructed images have been published, to date there is no open dataset providing raw multi-coil RT-MRI data from an optimized speech production experimental setup. Such datasets could enable new and improved methods for dynamic image reconstruction, artifact correction, feature extraction, and direct extraction of linguistically relevant biomarkers. The present dataset offers a unique corpus of 2D sagittal-view RT-MRI videos along with synchronized audio for 75 subjects performing linguistically motivated speech tasks, alongside the corresponding first-ever public-domain raw RT-MRI data. The dataset also includes 3D volumetric vocal tract MRI during sustained speech sounds and high-resolution static anatomical T2-weighted upper airway MRI for each subject. Comment: 27 pages, 6 figures, 5 tables, submitted to Nature Scientific Data.
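Raw multi-coil data makes it possible to prototype reconstruction steps directly. As a minimal Python sketch, here is the standard root-sum-of-squares coil combination, a common calibration-free baseline once per-coil k-space data has been gridded to images; the array shapes and the helper name are illustrative assumptions, not part of the dataset's published tooling.

```python
import numpy as np

def rss_coil_combine(coil_images: np.ndarray) -> np.ndarray:
    """Combine per-coil complex images into one magnitude image.

    coil_images: complex array of shape (n_coils, height, width),
    e.g. per-coil frames gridded from raw RT-MRI k-space data.
    Returns a real-valued (height, width) magnitude image.
    """
    # Root-sum-of-squares over the coil axis: calibration-free,
    # and a common baseline before trying model-based methods.
    return np.sqrt(np.sum(np.abs(coil_images) ** 2, axis=0))

# Hypothetical usage: combine one 8-coil, 84x84 sagittal frame.
frame = np.random.randn(8, 84, 84) + 1j * np.random.randn(8, 84, 84)
combined = rss_coil_combine(frame)
print(combined.shape)  # (84, 84)
```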

    Respiratory, postural and spatio-kinetic motor stabilization, internal models, top-down timed motor coordination and expanded cerebello-cerebral circuitry: a review

    Human dexterity, bipedality, and song/speech vocalization in Homo are reviewed from a motor evolution perspective with regard to 

(i) brain expansion in cerebello-cerebral circuitry, 
(ii) enhanced predictive internal modeling of body kinematics, body kinetics and action organization, 
(iii) motor mastery due to prolonged practice, 
(iv) task-determined top-down, and accurately timed feedforward motor adjustment of multiple-body/artifact elements, and 
(v) reduction in automatic preflex/spinal reflex mechanisms that would otherwise restrict such top-down processes. 

Dual-task interference and developmental neuroimaging research argues that such internal-model-based motor capabilities are concomitant with the evolution of 
(vi) enhanced attentional, executive function and other high-level cognitive processes, and that 
(vii) these provide dexterity, bipedality and vocalization with effector nonspecific neural resources. 

The possibility is also raised that such neural resources could 
(viii) underlie human internal-model-based nonmotor cognitions.

    Magnetic resonance imaging of the brain and vocal tract: Applications to the study of speech production and language learning

    The human vocal system is highly plastic, allowing for the flexible expression of language, mood and intentions. However, this plasticity is not stable throughout the life span, and it is well documented that adult learners encounter greater difficulty than children in acquiring the sounds of foreign languages. Researchers have used magnetic resonance imaging (MRI) to interrogate the neural substrates of vocal imitation and learning, and the correlates of individual differences in phonetic “talent”. In parallel, a growing body of work using MR technology to directly image the vocal tract in real time during speech has offered primarily descriptive accounts of phonetic variation within and across languages. In this paper, we review the contribution of neural MRI to our understanding of vocal learning and give an overview of vocal tract imaging and its potential to inform the field. We propose methods by which our understanding of speech production and learning could be advanced through the combined measurement of articulation and brain activity using MRI. Specifically, we describe a novel paradigm, developed in our laboratory, that uses both MRI techniques to map, for the first time, directly between neural, articulatory and acoustic data in the investigation of vocalisation. This non-invasive, multimodal imaging method could be used to track central and peripheral correlates of spoken language learning and speech recovery in clinical settings, as well as provide insights into potential sites for targeted neural interventions.

    Acoustic Cues Of Physical Formidability In Cage Fighters

    Across the animal kingdom, the sex that experiences the greater reproductive variance tends to evolve sexually dimorphic traits, both behavioral and morphological, which aid in reproduction. Human evolution has been marked by greater male intrasexual selection, and as a result men display a variety of secondary sexual characteristics that putatively serve to enhance biological fitness. Among these, men's fundamental frequency, closely related to the perception of pitch, is roughly half that of women's. Likewise, men's speech is more monotonic, showing less variance in pitch across an utterance than women's, while their formant dispersion, which gives the voice its timbre, is lower. The honesty of these vocal parameters as signaling mechanisms used in the context of intrasexual competition has been investigated by a host of researchers; however, no research to date has directly assessed the degree to which these parameters predict actual physical formidability, a key step in establishing the honesty of a signal. Here, I address this gap by testing whether these parameters are associated with fighting ability in a large sample of mixed martial arts fighters. Pitch, monotonicity, and timbre were extracted from interviews with 292 UFC® fighters and compared with the fighters' records. Pitch and monotonicity were associated with formidability such that a more masculine profile was associated with higher formidability; timbre, however, was not associated with formidability. Taken together, these results indicate that pitch and monotonicity may be honest signals of physical formidability.
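To make the acoustic measures concrete, here is a minimal Python sketch of how mean pitch, a pitch-variability proxy for monotonicity, and Fitch-style formant dispersion might be extracted with the parselmouth interface to Praat. The parameter defaults, the single midpoint formant sample, and the function name are illustrative assumptions, not the thesis's actual analysis pipeline.

```python
import numpy as np
import parselmouth  # Praat bindings: pip install praat-parselmouth

def voice_features(wav_path: str):
    """Extract mean F0, F0 variability, and formant dispersion."""
    snd = parselmouth.Sound(wav_path)

    # Fundamental frequency via Praat's default pitch tracker.
    pitch = snd.to_pitch()
    f0 = pitch.selected_array['frequency']
    f0 = f0[f0 > 0]                # keep voiced frames only
    mean_f0 = float(np.mean(f0))
    f0_sd = float(np.std(f0))      # lower SD = more monotonic speech

    # Formant dispersion (Fitch 1997): mean spacing of F1..F4,
    # sampled once at the recording midpoint for simplicity.
    formants = snd.to_formant_burg()
    t_mid = snd.duration / 2
    f = [formants.get_value_at_time(i, t_mid) for i in range(1, 5)]
    dispersion = (f[3] - f[0]) / 3.0

    return mean_f0, f0_sd, dispersion

# Hypothetical usage on one interview clip:
# mean_f0, f0_sd, dispersion = voice_features("fighter_interview.wav")
```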

    The Impact of Emotion Focused Features on SVM and MLR Models for Depression Detection

    Major depressive disorder (MDD) is a common mental health diagnosis, with estimates that upwards of 25% of affected individuals in the United States remain undiagnosed. Psychomotor symptoms of MDD affect the speed of control of the vocal tract, glottal source features, and the rhythm of speech. Speech enables people to perceive the emotion of the speaker, and MDD decreases the magnitude of mood an individual expresses. This study asks the question: if high-level features designed to combine acoustic features related to emotion detection are added to glottal source features and mean response time in support vector machine and multivariate logistic regression models, does that improve the recall of the MDD class? To answer this question, a literature review surveys common features in MDD detection, especially features related to emotion recognition. Using feature transformation, composite emotion-recognition features are produced and added to glottal source features for model evaluation.
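As a concrete illustration of the modeling comparison described above, the following scikit-learn sketch contrasts MDD-class recall with and without composite emotion features added to a glottal-source baseline. The synthetic arrays, feature counts, and the use of sklearn's SVC and LogisticRegression are stand-in assumptions; the thesis's actual features, data, and model configurations are not reproduced here.

```python
import numpy as np
from sklearn.svm import SVC
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.model_selection import cross_val_score

# Hypothetical design matrices: glottal-source features plus mean
# response time (baseline), then the same plus composite emotion features.
rng = np.random.default_rng(0)
n = 200
X_base = rng.normal(size=(n, 6))        # glottal features + response time
X_emotion = rng.normal(size=(n, 3))     # composite emotion features
X_full = np.hstack([X_base, X_emotion])
y = rng.integers(0, 2, size=n)          # 1 = MDD class

for name, model in [("SVM", SVC(kernel="rbf")),
                    ("LogReg", LogisticRegression(max_iter=1000))]:
    for label, X in [("baseline", X_base), ("+emotion", X_full)]:
        clf = make_pipeline(StandardScaler(), model)
        # Recall of the positive (MDD) class, 5-fold cross-validated.
        recall = cross_val_score(clf, X, y, cv=5, scoring="recall")
        print(f"{name} {label}: mean MDD recall = {recall.mean():.3f}")
```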

    Impaired generalization of speaker identity in the perception of familiar and unfamiliar voices

    In 2 behavioral experiments, we explored how the extraction of identity-related information from familiar and unfamiliar voices is affected by naturally occurring vocal flexibility and variability, introduced by different types of vocalizations and levels of volitional control during production. In a first experiment, participants performed a speaker discrimination task on vowels, volitional (acted) laughter, and spontaneous (authentic) laughter from 5 unfamiliar speakers. We found that performance was significantly impaired for spontaneous laughter, a vocalization produced under reduced volitional control. We additionally found that the detection of identity-related information fails to generalize across different types of nonverbal vocalizations (e.g., laughter vs. vowels) and across mismatches in volitional control within vocalization pairs (e.g., volitional laughter vs. spontaneous laughter), with performance levels indicating an inability to discriminate between speakers. In a second experiment, we explored whether personal familiarity with the speakers would afford greater accuracy and better generalization of identity perception. Using new stimuli, we largely replicated our previous findings: whereas familiarity afforded a consistent performance advantage for speaker discriminations, the experimental manipulations impaired performance to similar extents for familiar and unfamiliar listener groups. We discuss our findings with reference to prototype-based models of voice processing and suggest potential underlying mechanisms and representations of familiar and unfamiliar voice perception.