Search CORE

132 research outputs found

Fast Speech in Unit Selection Speech Synthesis

Author: Moers-Prinz Donata
Publication venue: Universität Bielefeld
Publication date: 01/01/2020
Field of study

Moers-Prinz D. Fast Speech in Unit Selection Speech Synthesis. Bielefeld: Universität Bielefeld; 2020.Speech synthesis is part of the everyday life of many people with severe visual disabilities. For those who are reliant on assistive speech technology the possibility to choose a fast speaking rate is reported to be essential. But also expressive speech synthesis and other spoken language interfaces may require an integration of fast speech. Architectures like formant or diphone synthesis are able to produce synthetic speech at fast speech rates, but the generated speech does not sound very natural. Unit selection synthesis systems, however, are capable of delivering more natural output. Nevertheless, fast speech has not been adequately implemented into such systems to date. Thus, the goal of the work presented here was to determine an optimal strategy for modeling fast speech in unit selection speech synthesis to provide potential users with a more natural sounding alternative for fast speech output

Publications at Bielefeld University

Spontaneous Voice Gender Imitation Abilities in Adult Speakers

Background The frequency components of the human voice play a major role in signalling the gender of the speaker. A voice imitation study was conducted to investigate individuals' ability to make behavioural adjustments to fundamental frequency (F0), and formants (Fi) in order to manipulate their expression of voice gender. Methodology/Principal Findings Thirty-two native British-English adult speakers were asked to read out loud different types of text (words, sentence, passage) using their normal voice and then while sounding as ‘masculine’ and ‘feminine’ as possible. Overall, the results show that both men and women raised their F0 and Fi when feminising their voice, and lowered their F0 and Fi when masculinising their voice. Conclusions/Significance These observations suggest that adult speakers are capable of spontaneous glottal and vocal tract length adjustments to express masculinity and femininity in their voice. These results point to a “gender code”, where speakers make a conventionalized use of the existing sex dimorphism to vary the expression of their gender and gender-related attributes

Public Library of Science (PLOS)

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Sussex Research Online

FigShare

A neurocognitive approach to the study of private speech

Author: Dolors Girbau
Publication venue: 'Confederacion Academica Nipona, Espanola y Latinoamericana'
Publication date: 01/01/2007
Field of study

The paper presents the current state of the art of research identifying the neurophysiological and neuroanatomical substrates of private speech, both in typical and clinical (or atypical) populations. First, it briefly describes the evolution of private speech research, which goes from classic traditions as the naturalistic and referential paradigms to the neurocognitive approach. An overview of the neurophysiological (e.g., event-related potentials or ERPs) and neuroimaging techniques (e.g., functional magnetic resonance imaging or fMRI) is also presented. The next three sections review empirical works about the neurocognitive basis of private speech, across three groups of techniques: ERPs; fMRI/MRI; and other neuroimaging techniques (positron emission tomography [PET], magnetoencephalogram [MEG], and repetitive transcranial magnetic stimulation [rTMS]). Such neurocognitive research analyzes the neural activity of individuals during a variety of task settings, including spontaneous and instructed overt and inner private speech use, subvocal verbalizations, and silent and overt reading. The fifth section focuses on electrophysiological and neuroimaging studies of private speech in atypical populations, for example: schizophrenia, pure alexia, hearing impairment, blindness, social phobia, alexithymia, Parkinson, and multiple sclerosis. The neurocognitive study of the various forms of private speech appears to be very promising in the understanding of these pathologies. Lastly, the advances and new challenges in the field are discussed. Keywords: private speech, neurocognition, neurolinguistics, psychopathologies, language disorders Este trabajo presenta el estado actual de la investigación que identifica los sustratos neurofisiológicos y neuroanatómicos del lenguaje privado, tanto en poblaciones típicas como en clínicas (o atípicas). Primero describe brevemente la evolución de la investigación del lenguaje privado, que van desde las tradiciones clásicas como los paradigmas naturalistas y referenciales al abordaje neurocognitivo. También se presenta una revisión de las técnicas neurofisiológicas (por ejemplo, potenciales relacionados con eventos o ERPs) y de neuroimagen (por ejemplo, imagen de resonancia magnética funcional o fMRI). Las siguientes tres secciones revisan los trabajos empíricos sobre la base neurocognitiva del lenguaje privado a través de tres grupos de técnicas: ERPs; fMRI/MRI; y otras técnicas de neuroimagen (tomografía de emisión de positrones [PET], magnetoencefalograma [MEG] y la estimulación magnética repetitiva transcraneal [rTMS]). Esta investigación neurocognitiva analiza la actividad neuronal de los individuos durante diversas tareas, incluyendo el uso del lenguaje privado espontáneo y observable bajo instrucciones y el lenguaje privado interno, las verbalizaciones subvocales y la lectura silenciosa y observable. La quinta sección se centra en los estudios electrofisiológicos y de neuroimágenes del lenguaje privado en poblaciones atípicas, por ejemplo, esquizofrenia, alexia pura, hipoacusia, ceguera, fobia social, alexithymia, Parkinson, y esclerosis múltiple. El estudio neurocognitivo de varias formas del lenguaje privado parece muy prometedor para la comprensión de estas patologías. Por último, se comentan los avances y los nuevos retos en el campo. Palabras clave: lenguaje privado, neurocognición, neurolingüística, psicopatologías, trastornos del lenguaj

CiteSeerX

Sixth-Grade Sight-Singing for Low Vision Students: A Course to Enhance Confidence and Music Reading Ability

Author: Carpenter Jamie Lane
Publication venue: Scholars Crossing
Publication date: 01/05/2021
Field of study

Students enter the private or public-school system with a wide variety of emotional, mental, and physical impairments that impact their confidence, self-esteem, and overall path to their future. One misunderstood and under-represented population of students are those who are not fully blind but fall under the category of low vision. Since the low vision spectrum is wide, students must advocate for themselves according to their unique visual condition. While some students may be nearsighted, others may experience color blindness, tunnel vision, a wide variety of partial blindness in one or both eyes, and more. If educators are not aware of a student\u27s slight visual impairment and students are not comfortable advocating for themselves, students could be missing out on fully exploring their passion and aptitude for music. This study will examine the existing research on the array of low vision impairments and how to help sixth-grade students understand and overcome their impairments using tailored techniques to successfully meet their goals of sight-singing music. A twelve-week curriculum is provided to guide music educators as they help low vision sixth-grade students meet musical goals despite their visual impairment

Liberty University Digital Commons

An auditory, acoustic, articulatory and sociophonetic study of Swedish Viby-i

Author: Westerberg Fabienne E.
Publication venue
Publication date: 01/01/2016
Field of study

The study investigates the acoustic, articulatory and sociophonetic properties of the Swedish /iː/ variant known as 'Viby-i' in 13 speakers of Central Swedish from Stockholm, Gothenburg, Varberg, Jönköping and Katrineholm. The vowel is described in terms of its auditory quality, its acoustic F1 and F2 values, and its tongue configuration. A brief, qualitative description of lip position is also included. Variation in /iː/ production is mapped against five sociolinguistic factors: city, dialectal region, metropolitan vs. urban location, sex and socioeconomic rating. Articulatory data is collected using ultrasound tongue imaging (UTI), for which the study proposes and evaluates a methodology. The study shows that Viby-i varies in auditory strength between speakers, and that strong instances of the vowel are associated with a high F1 and low F2, a trend which becomes more pronounced as the strength of Viby-i increases. The articulation of Viby-i is characterised by a lowered and backed tongue body, sometimes accompanied by a double-bunched tongue shape. The relationship between tongue position and acoustic results appears to be non-linear, suggesting either a measurement error or the influence of additional articulatory factors. Preliminary images of the lips show that Viby-i is produced with a spread but lax lip posture. The lip data also reveals parts of the tongue, which in many speakers appears to be extremely fronted and braced against the lower teeth, or sometimes protruded, when producing Viby-i. No sociophonetic difference is found between speakers from different cities or dialect regions. Metropolitan speakers are found to have an auditorily and acoustically stronger Viby-i than urban speakers, but this pattern is not matched in tongue backing or lowering. Overall the data shows a weak trend towards higher-class females having stronger Viby-i, but these results are tentative due to the limited size and stratification of the sample. Further research is needed to fully explore the sociophonetic properties of Viby-i

Glasgow Theses Service

Semantic radical consistency and character transparency effects in Chinese: an ERP study

Author: Su IF
Weekes BS
Publication venue: 'United States Sports Academy'
Publication date: 01/01/2009
Field of study

BACKGROUND: This event-related potential (ERP) study aims to investigate the representation and temporal dynamics of Chinese orthography-to-semantics mappings by simultaneously manipulating character transparency and semantic radical consistency. Character components, referred to as radicals, make up the building blocks used dur...postprin

HKU Scholars Hub