132 research outputs found

    Fast Speech in Unit Selection Speech Synthesis

    Get PDF
    Moers-Prinz D. Fast Speech in Unit Selection Speech Synthesis. Bielefeld: UniversitÀt Bielefeld; 2020.Speech synthesis is part of the everyday life of many people with severe visual disabilities. For those who are reliant on assistive speech technology the possibility to choose a fast speaking rate is reported to be essential. But also expressive speech synthesis and other spoken language interfaces may require an integration of fast speech. Architectures like formant or diphone synthesis are able to produce synthetic speech at fast speech rates, but the generated speech does not sound very natural. Unit selection synthesis systems, however, are capable of delivering more natural output. Nevertheless, fast speech has not been adequately implemented into such systems to date. Thus, the goal of the work presented here was to determine an optimal strategy for modeling fast speech in unit selection speech synthesis to provide potential users with a more natural sounding alternative for fast speech output

    Spontaneous Voice Gender Imitation Abilities in Adult Speakers

    Get PDF
    Background The frequency components of the human voice play a major role in signalling the gender of the speaker. A voice imitation study was conducted to investigate individuals' ability to make behavioural adjustments to fundamental frequency (F0), and formants (Fi) in order to manipulate their expression of voice gender. Methodology/Principal Findings Thirty-two native British-English adult speakers were asked to read out loud different types of text (words, sentence, passage) using their normal voice and then while sounding as ‘masculine’ and ‘feminine’ as possible. Overall, the results show that both men and women raised their F0 and Fi when feminising their voice, and lowered their F0 and Fi when masculinising their voice. Conclusions/Significance These observations suggest that adult speakers are capable of spontaneous glottal and vocal tract length adjustments to express masculinity and femininity in their voice. These results point to a “gender code”, where speakers make a conventionalized use of the existing sex dimorphism to vary the expression of their gender and gender-related attributes

    A neurocognitive approach to the study of private speech

    Get PDF
    The paper presents the current state of the art of research identifying the neurophysiological and neuroanatomical substrates of private speech, both in typical and clinical (or atypical) populations. First, it briefly describes the evolution of private speech research, which goes from classic traditions as the naturalistic and referential paradigms to the neurocognitive approach. An overview of the neurophysiological (e.g., event-related potentials or ERPs) and neuroimaging techniques (e.g., functional magnetic resonance imaging or fMRI) is also presented. The next three sections review empirical works about the neurocognitive basis of private speech, across three groups of techniques: ERPs; fMRI/MRI; and other neuroimaging techniques (positron emission tomography [PET], magnetoencephalogram [MEG], and repetitive transcranial magnetic stimulation [rTMS]). Such neurocognitive research analyzes the neural activity of individuals during a variety of task settings, including spontaneous and instructed overt and inner private speech use, subvocal verbalizations, and silent and overt reading. The fifth section focuses on electrophysiological and neuroimaging studies of private speech in atypical populations, for example: schizophrenia, pure alexia, hearing impairment, blindness, social phobia, alexithymia, Parkinson, and multiple sclerosis. The neurocognitive study of the various forms of private speech appears to be very promising in the understanding of these pathologies. Lastly, the advances and new challenges in the field are discussed. Keywords: private speech, neurocognition, neurolinguistics, psychopathologies, language disorders Este trabajo presenta el estado actual de la investigaciĂłn que identifica los sustratos neurofisiolĂłgicos y neuroanatĂłmicos del lenguaje privado, tanto en poblaciones tĂ­picas como en clĂ­nicas (o atĂ­picas). Primero describe brevemente la evoluciĂłn de la investigaciĂłn del lenguaje privado, que van desde las tradiciones clĂĄsicas como los paradigmas naturalistas y referenciales al abordaje neurocognitivo. TambiĂ©n se presenta una revisiĂłn de las tĂ©cnicas neurofisiolĂłgicas (por ejemplo, potenciales relacionados con eventos o ERPs) y de neuroimagen (por ejemplo, imagen de resonancia magnĂ©tica funcional o fMRI). Las siguientes tres secciones revisan los trabajos empĂ­ricos sobre la base neurocognitiva del lenguaje privado a travĂ©s de tres grupos de tĂ©cnicas: ERPs; fMRI/MRI; y otras tĂ©cnicas de neuroimagen (tomografĂ­a de emisiĂłn de positrones [PET], magnetoencefalograma [MEG] y la estimulaciĂłn magnĂ©tica repetitiva transcraneal [rTMS]). Esta investigaciĂłn neurocognitiva analiza la actividad neuronal de los individuos durante diversas tareas, incluyendo el uso del lenguaje privado espontĂĄneo y observable bajo instrucciones y el lenguaje privado interno, las verbalizaciones subvocales y la lectura silenciosa y observable. La quinta secciĂłn se centra en los estudios electrofisiolĂłgicos y de neuroimĂĄgenes del lenguaje privado en poblaciones atĂ­picas, por ejemplo, esquizofrenia, alexia pura, hipoacusia, ceguera, fobia social, alexithymia, Parkinson, y esclerosis mĂșltiple. El estudio neurocognitivo de varias formas del lenguaje privado parece muy prometedor para la comprensiĂłn de estas patologĂ­as. Por Ășltimo, se comentan los avances y los nuevos retos en el campo. Palabras clave: lenguaje privado, neurocogniciĂłn, neurolingĂŒĂ­stica, psicopatologĂ­as, trastornos del lenguaj

    Sixth-Grade Sight-Singing for Low Vision Students: A Course to Enhance Confidence and Music Reading Ability

    Get PDF
    Students enter the private or public-school system with a wide variety of emotional, mental, and physical impairments that impact their confidence, self-esteem, and overall path to their future. One misunderstood and under-represented population of students are those who are not fully blind but fall under the category of low vision. Since the low vision spectrum is wide, students must advocate for themselves according to their unique visual condition. While some students may be nearsighted, others may experience color blindness, tunnel vision, a wide variety of partial blindness in one or both eyes, and more. If educators are not aware of a student\u27s slight visual impairment and students are not comfortable advocating for themselves, students could be missing out on fully exploring their passion and aptitude for music. This study will examine the existing research on the array of low vision impairments and how to help sixth-grade students understand and overcome their impairments using tailored techniques to successfully meet their goals of sight-singing music. A twelve-week curriculum is provided to guide music educators as they help low vision sixth-grade students meet musical goals despite their visual impairment

    An auditory, acoustic, articulatory and sociophonetic study of Swedish Viby-i

    Get PDF
    The study investigates the acoustic, articulatory and sociophonetic properties of the Swedish /iː/ variant known as 'Viby-i' in 13 speakers of Central Swedish from Stockholm, Gothenburg, Varberg, Jönköping and Katrineholm. The vowel is described in terms of its auditory quality, its acoustic F1 and F2 values, and its tongue configuration. A brief, qualitative description of lip position is also included. Variation in /iː/ production is mapped against five sociolinguistic factors: city, dialectal region, metropolitan vs. urban location, sex and socioeconomic rating. Articulatory data is collected using ultrasound tongue imaging (UTI), for which the study proposes and evaluates a methodology. The study shows that Viby-i varies in auditory strength between speakers, and that strong instances of the vowel are associated with a high F1 and low F2, a trend which becomes more pronounced as the strength of Viby-i increases. The articulation of Viby-i is characterised by a lowered and backed tongue body, sometimes accompanied by a double-bunched tongue shape. The relationship between tongue position and acoustic results appears to be non-linear, suggesting either a measurement error or the influence of additional articulatory factors. Preliminary images of the lips show that Viby-i is produced with a spread but lax lip posture. The lip data also reveals parts of the tongue, which in many speakers appears to be extremely fronted and braced against the lower teeth, or sometimes protruded, when producing Viby-i. No sociophonetic difference is found between speakers from different cities or dialect regions. Metropolitan speakers are found to have an auditorily and acoustically stronger Viby-i than urban speakers, but this pattern is not matched in tongue backing or lowering. Overall the data shows a weak trend towards higher-class females having stronger Viby-i, but these results are tentative due to the limited size and stratification of the sample. Further research is needed to fully explore the sociophonetic properties of Viby-i

    Semantic radical consistency and character transparency effects in Chinese: an ERP study

    Get PDF
    BACKGROUND: This event-related potential (ERP) study aims to investigate the representation and temporal dynamics of Chinese orthography-to-semantics mappings by simultaneously manipulating character transparency and semantic radical consistency. Character components, referred to as radicals, make up the building blocks used dur...postprin
    • 

    corecore