705 research outputs found

    Acoustic Intensity and Speech Breathing Kinematics in a Patient with Parkinson’s Disease

    Get PDF
    Parkinson’s disease (PD) is a neurodegenerative disease which affects the basal ganglia control circuit (Duffy, 2013). The motor speech disorder most strongly associated with PD is hypokinetic dysarthria, which presents with distinctive speech characteristics including reduced loudness and the inability to adequately maintain loud speech (Darley, Aronson, & Brown 1969; Duffy 2013). This is due to the variable kinematics for speech breathing associated with PD, which may result in abnormal muscular excursions, reduced vital capacity, and irregular breathing cycles (Duffy, 2013). The impaired ventilatory control can be attributed to the rigidity of muscles of inhalation and exhalation, as well as bradykinesia and hypokinesia. The study aimed to evaluate whether a patient with PD was able to manipulate their acoustic intensity, and if such intensity changes were accompanied by changes in speech breathing kinematics in a novel intraoperative environment. The study’s data were collected intra-operatively during surgery for deep brain stimulation and recordings from the subthalamic nucleus and cortex. The patient was instructed to modulate acoustic intensity while repeating three syllable CV triplets. Speech breathing kinematics of the rib cage were obtained using a Piezo Crystal Effort Sensor with a double buckle band throughout speech production. The speech breathing kinematics of interest were duration, displacement, and peak velocity of inhalation, peak velocity of exhalation, and duration from onset of exhalation to onset of speech, as well as a descriptive comparison between tidal breathing and speech breathing. Spearman Rho correlations indicated that there were weak to no relationships observed between speech breathing kinematics and intensity in this specific participant. However, a medium effect size (Hedge’s g) was observed between tidal and speech breathing for inhalation duration, and small to medium effect size for inhalation displacement and peak velocity. While previous literature suggests that people with PD can manipulate intensity when cued as a result of kinematic modulations for speech breathing, the current study does not support these findings for this one patient. However, previously reported differences between tidal and speech breathing were supported. Potential explanations for the lack of intensity modulation are explored, including constraints induced by the intra-operative environment

    Speech Breathing in Virtual Humans: An Interactive Model and Empirical Study

    Get PDF
    Human speech production requires the dynamic regulation of air through the vocal system. While virtual character systems commonly are capable of speech output, they rarely take breathing during speaking - speech breathing - into account. We believe that integrating dynamic speech breathing systems in virtual characters can significantly contribute to augmenting their realism. Here, we present a novel control architecture aimed at generating speech breathing in virtual characters. This architecture is informed by behavioral, linguistic and anatomical knowledge of human speech breathing. Based on textual input and controlled by a set of low-and high-level parameters, the system produces dynamic signals in real-time that control the virtual character's anatomy (thorax, abdomen, head, nostrils, and mouth) and sound production (speech and breathing). In addition, we perform a study to determine the effects of including breathing-motivated speech movements, such as head tilts and chest expansions during dialogue on a virtual character, as well as breathing sounds. This study includes speech that is generated both from a text-to-speech engine as well as from recorded voice

    The analysis of breathing and rhythm in speech

    Get PDF
    Speech rhythm can be described as the temporal patterning by which speech events, such as vocalic onsets, occur. Despite efforts to quantify and model speech rhythm across languages, it remains a scientifically enigmatic aspect of prosody. For instance, one challenge lies in determining how to best quantify and analyse speech rhythm. Techniques range from manual phonetic annotation to the automatic extraction of acoustic features. It is currently unclear how closely these differing approaches correspond to one another. Moreover, the primary means of speech rhythm research has been the analysis of the acoustic signal only. Investigations of speech rhythm may instead benefit from a range of complementary measures, including physiological recordings, such as of respiratory effort. This thesis therefore combines acoustic recording with inductive plethysmography (breath belts) to capture temporal characteristics of speech and speech breathing rhythms. The first part examines the performance of existing phonetic and algorithmic techniques for acoustic prosodic analysis in a new corpus of rhythmically diverse English and Mandarin speech. The second part addresses the need for an automatic speech breathing annotation technique by developing a novel function that is robust to the noisy plethysmography typical of spontaneous, naturalistic speech production. These methods are then applied in the following section to the analysis of English speech and speech breathing in a second, larger corpus. Finally, behavioural experiments were conducted to investigate listeners' perception of speech breathing using a novel gap detection task. The thesis establishes the feasibility, as well as limits, of automatic methods in comparison to manual annotation. In the speech breathing corpus analysis, they help show that speakers maintain a normative, yet contextually adaptive breathing style during speech. The perception experiments in turn demonstrate that listeners are sensitive to the violation of these speech breathing norms, even if unconsciously so. The thesis concludes by underscoring breathing as a necessary, yet often overlooked, component in speech rhythm planning and production

    Respiratory differences in spontaneous and scripted speech among bilingual adults

    Get PDF
    This investigation explored respiratory differences between a speaker’s first language (L1) and second language (L2) during spontaneous and scripted speech in six adult bilingual speakers (two native bilingual speakers and four non-native bilingual speakers). Respiratory kinematic data using Respitrace respiratory inductance plethysmography and acoustic recordings were collected during five tasks: tidal breathing at rest, scripted speech in L1, spontaneous speech in L1, scripted speech in L2, and spontaneous speech in L2. Results indicated a significant interaction effect between proficiency and syllables produced during spontaneous speech, but no other significant differences were found among inspiratory/expiratory duration, task, proficiency or language. The data provides insight into how a higher cognitive-linguistic load of speaking in a second language may affect speech breathing and contributes to the existing pool of knowledge on monolingual cognitive-linguistic demands and speech breathing

    Disfluencies in German adult- and infant-directed speech

    Get PDF
    Bellinghausen C, Betz S, Zahner K, Sasdrich A, Schröer M, Schröder B. Disfluencies in German adult- and infant-directed speech. In: Proceedings of SEFOS: 1st International Seminar on the Foundations of Speech. Breathing, Pausing and The Voice. 2019: 44-46

    The phonetics of speech breathing : pauses, physiology, acoustics, and perception

    Get PDF
    Speech is made up of a continuous stream of speech sounds that is interrupted by pauses and breathing. As phoneticians are primarily interested in describing the segments of the speech stream, pauses and breathing are often neglected in phonetic studies, even though they are vital for speech. The present work adds to a more detailed view of both pausing and speech breathing with a special focus on the latter and the resulting breath noises, investigating their acoustic, physiological, and perceptual aspects. We present an overview of how a selection of corpora annotate pauses and pause-internal particles, as well as a recording setup that can be used for further studies on speech breathing. For pauses, this work emphasized their optionality and variability under different tempos, as well as the temporal composition of silence and breath noise in breath pauses. For breath noises, we first focused on acoustic and physiological characteristics: We explored alignment between the onsets and offsets of audible breath noises with the start and end of expansion of both rib cage and abdomen. Further, we found similarities between speech breath noises and aspiration phases of /k/, as well as that breath noises may be produced with a more open and slightly more front place of articulation than realizations of schwa. We found positive correlations between acoustic and physiological parameters, suggesting that when speakers inhale faster, the resulting breath noises were more intense and produced more anterior in the mouth. Inspecting the entire spectrum of speech breath noises, we showed relatively flat spectra and several weak peaks. These peaks largely overlapped with resonances reported for inhalations produced with a central vocal tract configuration. We used 3D-printed vocal tract models representing four vowels and four fricatives to simulate in- and exhalations by reversing airflow direction. We found the direction to not have a general effect for all models, but only for those with high-tongue configurations, as opposed to those that were more open. Then, we compared inhalations produced with the schwa-model to human inhalations in an attempt to approach the vocal tract configuration in speech breathing. There were some similarities, however, several complexities of human speech breathing not captured in the models complicated comparisons. In two perception studies, we investigated how much information listeners could auditorily extract from breath noises. First, we tested categorizing different breath noises into six different types, based on airflow direction and airway usage, e.g. oral inhalation. Around two thirds of all answers were correct. Second, we investigated how well breath noises could be used to discriminate between speakers and to extract coarse information on speaker characteristics, such as age (old/young) and sex (female/male). We found that listeners were able to distinguish between two breath noises coming from the same or different speakers in around two thirds of all cases. Hearing one breath noise, classification of sex was successful in around 64%, while for age it was 50%, suggesting that sex was more perceivable than age in breath noises.Deutsche Forschungsgemeinschaft (DFG) – Projektnummer 418659027: "Pause-internal phonetic particles in speech communication

    Влияние дыхательных упражнений на функциональное состояние дошкольников 5–6 лет с нарушением речи

    Get PDF
    Статья посвящена влиянию дыхательных упражнений на функциональное состояние дошкольников 5–6 лет с нарушением речи. Цель дыхательных упражнений – увеличить объем дыхания, нормализовать его ритм, выработать плавный, цельный, экономный выдох. Правильное речевое дыхание является основой звучащей речи. Оно обеспечивает нормальное голосо– и звукообразование, сохраняет плавность и музыкальность речи. Ключевые слова: дошкольники, дыхательные упражнения, нарушение речи, речевое дыхание. The purpose of breathing exercises – breathing increase and normalize its rhythm, to develop a smooth, solid, economical exhale. Correct speech breathing is the basis of sounding speech. It provides normal voting and sound formation, retains smoothness and musicality of speech. Key words: preschoolers, breathing exercises, speech, speech breathing

    Using the Biological Feedback Method in Speech Therapy for the Formation of Speech Breathing in Preschool Children with Erased Dysarthria

    Get PDF
    The article discusses the problem of the formation of speech breathing in preschool children with an erased form of dysarthria using the biological feedback method. In children suffering from speech impairment, the work of the respiratory system is not normal. Disorders of prosody supplement violations of the sound-producing side of speech. As a result, in children, including the ones with dysarthria, speech becomes unclear and slurred. The theoretical part of the study considers the specificity of the prosodic side of speech in children with an erased form of dysarthria from the point of view of various researchers. The problem of dependence between the prosodic side of speech and the severity of a speech defect is considered. The practical part of the study reveals the content of the organisation of the empirical study on the formation of diaphragmatic relaxation breathing using a hardware-diagnostic complex “BF” (Biological Feedback). It has been established that the biological feedback method contributes to the formation of diaphragmatic-relaxation breathing in preschool children with an erased form of dysarthria. In pre-schoolers, suffering from erased dysarthria and having a history of attention deficit hyperactivity syndrome, the process of formation of diaphragmatic relaxation breathing becomes more complicated
    corecore