544 research outputs found
Do red deer stags (Cervus elaphus) use roar fundamental frequency (F0) to assess rivals?
It is well established that in humans, male voices are disproportionately lower pitched than female voices, and recent studies suggest that this dimorphism in fundamental frequency (F0) results from both intrasexual (male competition) and intersexual (female mate choice) selection for lower pitched voices in men. However, comparative investigations indicate that sexual dimorphism in F0 is not universal in terrestrial mammals. In the highly polygynous and sexually dimorphic Scottish red deer Cervus elaphus scoticus, more successful males give sexually-selected calls (roars) with higher minimum F0s, suggesting that high, rather than low F0s advertise quality in this subspecies. While playback experiments demonstrated that oestrous females prefer higher pitched roars, the potential role of roar F0 in male competition remains untested. Here we examined the response of rutting red deer stags to playbacks of re-synthesized male roars with different median F0s. Our results show that stags’ responses (latencies and durations of attention, vocal and approach responses) were not affected by the F0 of the roar. This suggests that intrasexual selection is unlikely to strongly influence the evolution of roar F0 in Scottish red deer stags, and illustrates how the F0 of terrestrial mammal vocal sexual signals may be subject to different selection pressures across species. Further investigations on species characterized by different F0 profiles are needed to provide a comparative background for evolutionary interpretations of sex differences in mammalian vocalizations
Using Active Shape Modeling Based on MRI to Study Morphologic and Pitch-Related Functional Changes Affecting Vocal Structures and the Airway
Copyright © 2013 The Voice Foundation. Published by Mosby, Inc. All rights reserved.Peer reviewedPostprin
Recommended from our members
A novel framework for high-quality voice source analysis and synthesis
This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University.The analysis, parameterization and modeling of voice source estimates obtained via inverse filtering of recorded speech are some of the most challenging areas of speech processing owing to the fact humans produce a wide range of voice source realizations and that the voice source estimates commonly contain artifacts due to the non-linear time-varying source-filter coupling. Currently, the most widely adopted representation of voice source signal is Liljencrants-Fant's (LF) model which was developed in late 1985. Due to the overly simplistic interpretation of voice source dynamics, LF model can not represent the fine temporal structure of glottal flow derivative realizations nor can it carry the sufficient spectral richness to facilitate a truly natural sounding speech synthesis. In this thesis we have introduced Characteristic Glottal Pulse Waveform Parameterization and Modeling (CGPWPM) which constitutes an entirely novel framework for voice source analysis, parameterization and reconstruction. In comparative evaluation of CGPWPM and LF model we have demonstrated that the proposed method is able to preserve higher levels of speaker dependant information from the voice source estimates and realize a more natural sounding speech synthesis. In general, we have shown that CGPWPM-based speech synthesis rates highly on the scale of absolute perceptual acceptability and that speech signals are faithfully reconstructed on consistent basis, across speakers, gender. We have applied CGPWPM to voice quality profiling and text-independent voice quality conversion method. The proposed voice conversion method is able to achieve the desired perceptual effects and the modified
speech remained as natural sounding and intelligible as natural speech. In this thesis, we have also developed an optimal wavelet thresholding strategy for voice source signals which is able to suppress aspiration noise and still retain both the slow and the rapid variations in the voice source estimate
Recommended from our members
Laryngeal manual therapy: a preliminary study to examine its treatment effects in the management of muscle tension dysphonia
The objectives of this study were to determine appropriate acoustic and outcome measures for the evaluation of a method of laryngeal manual therapy (LMT) used in the treatment of patients with muscle tension dysphonia (MTD). The effects of this technique were also investigated. The study was based on the hypotheses that the vertical position of the larynx in the vocal tract would lower, that the quality of the voice would normalize, and that a reduction in any vocal tract discomfort (VTD) would occur after LMT. This was a small, prospective, repeated measures pilot study in which each member of the research team was "blinded" to all other stages of the study and during which all data were anonymized until the final stage of data analysis. Ten subjects presenting with MTD completed outcome measures and provided audiorecordings immediately before, immediately after, and 1 week after LMT. The Kay CSL 4150 was used for signal acquisition and for some acoustic measurements. Spectrographic evaluation was accomplished with Praat. A new perceptual, self-rating scale, the VTD scale, and a new proforma for use by the clinician for palpatory evaluation, were developed for the study. Relative average perturbation during connected speech was significantly reduced after LMT, indicating a reduction in abnormal vocal function. The severity and frequency of VTD was shown to have reduced after LMT. This pilot study showed positive evidence for LMT as a method of therapy in the treatment of hyperfunctional voice disorders. Its effects were shown to be measurable with both acoustical analysis and the VTD scale
Acoustic and electroglottographic (EGG) characteristics of tracheoesophageal speech of Cantonese
This study investigated the acoustic and electroglottographic (EGG) characteristics of tracheoesophageal (TE) speech of Cantonese. Seven TE speakers and seven aged-matched laryngeal (NL) speakers produced sustained vowel phonation and passage reading. Both acoustic and EGG signals were recorded and analyzed using software programs Praat and Voce Vista. Results indicated that there was a significantly lower fundamental frequency (F0) for TE than NL speakers in passage reading. Significantly higher jitter, shimmer and closed quotient (CQ), and lower harmonic-to-noise ratio (H/N) values were associated with TE speech in both sustained vowel phonation and passage reading. Higher formant frequencies in sustained /i/ phonation were found for TE speakers. The findings appear to explain the perceptually hoarse, breathy and low-pitch voice of TE speech. Results were discussed in terms of higher position, greater tissue density, slower movement during closing phase and aperiodic vibration of neoglottis of TE speakers than the vocal folds of NL speakers.published_or_final_versionSpeech and Hearing SciencesBachelorBachelor of Science in Speech and Hearing Science
Vocal tract modelling in fallow deer: are male groans nasalized?
Males of several species of deer have a descended and mobile larynx, resulting in an unusually long vocal tract, which can be further extended by lowering the larynx during call production. Formant frequencies are lowered as the vocal tract is extended, as predicted when approximating the vocal tract as a uniform quarter wavelength resonator. However, formant frequencies in polygynous deer follow uneven distribution patterns, indicating that the vocal tract configuration may in fact be rather complex. We CT-scanned the head and neck region of two adult male fallow deer specimens with artificially extended vocal tracts and measured the cross-sectional areas of the supra-laryngeal vocal tract along the oral and nasal tracts. The CT data was then used to predict the resonances produced by three possible configurations, including the oral vocal tract only, the nasal vocal tract only, or combining both. We found that the area functions from the combined oral and nasal vocal tracts produced resonances more closely matching the formant pattern and scaling observed in fallow deer groans than those predicted by the area functions of the oral vocal tract only or of the nasal vocal tract only. This indicates that the nasal and oral vocal tracts are both simultaneously involved in the production of a nonhuman mammal vocalisation, and suggests that the potential for nasalization in putative oral loud-calls should be carefully considered
Cross modal perception of body size in domestic dogs (Canis familiaris)
While the perception of size-related acoustic variation in animal vocalisations is well documented, little attention has been given to how this information might be integrated with corresponding visual information. Using a cross-modal design, we tested the ability of domestic dogs to match growls resynthesised to be typical of either a large or a small dog to size- matched models. Subjects looked at the size-matched model significantly more often and for a significantly longer duration than at the incorrect model, showing that they have the ability to relate information about body size from the acoustic domain to the appropriate visual category. Our study suggests that the perceptual and cognitive mechanisms at the basis of size assessment in mammals have a multisensory nature, and calls for further investigations of the multimodal processing of size information across animal species
- …