    Acoustic cues for the Korean stop contrast: dialectal variation

    In this study, cross-dialectal variation in the use of the acoustic cues of VOT and F0 to mark the laryngeal contrast in Korean stops is examined in Chonnam Korean and Seoul Korean. Prior experimental results (Han & Weitzman, 1970; Hardcastle, 1973; Jun, 1993 & 1998; Kim, C., 1965) show that pitch values at the onset of the vowel following the target stop play a supplementary role to VOT in designating the three contrastive laryngeal categories. F0 contours are determined in part by the intonational system of a language, which raises the question of how the intonational system interacts with phonological contrasts. Intonational differences might be linked to dissimilar patterns in the use of the complementary acoustic cues of VOT and F0. This hypothesis is tested with six Korean speakers: three Seoul Korean and three Chonnam Korean speakers. The results show that Chonnam Korean involves more of a 3-way VOT distinction and a 2-way distinction in F0 distribution, whereas Seoul Korean shows more of a 3-way F0 distribution and a 2-way VOT distinction. The two acoustic cues are complementary in that one cue marks the 3-way contrast rather faithfully, while the other marks the contrast less distinctively. These variations also appear not to be completely arbitrary, but linked to the phonological characteristics of the dialects. Chonnam Korean, in which the initial tonal realization in the accentual phrase is expected to be more salient, tends to minimize the F0 perturbation effect from the preceding consonants by allowing more overlap in the F0 distribution. The 3-way distribution of VOT in Chonnam Korean can, in compensation, also be understood as a durational sensitivity. Lacking these characteristics, Seoul Korean shows a relatively more overlapping distribution in VOT and more 3-way separation in the F0 distribution.
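
    The cue measurements discussed here could be approximated from hand-labelled recordings roughly as in the following Python sketch. The annotation times, window settings, and the file name "token.wav" are assumptions for illustration, not the authors' actual procedure.

    import numpy as np
    from scipy.io import wavfile

    def vot_ms(t_burst, t_voicing_onset):
        # Voice onset time: interval from the annotated stop release (burst)
        # to the onset of voicing, in milliseconds.
        return (t_voicing_onset - t_burst) * 1000.0

    def onset_f0(path, t_voicing_onset, win=0.040, fmin=75.0, fmax=400.0):
        # Rough F0 estimate (Hz) from a short window at vowel onset, taken from
        # the autocorrelation peak between plausible pitch-period lags.
        sr, x = wavfile.read(path)
        if x.ndim > 1:                      # keep one channel if the file is stereo
            x = x[:, 0]
        frame = x[int(t_voicing_onset * sr):int((t_voicing_onset + win) * sr)].astype(float)
        frame -= frame.mean()
        ac = np.correlate(frame, frame, mode="full")[len(frame) - 1:]
        lo, hi = int(sr / fmax), int(sr / fmin)
        lag = lo + int(np.argmax(ac[lo:hi]))
        return sr / lag

    # Hypothetical hand-labelled times (s) for one token:
    print(vot_ms(0.212, 0.248), onset_f0("token.wav", 0.248))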

    Context-related acoustic variation in male fallow deer (Dama dama) groans

    While social and behavioural contexts are known to affect the acoustic structure of vocal signals in several mammal species, few studies have investigated context-related acoustic variation during inter-sexual advertisement and/or intra-sexual competition. Here we recorded male fallow deer groans during the breeding season and investigated how key acoustic parameters (fundamental frequency and formant frequencies) vary as a function of the social context in which they are produced. We found that in the presence of females, male fallow deer produced groans with higher mean fundamental frequency when vocal males were also present than they did when no vocal males were in close vicinity. We attribute this to the increased arousal state typically associated with this context. In addition, groan minimum formant frequency spacing was slightly but significantly lower (indicating marginally more extended vocal tracts) when males were alone than when potential mates and/or competitors were nearby. This indicates that, contrary to our predictions, male fallow deer do not exaggerate the acoustic impression of their body size by further lowering their formant frequencies in the presence of potential mating partners and competitors. Furthermore, since the magnitude of the variation in groan minimum formant frequency spacing remains small compared to documented inter-individual differences, our findings are consistent with the hypothesis that formants are reliable static cues to body size during intra- and inter-sexual advertisement that do not concurrently encode dynamic motivation-related information.
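
    Formant frequency spacing, and the apparent vocal tract length it implies, is commonly estimated by fitting the observed formants to a uniform-tube model. A minimal sketch of that calculation follows; the groan formant values are made up for illustration and are not data from this study.

    import numpy as np

    def formant_spacing(formants_hz):
        # Least-squares fit of F_i ~ (2i - 1)/2 * deltaF (uniform tube closed
        # at the glottis), constrained through the origin.
        y = np.asarray(formants_hz, dtype=float)
        x = (2 * np.arange(1, len(y) + 1) - 1) / 2.0
        return float(np.sum(x * y) / np.sum(x * x))

    def apparent_vtl_cm(delta_f, c=350.0):
        # Apparent vocal tract length from formant spacing: VTL = c / (2 * deltaF),
        # with c the speed of sound (m/s) in the warm, humid vocal tract.
        return 100.0 * c / (2.0 * delta_f)

    # Hypothetical groan formants (Hz):
    df = formant_spacing([190.0, 560.0, 930.0, 1300.0, 1680.0])
    print(df, apparent_vtl_cm(df))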

    Effect of formant frequency spacing on perceived gender in pre-pubertal children's voices

    Background: It is usually possible to identify the sex of a pre-pubertal child from their voice, despite the absence of sex differences in fundamental frequency at these ages. While it has been suggested that the overall spacing between formants (formant frequency spacing, ΔF) is a key component of the expression and perception of sex in children's voices, the effect of its continuous variation on sex and gender attribution has not yet been investigated.
    Methodology/Principal findings: In the present study we manipulated voice ΔF of eight year olds (two boys and two girls) along continua covering the observed variation of this parameter in pre-pubertal voices, and assessed the effect of this variation on adult ratings of speakers' sex and gender in two separate experiments. In the first experiment (sex identification), adults were asked to categorise the voice as either male or female. The resulting identification function exhibited a gradual slope from male to female voice categories. In the second experiment (gender rating), adults rated the voices on a continuum from “masculine boy” to “feminine girl”, gradually decreasing their masculinity ratings as ΔF increased.
    Conclusions/Significance: These results indicate that the role of ΔF in voice gender perception, which has been reported in adult voices, extends to pre-pubertal children's voices: variation in ΔF not only affects the perceived sex, but also the perceived masculinity or femininity of the speaker. We discuss the implications of these observations for the expression and perception of gender in children's voices given the absence of anatomical dimorphism in overall vocal tract length before puberty.
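
    One way such a ΔF continuum could be resynthesised is by shifting all formants of a recording by a constant ratio, for example with Praat's "Change gender" command via the parselmouth Python wrapper. The sketch below is only a rough illustration; the input file name and the ratio range are assumptions, not the study's actual stimulus parameters.

    import numpy as np
    import parselmouth
    from parselmouth.praat import call

    # Scale factors spanning a hypothetical boy-typical to girl-typical deltaF
    # range, expressed relative to the original voice (1.0 = unchanged).
    ratios = np.linspace(0.94, 1.10, 9)

    snd = parselmouth.Sound("child_utterance.wav")   # hypothetical recording
    for r in ratios:
        # "Change gender" shifts all formants by a constant ratio (equivalent to
        # rescaling deltaF); f0 median, pitch range, and duration are left unchanged.
        step = call(snd, "Change gender", 75, 600, float(r), 0, 1, 1)
        call(step, "Save as WAV file", f"step_{r:.2f}.wav")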

    Auditory communication in domestic dogs: vocal signalling in the extended social environment of a companion animal

    Domestic dogs produce a range of vocalisations, including barks, growls, and whimpers, which are shared with other canid species. The source–filter model of vocal production can be used as a theoretical and applied framework to explain how and why the acoustic properties of some vocalisations are constrained by physical characteristics of the caller, whereas others are more dynamic, influenced by transient states such as arousal or motivation. This chapter thus reviews how and why particular call types are produced to transmit specific types of information, and how such information may be perceived by receivers. As domestication is thought to have caused a divergence in the vocal behaviour of dogs as compared to the ancestral wolf, evidence of both dog–human and human–dog communication is considered. Overall, it is clear that domestic dogs have the potential to acoustically broadcast a range of information, which is available to conspecific and human receivers. Moreover, dogs are highly attentive to human speech and are able to extract speaker identity, emotional state, and even some types of semantic information.
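
    As a toy illustration of the source–filter model referenced in the chapter, the Python sketch below generates a crude glottal pulse train (source) and passes it through a cascade of formant resonators (filter). The pitch and formant values are invented and not tied to any dog vocalisation data.

    import numpy as np
    from scipy.signal import lfilter

    sr = 16000
    dur, f0 = 0.5, 110.0                       # hypothetical growl-like pitch
    n = int(sr * dur)

    # Source: impulse train at the fundamental frequency (very crude glottal source).
    source = np.zeros(n)
    source[::int(sr / f0)] = 1.0

    # Filter: cascade of second-order resonators, one per formant (centre freq, bandwidth).
    signal = source
    for freq, bw in [(600, 120), (1200, 150), (2400, 200)]:   # hypothetical formants
        r = np.exp(-np.pi * bw / sr)
        theta = 2 * np.pi * freq / sr
        signal = lfilter([1.0], [1.0, -2 * r * np.cos(theta), r * r], signal)

    signal /= np.max(np.abs(signal))           # normalise before writing out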

    Production and perception of English word-final stops by Malay speakers

    A few influential speech studies have been carried out using established speech learning models, which confirmed that the analysis of first language (L1) and second language (L2) at a phonemic level provides only a partial view of deeper relationships between languages in contact. Therefore, studies focusing on cross-language phonetic differences as a causative factor in L2 learner difficulties have been proposed to understand second language (L2) learners' speech production and how listeners respond perceptually to the phonetic properties of L2 speech. This paper presents a study of the production and perception of final stops by English learners (L2) whose first language is Malay (L1). A total of 23 students, comprising 16 male and 7 female Malay subjects (L1 Malay, L2 English) with normal hearing and speech development, participated in this study. A short interview was conducted in order to gain background information about each subject, to introduce them to the study, to inform them about the process of recording, the materials to be used in the recording session, and how the materials should be managed during recording time. Acoustic measurements of selected segments occurring in word-final position (via spectrographic analysis, syllable rhyme duration and phonation) were taken. Results of the voicing contrast realisation in Malay-accented English and Malaysian listeners' perceptual identification/discrimination abilities with final voiced/voiceless stops in Malay and English are presented and discussed. The findings revealed that the Malay students' realisation of final stops in L2 is largely identical to their L1. In addition, the results also showed that accurate 'perception' may not always lead to accurate 'production'.
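
    For the duration measurements, the analysis amounts to simple arithmetic over annotated interval times. A minimal Python sketch with a hypothetical annotation table follows; the word list, times, and column names are invented for illustration and are not the study's data.

    import pandas as pd

    # Hypothetical annotation table: one row per token with hand-labelled
    # interval times (s) and the intended voicing of the final stop.
    tokens = pd.DataFrame({
        "word":         ["bag", "back", "bead", "beat"],
        "voicing":      ["voiced", "voiceless", "voiced", "voiceless"],
        "vowel_onset":  [0.212, 0.198, 0.240, 0.233],
        "stop_release": [0.496, 0.430, 0.521, 0.468],
    })

    # Syllable rhyme duration, here approximated as vowel onset to stop release (ms);
    # rhymes are typically longer before voiced codas, one cue to the contrast.
    tokens["rhyme_ms"] = (tokens["stop_release"] - tokens["vowel_onset"]) * 1000.0
    print(tokens.groupby("voicing")["rhyme_ms"].mean())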

    Acoustic cues to tonal contrasts in Mandarin: Implications for cochlear implants

    The present study systematically manipulated three acoustic cues, namely fundamental frequency (f0), amplitude envelope, and duration, to investigate their contributions to tonal contrasts in Mandarin. Simplified stimuli with all possible combinations of these three cues were presented for identification to eight normal-hearing listeners, all native speakers of Mandarin from Taiwan. The f0 information was conveyed either by an f0-controlled sawtooth carrier or by a modulated noise, so as to compare the performance achievable with a clear indication of voice f0 against what is possible with purely temporal coding of f0. Tone recognition performance with explicit f0 was much better than that with any combination of other acoustic cues (consistently greater than 90% correct compared to 33%-65%; chance is 25%). In the absence of explicit f0, the temporal coding of f0 and amplitude envelope both contributed somewhat to tone recognition, while duration had only a marginal effect. Performance based on these secondary cues varied greatly across listeners. These results explain the relatively poor perception of tone in cochlear implant users, given that cochlear implants currently provide only weak cues to f0, so that users must rely upon the purely temporal (and secondary) features for the perception of tone. (c) 2008 Acoustical Society of America.
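
    A minimal sketch of how the two carrier types described above could be generated with NumPy; the rising f0 contour, sampling rate, and duration are invented for illustration rather than taken from the study's stimuli.

    import numpy as np

    sr = 16000
    dur = 0.4
    t = np.arange(int(sr * dur)) / sr

    # Hypothetical rising-tone f0 contour (Hz), e.g. Mandarin Tone 2-like.
    f0 = np.linspace(120.0, 220.0, t.size)
    phase = 2 * np.pi * np.cumsum(f0) / sr

    # Explicit-f0 carrier: sawtooth whose period follows the f0 contour.
    sawtooth = 2.0 * ((phase / (2 * np.pi)) % 1.0) - 1.0

    # Temporal-coding carrier: broadband noise amplitude-modulated at the f0 rate,
    # so only periodicity cues (not resolved harmonics) convey the contour.
    rng = np.random.default_rng(0)
    noise_carrier = rng.standard_normal(t.size) * 0.5 * (1.0 + np.sin(phase))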

    Listeners normalize speech for contextual speech rate even without an explicit recognition task

    Speech can be produced at different rates. Listeners take this rate variation into account by normalizing vowel duration for contextual speech rate: an ambiguous Dutch word /m?t/ is perceived as short /mAt/ when embedded in a slow context, but long /ma:t/ in a fast context. Whilst some have argued that this rate normalization involves low-level automatic perceptual processing, there is also evidence that it arises at higher-level cognitive processing stages, such as decision making. Prior research on rate-dependent speech perception has only used explicit recognition tasks to investigate the phenomenon, involving both perceptual processing and decision making. This study tested whether speech rate normalization can be observed without explicit decision making, using a cross-modal repetition priming paradigm. Results show that a fast precursor sentence makes an embedded ambiguous prime (/m?t/) sound (implicitly) more /a:/-like, facilitating lexical access to the long target word "maat" in an (explicit) lexical decision task. This result suggests that rate normalization is automatic, taking place even in the absence of an explicit recognition task. Thus, rate normalization is placed within the realm of everyday spoken conversation, where explicit categorization of ambiguous sounds is rare.
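
    Rate manipulations of a precursor sentence like those implied here are often done with pitch-preserving overlap-add resynthesis, for example Praat's "Lengthen (overlap-add)" command via the parselmouth Python wrapper. The sketch below is a rough illustration; the file names and duration factors are assumptions, not the study's actual values.

    import parselmouth
    from parselmouth.praat import call

    precursor = parselmouth.Sound("precursor_sentence.wav")   # hypothetical file

    # Duration factors < 1 compress the sentence (fast context), > 1 expand it
    # (slow context); overlap-add resynthesis keeps the pitch contour intact.
    fast = call(precursor, "Lengthen (overlap-add)", 75, 600, 0.65)
    slow = call(precursor, "Lengthen (overlap-add)", 75, 600, 1.55)

    call(fast, "Save as WAV file", "precursor_fast.wav")
    call(slow, "Save as WAV file", "precursor_slow.wav")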