
    Perception of Alcoholic Intoxication in Speech

    The ALC sub-challenge of the Interspeech Speaker State Challenge (ISSC) aims at the automatic classification of speech signals into intoxicated and sober speech. In this context we conducted a perception experiment on data derived from the same corpus to analyze human performance on the same task. The results show that humans still outperform comparable baseline results of the ISSC. Female and male listeners perform at the same level, but there is strong evidence that intoxication in female voices is easier to recognize than in male voices. Prosodic features contribute to the decision of human listeners but do not seem to be dominant. In analogy to Doddington’s zoo of speaker verification, we find some evidence for the existence of lambs and goats but no wolves. Index Terms: alcoholic intoxication, speech perception, forced choice, intonation, Alcohol Language Corpus

    Resolving the ancestry of Austronesian-speaking populations

    There are two very different interpretations of the prehistory of Island Southeast Asia (ISEA), with genetic evidence invoked in support of both. The “out-of-Taiwan” model proposes a major Late Holocene expansion of Neolithic Austronesian speakers from Taiwan. An alternative, proposing that Late Glacial/postglacial sea-level rises triggered largely autochthonous dispersals, accounts for some otherwise enigmatic genetic patterns, but fails to explain the Austronesian language dispersal. Combining mitochondrial DNA (mtDNA), Y-chromosome and genome-wide data, we performed the most comprehensive analysis of the region to date, obtaining highly consistent results across all three systems and allowing us to reconcile the models. We infer a primarily common ancestry for Taiwan/ISEA populations established before the Neolithic, but we also detected clear signals of two minor Late Holocene migrations, probably representing Neolithic input from both Mainland Southeast Asia and South China, via Taiwan. The latter may therefore have mediated the Austronesian language dispersal, implying small-scale migration and language shift rather than large-scale expansion

    A language-familiarity effect for speaker discrimination without comprehension

    The influence of language familiarity upon speaker identification is well established, to such an extent that it has been argued that “Human voice recognition depends on language ability” [Perrachione TK, Del Tufo SN, Gabrieli JDE (2011) Science 333(6042):595]. However, 7-mo-old infants discriminate speakers of their mother tongue better than they do foreign speakers [Johnson EK, Westrek E, Nazzi T, Cutler A (2011) Dev Sci 14(5):1002–1011] despite their limited speech comprehension abilities, suggesting that speaker discrimination may rely on familiarity with the sound structure of one’s native language rather than the ability to comprehend speech. To test this hypothesis, we asked Chinese and English adult participants to rate speaker dissimilarity in pairs of sentences in English or Mandarin that were first time-reversed to render them unintelligible. Even in these conditions a language-familiarity effect was observed: Both Chinese and English listeners rated pairs of native-language speakers as more dissimilar than foreign-language speakers, despite their inability to understand the material. Our data indicate that the language familiarity effect is not based on comprehension but rather on familiarity with the phonology of one’s native language. This effect may stem from a mechanism analogous to the “other-race” effect in face recognition

    Asymmetric discrimination of non-speech tonal analogues of vowels

    Published in final edited form as: J Exp Psychol Hum Percept Perform. 2019 February; 45(2): 285–300. doi:10.1037/xhp0000603.
    Directional asymmetries reveal a universal bias in vowel perception favoring extreme vocalic articulations, which lead to acoustic vowel signals with dynamic formant trajectories and well-defined spectral prominences due to the convergence of adjacent formants. The present experiments investigated whether this bias reflects speech-specific processes or general properties of spectral processing in the auditory system. Toward this end, we examined whether analogous asymmetries in perception arise with non-speech tonal analogues that approximate some of the dynamic and static spectral characteristics of naturally-produced /u/ vowels executed with more versus less extreme lip gestures. We found a qualitatively similar but weaker directional effect with two-component tones varying in both the dynamic changes and proximity of their spectral energies. In subsequent experiments, we pinned down the phenomenon using tones that varied in one or both of these two acoustic characteristics. We found comparable asymmetries with tones that differed exclusively in their spectral dynamics, and no asymmetries with tones that differed exclusively in their spectral proximity or both spectral features. We interpret these findings as evidence that dynamic spectral changes are a critical cue for eliciting asymmetries in non-speech tone perception, but that the potential contribution of general auditory processes to asymmetries in vowel perception is limited.

    Language identification with suprasegmental cues: A study based on speech resynthesis

    This paper proposes a new experimental paradigm to explore the discriminability of languages, a question which is crucial to the child born in a bilingual environment. This paradigm employs the speech resynthesis technique, enabling the experimenter to preserve or degrade acoustic cues such as phonotactics, syllabic rhythm or intonation from natural utterances. English and Japanese sentences were resynthesized, preserving broad phonotactics, rhythm and intonation (Condition 1), rhythm and intonation (Condition 2), intonation only (Condition 3), or rhythm only (Condition 4). The findings support the notion that syllabic rhythm is a necessary and sufficient cue for French adult subjects to discriminate English from Japanese sentences. The results are consistent with previous research using low-pass filtered speech, as well as with phonological theories predicting rhythmic differences between languages. Thus, the new methodology proposed appears to be well-suited to study language discrimination. Applications for other domains of psycholinguistic research and for automatic language identification are considered

    How do you say ‘hello’? Personality impressions from brief novel voices

    On hearing a novel voice, listeners readily form personality impressions of that speaker. Accurate or not, these impressions are known to affect subsequent interactions; yet the underlying psychological and acoustical bases remain poorly understood. Furthermore, studies have hitherto focussed on extended speech as opposed to analysing the instantaneous impressions we obtain from first experience. In this paper, through a mass online rating experiment, 320 participants rated 64 sub-second vocal utterances of the word ‘hello’ on one of 10 personality traits. We show that: (1) personality judgements of brief utterances from unfamiliar speakers are consistent across listeners; (2) a two-dimensional ‘social voice space’ with axes mapping Valence (Trust, Likeability) and Dominance, each driven by differing combinations of vocal acoustics, adequately summarises ratings in both male and female voices; and (3) a positive combination of Valence and Dominance results in increased perceived male vocal Attractiveness, whereas perceived female vocal Attractiveness is largely controlled by increasing Valence. Results are discussed in relation to the rapid evaluation of personality and, in turn, the intent of others, as being driven by survival mechanisms via approach or avoidance behaviours. These findings provide empirical bases for predicting personality impressions from acoustical analyses of short utterances and for generating desired personality impressions in artificial voices

    "Il parle normal, il parle comme nous”: self-reported usage and attitudes in a banlieue

    We report on a survey of language attitudes carried out as part of a project comparing youth language in Paris and London. As in similar studies carried out in London (Cheshire et al. 2008), Berlin (Wiese 2009) and elsewhere (Boyd et al. 2015), the focus was on features considered typical of ‘contemporary urban vernaculars’ (Rampton 2015). The respondents were pupils aged 15–18 in two secondary schools in a working-class northern suburb of Paris. The survey included (1) a written questionnaire containing examples of features potentially undergoing change in contemporary French; and (2) an analysis of reactions to extracts from the project data, in which participants were asked to comment on the speakers and the features identified. Quantitative analysis had shown that some of these features are more widespread than others and are used by certain categories of speaker more than others (Gardner-Chloros and Secova, 2018). This study provides a qualitative dimension, showing that different features have different degrees of perceptual salience and acceptability. It demonstrates that youth varieties do not involve characteristic features being used as a ‘package’, and that such changes interact in a complex manner with attitudinal factors. The study also provides material for reflection on the role of attitude studies within sociolinguistic surveys

    Speaker-normalized sound representations in the human auditory cortex

    The acoustic dimensions that distinguish speech sounds (like the vowel differences in “boot” and “boat”) also differentiate speakers’ voices. Therefore, listeners must normalize across speakers without losing linguistic information. Past behavioral work suggests an important role for auditory contrast enhancement in normalization: preceding context affects listeners’ perception of subsequent speech sounds. Here, using intracranial electrocorticography in humans, we investigate whether and how such context effects arise in auditory cortex. Participants identified speech sounds that were preceded by phrases from two different speakers whose voices differed along the same acoustic dimension as target words (the lowest resonance of the vocal tract). In every participant, target vowels evoke a speaker-dependent neural response that is consistent with the listener’s perception, and which follows from a contrast enhancement model. Auditory cortex processing thus displays a critical feature of normalization, allowing listeners to extract meaningful content from the voices of diverse speakers