Spectral Characteristics of Schwa in Czech Accented English
The English central mid lax vowel (i.e., schwa) often contributes considerably to the sound differences between native and non-native speech. Many foreign speakers of English fail to reduce certain underlying vowels to schwa, which, on the suprasegmental level of description, affects the perceived rhythm of their speech. However, the problem of capturing quantitatively the differences between native and non-native schwa poses difficulties that, to this day, have been tackled only partially. We offer a technique of measurement in the acoustic domain that has not been probed properly as yet: the distribution of acoustic energy in the vowel spectrum. Our results show that spectral slope features measured in weak vowels discriminate between Czech and British speakers of English quite reliably. Moreover, the measurements of formant bandwidths turned out to be useful for the same task, albeit less directly.
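The spectral-slope measure described above amounts to fitting a straight line to the log-magnitude spectrum of a vowel segment. A minimal NumPy sketch of that idea (the band limits, windowing, and dB-per-kHz units are our assumptions, not the paper's exact settings):

```python
import numpy as np

def spectral_slope(segment: np.ndarray, sr: int, fmin: float = 50.0,
                   fmax: float = 4000.0) -> float:
    """Slope (dB/kHz) of a line fitted to the log-magnitude spectrum
    of a windowed vowel segment, restricted to the band [fmin, fmax]."""
    spectrum = np.abs(np.fft.rfft(segment * np.hanning(len(segment))))
    freqs = np.fft.rfftfreq(len(segment), d=1.0 / sr)
    band = (freqs >= fmin) & (freqs <= fmax)
    mags_db = 20.0 * np.log10(spectrum[band] + 1e-12)
    # least-squares line through (frequency in kHz, magnitude in dB)
    slope, _ = np.polyfit(freqs[band] / 1000.0, mags_db, 1)
    return float(slope)
```

A steeper (more negative) slope indicates energy concentrated at low frequencies, as expected for a weakly articulated, reduced vowel.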
Towards dialect-inclusive recognition in a low-resource language: are balanced corpora the answer?
ASR systems are generally built for the spoken 'standard', and their performance declines for non-standard dialects/varieties. This is a problem for a language like Irish, where there is no single spoken standard, but rather three major dialects: Ulster (Ul), Connacht (Co) and Munster (Mu). As a diagnostic to quantify the effect of the speaker's dialect on recognition performance, 12 ASR systems were trained, firstly using baseline dialect-balanced training corpora, and then using modified versions of the baseline corpora, where dialect-specific materials were either subtracted or added. Results indicate that dialect-balanced corpora do not yield similar performance across the dialects: the Ul dialect consistently underperforms, whereas Mu yields the lowest WERs. There is a close relationship between the Co and Mu dialects, but one that is not symmetrical. These results will guide future corpus collection and system-building strategies to optimise for cross-dialect performance equity.
Comment: Accepted to Interspeech 2023, Dublin
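The WER figures used to compare dialects are computed as word-level edit distance normalised by reference length. A minimal reference implementation (not the authors' scoring tool):

```python
import numpy as np

def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: (substitutions + deletions + insertions) / #reference
    words, via dynamic-programming edit distance over word sequences."""
    ref, hyp = reference.split(), hypothesis.split()
    d = np.zeros((len(ref) + 1, len(hyp) + 1), dtype=int)
    d[:, 0] = np.arange(len(ref) + 1)   # deleting all reference words
    d[0, :] = np.arange(len(hyp) + 1)   # inserting all hypothesis words
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i, j] = min(d[i - 1, j] + 1,          # deletion
                          d[i, j - 1] + 1,          # insertion
                          d[i - 1, j - 1] + cost)   # substitution / match
    return d[len(ref), len(hyp)] / max(len(ref), 1)
```

For example, `wer("a b c", "a x c")` gives one substitution over three reference words, i.e. 0.33.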
Towards spoken dialect identification of Irish
The Irish language is rich in its diversity of dialects and accents. This compounds the difficulty of creating a speech recognition system for the low-resource language, as such a system must contend with a high degree of variability with limited corpora. A recent study investigating dialect bias in Irish ASR found that balanced training corpora gave rise to unequal dialect performance, with performance for the Ulster dialect being consistently worse than for the Connacht or Munster dialects. Motivated by this, the present experiments investigate spoken dialect identification of Irish, with a view to incorporating such a system into the speech recognition pipeline. Two acoustic classification models are tested, XLS-R and ECAPA-TDNN, in conjunction with a text-based classifier using a pretrained Irish-language BERT model. The ECAPA-TDNN, particularly a model pretrained for language identification on the VoxLingua107 dataset, performed best overall, with an accuracy of 73%. This was further improved to 76% by fusing the model's outputs with those of the text-based model. The Ulster dialect was identified most accurately, at 94%; however, the model struggled to disambiguate between the Connacht and Munster dialects, suggesting that a more nuanced approach may be necessary to robustly distinguish between the dialects of Irish.
Comment: Accepted to the Interspeech 2023 Workshop of the 2nd Annual Meeting of the Special Interest Group of Under-resourced Languages (SIGUL), Dublin
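The abstract does not specify how the acoustic and text classifiers' outputs are fused; a common baseline is a weighted average of the two posterior distributions (late fusion). A hypothetical sketch of that scheme, with function name and weight chosen by us:

```python
import numpy as np

DIALECTS = ["Ulster", "Connacht", "Munster"]

def fuse(acoustic_probs, text_probs, weight: float = 0.5):
    """Late fusion: weighted average of the acoustic and text classifiers'
    posterior distributions. Returns the fused distribution and the
    highest-scoring dialect label."""
    fused = weight * np.asarray(acoustic_probs, dtype=float) \
        + (1.0 - weight) * np.asarray(text_probs, dtype=float)
    fused /= fused.sum()  # renormalise to a proper distribution
    return fused, DIALECTS[int(np.argmax(fused))]
```

In practice the weight would be tuned on a held-out set; equal weighting is just the simplest starting point.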
HMM-based synthesis of creaky voice
Creaky voice, also referred to as vocal fry, is a voice quality frequently produced in many languages, in both read and conversational speech. To enhance naturalness, speech synthesis systems should be able to generate speech in all its expressive diversity, including creaky voice. The present study looks to integrate our recent developments, including creaky voice detection, prediction of creaky voice from context, and rendering of the creaky excitation, into a fully functioning and automatic HMM-based synthesis system. HMM-based synthetic creaky voices are built and evaluated in subjective listening tests, which show that the best synthetic creaky voices are rated as more natural and more creaky than a conventional voice. A non-creaky voice is also successfully transformed to use creak by modifying the F0 contour and excitation of the predicted creaky parts. The transformed voice is rated as equally natural and clearly more creaky than the original voice.
Index Terms: speech synthesis, creaky voice, contextual factors, F0 estimation, excitation modeling
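The transformation of a non-creaky voice involves modifying the F0 contour in the frames predicted as creaky (creak is typically produced at very low F0). The abstract does not give the exact modification rule, so the drop factor below is purely illustrative:

```python
import numpy as np

def apply_creak_f0(f0: np.ndarray, creaky_mask: np.ndarray,
                   drop_factor: float = 0.5) -> np.ndarray:
    """Lower the frame-level F0 contour in frames flagged as creaky.
    The excitation change would be handled separately in the vocoder;
    the 0.5 drop factor is an illustrative assumption, not the paper's value."""
    out = f0.copy()                 # leave the input contour untouched
    out[creaky_mask] *= drop_factor
    return out
```

A real system would also smooth the transitions at creaky-region boundaries to avoid audible F0 jumps.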
Pitch Patterns in Vocal Expression of “Happiness” and “Sadness” in the Reading Aloud of Prose on the Basis of Selected Audiobooks
The primary focus of this paper is to examine the way the emotional categories of “happiness” and “sadness” are expressed vocally in the reading aloud of prose. In particular, the two semantic categories were analysed in terms of pitch level and pitch variability on a corpus based on 28 works written by Charles Dickens. Passages with the intended emotional colouring were selected, and the corresponding fragments were located in the audiobooks. They were then analysed acoustically in terms of the mean F0 and the standard deviation of F0. The results for individual emotional passages were compared with a particular reader’s mean pitch and standard deviation of pitch. The differences obtained in this way supported the initial assumptions that the pitch level and its standard deviation would rise in “happy” extracts but fall in “sad” ones. Nevertheless, not all of these tendencies could be statistically validated, so additional examples taken from a selection of novels by other nineteenth-century writers were added. The statistical analysis of the larger samples confirmed the assumed tendencies but also indicated that the two semantic domains may utilise the acoustic parameters under discussion to varying degrees. While “happiness” tends to be signalled primarily by raising F0, “sadness” is communicated mostly by lowering the variability of F0. Changes in the variability of F0 seem to be of less importance in the former case, and shifts in the F0 level less significant in the latter.
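The comparison described above, of each passage against the reader's overall pitch, can be sketched as simple speaker-relative differences of mean F0 and F0 standard deviation (the function name is ours, not the paper's):

```python
import numpy as np

def speaker_relative_pitch(passage_f0: np.ndarray, baseline_f0: np.ndarray):
    """Difference of a passage's mean F0 and F0 standard deviation from the
    reader's overall baseline. Positive values mean the passage is higher
    pitched / more variable than the reader's norm."""
    return (float(np.mean(passage_f0) - np.mean(baseline_f0)),
            float(np.std(passage_f0) - np.std(baseline_f0)))
```

Under the paper's hypothesis, “happy” passages should yield positive differences on both measures and “sad” passages negative ones.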
Lack of mutations of exon 2 of the MEN1 gene in endocrine and nonendocrine sporadic tumors
Laugh Like You Mean It: Authenticity Modulates Acoustic, Physiological and Perceptual Properties of Laughter
Several authors have recently presented evidence for perceptual and neural distinctions between genuine and acted expressions of emotion. Here, we describe how differences in authenticity affect the acoustic and perceptual properties of laughter. In an acoustic analysis, we contrasted spontaneous, authentic laughter with volitional, fake laughter, finding that spontaneous laughter was higher in pitch, longer in duration, and had different spectral characteristics from volitional laughter produced under full voluntary control. In a behavioral experiment, listeners perceived spontaneous and volitional laughter as distinct in arousal, valence, and authenticity. Multiple regression analyses further revealed that acoustic measures could significantly predict these affective and authenticity judgements, with the notable exception of authenticity ratings for spontaneous laughter. The combination of acoustic predictors differed according to the laughter type, where volitional laughter ratings were uniquely predicted by harmonics-to-noise ratio (HNR). To better understand the role of HNR in terms of the physiological effects on vocal tract configuration as a function of authenticity during laughter production, we ran an additional experiment in which phonetically trained listeners rated each laugh for breathiness, nasality, and mouth opening. Volitional laughter was found to be significantly more nasal than spontaneous laughter, and the item-wise physiological ratings also significantly predicted affective judgements obtained in the first experiment. Our findings suggest that, as an alternative to traditional acoustic measures, ratings of phonatory and articulatory features can be useful descriptors of the acoustic qualities of nonverbal emotional vocalizations, and of their perceptual implications.
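The multiple regressions predicting listener ratings from acoustic measures amount to ordinary least squares with an intercept. A minimal sketch (the predictor set, e.g. pitch, duration, HNR, is illustrative; the paper's actual model specification is not given in the abstract):

```python
import numpy as np

def fit_ratings(acoustic: np.ndarray, ratings: np.ndarray) -> np.ndarray:
    """Ordinary least squares: predict perceptual ratings (one per item) from
    item-wise acoustic measures. Returns [intercept, coef_1, ..., coef_k]."""
    X = np.column_stack([np.ones(len(acoustic)), acoustic])  # add intercept
    coefs, *_ = np.linalg.lstsq(X, ratings, rcond=None)
    return coefs
```

With more predictors than the toy case, one would also inspect fit statistics and per-coefficient significance, as the multiple-regression analyses in the study do.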