Search CORE

10 research outputs found

VOCE Corpus: Ecologically Collected Speech Annotated with Physiological and Psychological Stress Assessments.

Author: Abrudan T
Aguiar A
Almeida PR
Cunha M
Kaiseler M
Meinedo H
Silva J
Publication venue
Publication date: 01/01/2014
Field of study

Public speaking is a widely requested professional skill, and at the same time an activity that causes one of the most common adult phobias (Miller and Stone, 2009). It is also known that the study of stress under laboratory conditions, as it is most commonly done, may provide only limited ecological validity (Wilhelm and Grossman, 2010). Previously, we introduced an inter-disciplinary methodology to enable collecting a large amount of recordings under consistent conditions (Aguiar et al., 2013). This paper introduces the VOCE corpus of speech annotated with stress indicators under naturalistic public speaking (PS) settings. The novelty of this corpus is that the recordings are carried out in objectively stressful PS situations, as recommended in (Zanstra and Johnston, 2011). The current database contains a total of 38 recordings, 13 of which contain full psychological and physiologic annotation. We show that the collected recordings validate the assumptions of the methodology, namely that participants experience stress during the PS events. We describe the various metrics that can be used for physiologic and psychological annotation, and we characterise the sample collected so far, providing evidence that demographics do not affect the relevant psychological or physiologic annotation. The collection activities are on-going, and we expect to increase the number of complete recordings in the corpus to 30 by June 2014

Repositório Aberto da Universidade do Porto

Leeds Beckett Repository

Multilingual speech recognition for the elderly: The AALFred personal life assistant

Author: Almeida A. M. C.
Dias J.
Fegyó T.
Hämäläinen A.
Meinedo H.
Teixeira A.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2015
Field of study

The PaeLife project is a European industry-academia collaboration in the framework of the Ambient Assisted Living Joint Programme (AAL JP), with a goal of developing a multimodal, multilingual virtual personal life assistant to help senior citizens remain active and socially integrated. Speech is one of the key interaction modalities of AALFred, the Windows application developed in the project; the application can be controlled using speech input in four European languages: French, Hungarian, Polish and Portuguese. This paper briefly presents the personal life assistant and then focuses on the speech-related achievements of the project. These include the collection, transcription and annotation of large corpora of elderly speech, the development of automatic speech recognisers optimised for elderly speakers, a speech modality component that can easily be reused in other applications, and an automatic grammar translation service that allows for fast expansion of the automatic speech recognition functionality to new languages.info:eu-repo/semantics/publishedVersio

Repositório Institucional do ISCTE-IUL

Concluding Remarks on Multi-band and Multi-stream Research for Noise-Robust ASR

Author: A. Hagen
A. Hagen
A.K. Haberstadt
H. Christensen
H. Meinedo
M. Cooke
R.A. Cole
S. Dupont
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Design of a Multimodal Input Interface for a Dialogue System

Author: H. Meinedo
K. Waters
M. Mourão
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

Crossref

A Prototype System for Selective Dissemination of Broadcast News in European Portuguese

Author: D. Caseiro
H. Meinedo
I. Trancoso
J. Neto
R. Amaral
Publication venue: SpringerOpen
Publication date: 01/01/2007
Field of study

This paper describes ongoing work on selective dissemination of broadcast news. Our pipeline system includes several modules: audio preprocessing, speech recognition, and topic segmentation and indexation. The main goal of this work is to study the impact of earlier errors in the last modules. The impact of audio preprocessing errors is quite small on the speech recognition module, but quite significant in terms of topic segmentation. On the other hand, the impact of speech recognition errors on the topic segmentation and indexation modules is almost negligible. The diagnostic of the errors in these modules is a very important step for the improvement of the prototype of a media watch system described in this paper

Springer - Publisher Connector

Directory of Open Access Journals

Age and gender detection in the I-DASH project

Author: Batliner A.
Bugalho M.
Burkhardt F.
Eyben F.
Hermansky H.
Hugo Meinedo
Isabel Trancoso
Lee S.
Ma J.
Meinedo H.
Potamianos A.
Russell M.
Schuller B.
Wang L.
Wu Y.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Hot Spring Water Traced Back to Fluid Released from a Dehydrating Slab

Author: A. Hämäläinen
A. Potamianos
E.F. Strommen
F. Weninger
G. Dobry
H. Meinedo
J.E. Huber
M. Hall
S. Lee
S. Narayanan
S. Takahashi
S. Xue
S.E. Linville
S.S. Keerthi
Publication venue: 日本温泉科学会
Publication date: 01/01/2014
Field of study

日本温泉科学会第71回大会、特別講演ⅡSince 2003, we have begun a geochemical research for hot spring derived from slabdehydrated fluid and have published results of the research as papers from 2005. In addition, we were working together with researchers in metamorphic petrology and the effort contributed to development of a research theme in the interdisciplinary field and fostering of younger researchers. In a special lecture at the 67th annual meeting of the Japanese Society of Hot Spring Sciences held in September 2018, the author spoke overview our research, and furthermore, he presented two related sub-research subjects which had been introduced by previous oral presentations at several academic conferences. In this paper, the results of the sub-researches including new findings obtained during the preparation process of this oral presentation will be written down. 私たちは 2003年よりスラプ脱水流体由来の温泉の地球流体化学的探索を始め，その研究成果を論文等として公表し (2005年-2016年）．その一方で，変成岩岩石学との分野横断研究を行って新たな研究課題の創始と若手研究者の育成にも貢献した. 2018年 9月に開催された日本温泉科学会第 67回大会における特別講演において．これまでの私たちの研究の概要を紹介するとともに．学会での口頭発表にとどまっている関連の 2つのサプ研究課題についても発表した．この論文には．今回の発表の準備過程で手にした新たな知見を含むそのサプ研究の成果を書き留める

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

Repositório Institucional do ISCTE-IUL

Open Archive Toulouse Archive Ouverte

Kyoto University Research Information Repository

Demo. Video scene segmentation system using audio-visual features

Author: Bugalho M. (author)
Kompatsiaris I. (author)
Meinedo H. (author)
Mezaris V. (author)
Sidiropoulos P. (author)
Trancoso I. (author)
Publication venue
Publication date: 13/04/2011
Field of study

This work demonstrates a new approach to video temporal segmentation into scenes. The utilized technique is based on an audio-visual extension of the well-known method of the Scene Transition Graph (STG). This multi-modal extension exploits both low- and high-level audio-visual descriptors to construct distinct STGs. These STGs are employed into a probabilistic framework that is used for estimating a confidence value on each shot boundary also being a scene boundary. Finally, the thresholding of these confidence values generates the set of experimentally estimated scene boundaries. In this demo both the scene segmentation outcome and some intermediate features that lead to it are demonstrated

TU Delft Repository

Investigation of Speaker Group-Dependent Modelling for Recognition of Affective States from Speech

Author: A Batliner
A Batliner
A Viterbi
Andreas Wendemuth
B Atal
B Schuller
B Schuller
B Schuller
B Schuller
D Massaro
David Philippou-Hübner
DL Olson
E Dmitrieva
EM Albornoz
F Schwenker
H Hermansky
H Meinedo
I Shahin
Ingo Siegert
J Cohen
J Gross
J Veth de
JD Morris
K McRae
K Rao
K Scherer
K Scherer
Kim Hartmann
KP Truong
L Lee
L Lee
L Rabiner
LD Butler
LK Lipovčan
M Hall
M Kockmann
M Li
M Suzuki
MW Lee
P Ruvolo
R Cowie
R Plutchik
Ronald Böck
S Davis
S Steidl
S Zhang
T Kinnunen
W Wundt
Z Zeng
Z Zeng
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref