17 research outputs found

    Analyzing Nonverbal Listener Responses using Parallel Recordings of Multiple Listeners

    In this paper we study nonverbal listener responses in a corpus with multiple listeners recorded in parallel. These listeners were led to believe that they were the sole listener, while in fact three persons were listening to the same speaker. The speaker could only see one of the listeners. We analyze the impact of this particular setup on the behavior and perception of the two types of listeners: those that could be seen by the speaker and those that could not. Furthermore, we compare the nonverbal listening behaviors of these three listeners to each other with regard to timing and form, and we correlate these behaviors with behaviors of the speaker, such as pauses and whether the speaker is looking at the listeners or not.

    Continuous Interaction with a Virtual Human

    Attentive Speaking and Active Listening require that a Virtual Human be capable of simultaneous perception/interpretation and production of communicative behavior. A Virtual Human should be able to signal its attitude and attention while it is listening to its interaction partner, and be able to attend to its interaction partner while it is speaking, and modify its communicative behavior on the fly based on what it perceives from its partner. This report presents the results of a four-week summer project that was part of eNTERFACE’10. The project resulted in progress on several aspects of continuous interaction, such as scheduling and interrupting multimodal behavior, automatic classification of listener responses, generation of response-eliciting behavior, and models for appropriate reactions to listener responses. A pilot user study was conducted with ten participants. In addition, the project yielded a number of deliverables that are released for public access.

    Observations on listener responses from multiple perspectives

    Proceedings of the 3rd Nordic Symposium on Multimodal Communication. Editors: Patrizia Paggio, Elisabeth Ahlsén, Jens Allwood, Kristiina Jokinen, Costanza Navarretta. NEALT Proceedings Series, Vol. 15 (2011), 48–55. © 2011 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/22532

    Differences in Listener Responses between Procedural and Narrative Tasks

    In the long tradition of corpus-based research on listener behavior, whether it entails linguistic analysis or social signal processing, many different tasks have been used during the recording of corpora. So far, no study has treated the task given to the participants as an independent variable, and no studies have looked into the effect of this variable on listener responses. In this paper we present the results of our comparison between listening behavior elicited by the procedural and narrative tasks used during the recording of our MultiLis corpus. We will show that listeners in the procedural tasks show more agreement in their responses than listeners in the narrative tasks. Furthermore, we will show that the long procedural task elicits more responses per minute than the short procedural task. We will reflect on these results in light of cognitive load and grounding theory.

    Speaker-adaptive multimodal prediction model for listener responses

    The goal of this paper is to analyze and model the variability in speaking styles in dyadic interactions and build a predictive algorithm for listener responses that is able to adapt to these different styles. The end result of this research will be a virtual human able to automatically respond to a human speaker with proper listener responses (e.g., head nods). Our novel speaker-adaptive prediction model is created from a corpus of dyadic interactions where speaker variability is analyzed to identify a subset of prototypical speaker styles. During a live interaction, our prediction model automatically identifies the closest prototypical speaker style and predicts listener responses based on this "communicative style". Central to our approach is the idea of a "speaker profile", which uniquely identifies each speaker and enables the matching between prototypical speakers and new speakers. The paper shows the merits of our speaker-adaptive listener response prediction model by showing improvement over a state-of-the-art approach that does not adapt to the speaker. Besides the merits of speaker adaptation, our experiments highlight the importance of using multimodal features when comparing speakers to select the closest prototypical speaker style.
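
    The following is a minimal sketch of the speaker-adaptive idea described in this abstract, not the authors' implementation: training speakers are clustered into prototypical styles via their "speaker profiles", and a new speaker is routed to the predictor of the closest prototype. The feature names, cluster count, and classifiers are illustrative assumptions.

```python
# Hypothetical sketch of prototype-based speaker adaptation (illustrative only).
import numpy as np
from sklearn.cluster import KMeans
from sklearn.linear_model import LogisticRegression

np.random.seed(0)

# Assumed speaker profiles: one multimodal feature vector per training speaker
# (e.g. pause rate, pitch variation, gaze-at-listener ratio).
profiles = np.array([[0.8, 0.3, 0.6],
                     [0.2, 0.7, 0.4],
                     [0.7, 0.4, 0.5],
                     [0.1, 0.8, 0.3]])

# Step 1: identify a small set of prototypical speaker styles.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(profiles)

# Step 2: train one listener-response predictor per prototypical style
# (per-frame multimodal features -> backchannel yes/no), here on toy data.
predictors = {}
for style in range(kmeans.n_clusters):
    X_toy = np.random.rand(20, 3)            # stand-in per-frame features
    y_toy = (X_toy[:, 0] > 0.5).astype(int)  # stand-in backchannel labels
    predictors[style] = LogisticRegression().fit(X_toy, y_toy)

# Step 3: during a live interaction, match the new speaker's profile to the
# closest prototypical style and use that style's predictor.
new_profile = np.array([[0.75, 0.35, 0.55]])
style = int(kmeans.predict(new_profile)[0])
frame_features = np.array([[0.6, 0.2, 0.7]])
print("predicted listener response:", predictors[style].predict(frame_features)[0])
```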

    Iterative Perceptual Learning for Social Behavior Synthesis

    We introduce Iterative Perceptual Learning (IPL), a novel approach for learning computational models for social behavior synthesis from corpora of human-human interactions. The IPL approach combines perceptual evaluation with iterative model refinement. Human observers rate the appropriateness of synthesized individual behaviors in the context of a conversation. These ratings are in turn used to refine the machine learning models. As the ratings correspond to those moments in the conversation where the production of a specific social behavior is inappropriate, we can regard features extracted at these moments as negative samples for the training of a machine learning classifier. This is an advantage over traditional corpus-based approaches, in which negative samples are extracted at random from moments in the conversation where the specific social behavior does not occur. We perform a comparison between the IPL approach and the traditional corpus-based approach on the timing of backchannels for a listener in speaker-listener dialogs. While both models perform similarly in terms of precision and recall scores, the results of the IPL model are rated as more appropriate in the perceptual evaluation. We additionally investigate the effect of the amount of available training data and the variation of training data on the outcome of the models.
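
    Below is a minimal sketch, under stated assumptions, of the IPL loop this abstract describes: moments whose synthesized backchannels are rated inappropriate become negative training samples for the next iteration. The rate_appropriateness() function stands in for the human perceptual evaluation, and all data and model choices are purely illustrative.

```python
# Illustrative sketch of an Iterative Perceptual Learning loop (not the paper's code).
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)

def rate_appropriateness(moment_features):
    """Placeholder for a human observer rating (1 = appropriate, 0 = inappropriate)."""
    return int(rng.random() > 0.4)

# Positive samples: features at moments where listeners actually backchanneled.
X_pos = rng.random((50, 4))
y_pos = np.ones(50, dtype=int)

# Initial negatives, here drawn at random as in the traditional corpus-based approach.
X_neg = rng.random((50, 4))
y_neg = np.zeros(50, dtype=int)

for iteration in range(3):
    clf = RandomForestClassifier(n_estimators=50, random_state=0)
    clf.fit(np.vstack([X_pos, X_neg]), np.concatenate([y_pos, y_neg]))

    # Synthesize backchannels at the moments the current model predicts,
    # ask observers to rate them, and turn inappropriate ones into new negatives.
    candidates = rng.random((30, 4))
    predicted = candidates[clf.predict(candidates) == 1]
    new_negatives = np.array([m for m in predicted if rate_appropriateness(m) == 0])
    if len(new_negatives):
        X_neg = np.vstack([X_neg, new_negatives])
        y_neg = np.concatenate([y_neg, np.zeros(len(new_negatives), dtype=int)])
    print(f"iteration {iteration}: {len(new_negatives)} new negative samples")
```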

    Backchannel relevance spaces

    This contribution introduces backchannel relevance spaces – intervals where it is relevant for a listener in a conversation to produce a backchannel. By annotating and comparing actual visual and vocal backchannels with potential backchannels established using a group of subjects acting as third-party listeners, we show (i) that visual-only backchannels represent a substantial proportion of all backchannels; and (ii) that there are more opportunities for backchannels (i.e. potential backchannels or backchannel relevance spaces) than there are actual vocal and visual backchannels. These findings indicate that backchannel relevance spaces enable more accurate acoustic, prosodic, lexical (et cetera) descriptions of backchannel-inviting cues than descriptions based on the context of actual vocal backchannels only.
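
    As an illustrative sketch only (not the paper's annotation pipeline), the two quantities discussed above can be computed from simple interval annotations: the share of visual-only backchannels among the actual backchannels, and the surplus of backchannel relevance spaces over actual backchannels. The timestamps and labels here are invented toy data.

```python
# Toy computation of visual-only backchannels and unused relevance spaces.
actual_backchannels = [
    {"t": 3.2,  "vocal": True,  "visual": True},
    {"t": 7.9,  "vocal": False, "visual": True},   # visual-only
    {"t": 12.4, "vocal": True,  "visual": False},
]

# Relevance spaces: intervals where third-party listeners judged a backchannel
# to be relevant, whether or not the original listener produced one.
relevance_spaces = [(2.8, 3.6), (7.5, 8.3), (10.0, 10.9), (12.0, 12.8), (15.1, 15.9)]

visual_only = sum(1 for bc in actual_backchannels if bc["visual"] and not bc["vocal"])
print(f"visual-only backchannels: {visual_only}/{len(actual_backchannels)}")

def contains_backchannel(space):
    """True if any actual backchannel falls inside the relevance space."""
    start, end = space
    return any(start <= bc["t"] <= end for bc in actual_backchannels)

# Relevance spaces without an actual backchannel are unused opportunities.
unused = [s for s in relevance_spaces if not contains_backchannel(s)]
print(f"relevance spaces: {len(relevance_spaces)}, unused opportunities: {len(unused)}")
```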

    Proceedings

    Proceedings of the 3rd Nordic Symposium on Multimodal Communication. Editors: Patrizia Paggio, Elisabeth Ahlsén, Jens Allwood, Kristiina Jokinen, Costanza Navarretta. NEALT Proceedings Series, Vol. 15 (2011), vi+87 pp. © 2011 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/22532