Search CORE

5,215 research outputs found

Detecting Low Rapport During Natural Interactions in Small Groups from Non-Verbal Behaviour

Author: Bulling Andreas
Huang Michael Xuelin
Müller Philipp
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2018
Field of study

Rapport, the close and harmonious relationship in which interaction partners are "in sync" with each other, was shown to result in smoother social interactions, improved collaboration, and improved interpersonal outcomes. In this work, we are first to investigate automatic prediction of low rapport during natural interactions within small groups. This task is challenging given that rapport only manifests in subtle non-verbal signals that are, in addition, subject to influences of group dynamics as well as inter-personal idiosyncrasies. We record videos of unscripted discussions of three to four people using a multi-view camera system and microphones. We analyse a rich set of non-verbal signals for rapport detection, namely facial expressions, hand motion, gaze, speaker turns, and speech prosody. Using facial features, we can detect low rapport with an average precision of 0.7 (chance level at 0.25), while incorporating prior knowledge of participants' personalities can even achieve early prediction without a drop in performance. We further provide a detailed analysis of different feature sets and the amount of information contained in different temporal segments of the interactions.Comment: 12 pages, 6 figure

arXiv.org e-Print Archive

MPG.PuRe

Speech-driven Animation with Meaningful Behaviors

Author: Busso Carlos
Sadoughi Najmeh
Publication venue
Publication date: 04/08/2017
Field of study

Conversational agents (CAs) play an important role in human computer interaction. Creating believable movements for CAs is challenging, since the movements have to be meaningful and natural, reflecting the coupling between gestures and speech. Studies in the past have mainly relied on rule-based or data-driven approaches. Rule-based methods focus on creating meaningful behaviors conveying the underlying message, but the gestures cannot be easily synchronized with speech. Data-driven approaches, especially speech-driven models, can capture the relationship between speech and gestures. However, they create behaviors disregarding the meaning of the message. This study proposes to bridge the gap between these two approaches overcoming their limitations. The approach builds a dynamic Bayesian network (DBN), where a discrete variable is added to constrain the behaviors on the underlying constraint. The study implements and evaluates the approach with two constraints: discourse functions and prototypical behaviors. By constraining on the discourse functions (e.g., questions), the model learns the characteristic behaviors associated with a given discourse class learning the rules from the data. By constraining on prototypical behaviors (e.g., head nods), the approach can be embedded in a rule-based system as a behavior realizer creating trajectories that are timely synchronized with speech. The study proposes a DBN structure and a training approach that (1) models the cause-effect relationship between the constraint and the gestures, (2) initializes the state configuration models increasing the range of the generated behaviors, and (3) captures the differences in the behaviors across constraints by enforcing sparse transitions between shared and exclusive states per constraint. Objective and subjective evaluations demonstrate the benefits of the proposed approach over an unconstrained model.Comment: 13 pages, 12 figures, 5 table

arXiv.org e-Print Archive

Spotting Agreement and Disagreement: A Survey of Nonverbal Audiovisual Cues and Tools

Author: Bousmalis Konstantinos
Mehu Marc
Pantic Maja
Publication venue: IEEE Computer Society Press
Publication date: 01/01/2009
Field of study

While detecting and interpreting temporal patterns of non–verbal behavioral cues in a given context is a natural and often unconscious process for humans, it remains a rather difficult task for computer systems. Nevertheless, it is an important one to achieve if the goal is to realise a naturalistic communication between humans and machines. Machines that are able to sense social attitudes like agreement and disagreement and respond to them in a meaningful way are likely to be welcomed by users due to the more natural, efficient and human–centered interaction they are bound to experience. This paper surveys the nonverbal cues that could be present during agreement and disagreement behavioural displays and lists a number of tools that could be useful in detecting them, as well as a few publicly available databases that could be used to train these tools for analysis of spontaneous, audiovisual instances of agreement and disagreement

CiteSeerX

University of Twente Research Information

Recommended from our members

Always on my mind: Cross-brain associations of mental health symptoms during simultaneous parent-child scanning.

Author: Antonacci Chase
Aupperle Robin L
Bodurka Jerzy
Burrows Kaiping
Cosgrove Kelly T
DeVille Danielle C
Kerr Kara L
Misaki Masaya
Moore Andrew J
Morris Amanda Sheffield
Ratliff Erin L
Silk Jennifer S
Simmons W Kyle
Tapert Susan F
Publication venue: eScholarship, University of California
Publication date: 01/12/2019
Field of study

How parents manifest symptoms of anxiety or depression may affect how children learn to modulate their own distress, thereby influencing the children's risk for developing an anxiety or mood disorder. Conversely, children's mental health symptoms may impact parents' experiences of negative emotions. Therefore, mental health symptoms can have bidirectional effects in parent-child relationships, particularly during moments of distress or frustration (e.g., when a parent or child makes a costly mistake). The present study used simultaneous functional magnetic resonance imaging (fMRI) of parent-adolescent dyads to examine how brain activity when responding to each other's costly errors (i.e., dyadic error processing) may be associated with symptoms of anxiety and depression. While undergoing simultaneous fMRI scans, healthy dyads completed a task involving feigned errors that indicated their family member made a costly mistake. Inter-brain, random-effects multivariate modeling revealed that parents who exhibited decreased medial prefrontal cortex and posterior cingulate cortex activation when viewing their child's costly error response had children with more symptoms of depression and anxiety. Adolescents with increased anterior insula activation when viewing a costly error made by their parent had more anxious parents. These results reveal cross-brain associations between mental health symptomatology and brain activity during parent-child dyadic error processing

eScholarship - University of California

Discriminatively Trained Latent Ordinal Model for Video Classification

Author: Sharma Gaurav
Sikka Karan
Publication venue
Publication date: 14/08/2017
Field of study

We study the problem of video classification for facial analysis and human action recognition. We propose a novel weakly supervised learning method that models the video as a sequence of automatically mined, discriminative sub-events (eg. onset and offset phase for "smile", running and jumping for "highjump"). The proposed model is inspired by the recent works on Multiple Instance Learning and latent SVM/HCRF -- it extends such frameworks to model the ordinal aspect in the videos, approximately. We obtain consistent improvements over relevant competitive baselines on four challenging and publicly available video based facial analysis datasets for prediction of expression, clinical pain and intent in dyadic conversations and on three challenging human action datasets. We also validate the method with qualitative results and show that they largely support the intuitions behind the method.Comment: Paper accepted in IEEE TPAMI. arXiv admin note: substantial text overlap with arXiv:1604.0150

arXiv.org e-Print Archive

MPG.PuRe