Search CORE

20,176 research outputs found

Recommended from our members

Generation of multi-modal dialogue for a net environment

Author: Baumann S.
Grice M.
Gstrein E.
Klesen M.
Krenn B.
Pirker H.
Piwek P.
Schroeder M.
van Deemter K.
Publication venue
Publication date: 01/01/2002
Field of study

In this paper an architecture and special purpose markup language for simulated affective face-to-face communication is presented. In systems based on this architecture, users will be able to watch embodied conversational agents interact with each other in virtual locations on the internet. The markup language, or Rich Representation Language (RRL), has been designed to provide an integrated representation of speech, gesture, posture and facial animation

Open Research Online (The Open University)

Learning how to learn: an adaptive dialogue agent for incrementally learning visually grounded word meanings

Author: Eshghi Arash
Lemon Oliver
Yu Yanchao
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2017
Field of study

We present an optimised multi-modal dialogue agent for interactive learning of visually grounded word meanings from a human tutor, trained on real human-human tutoring data. Within a life-long interactive learning period, the agent, trained using Reinforcement Learning (RL), must be able to handle natural conversations with human users and achieve good learning performance (accuracy) while minimising human effort in the learning process. We train and evaluate this system in interaction with a simulated human tutor, which is built on the BURCHAK corpus -- a Human-Human Dialogue dataset for the visual learning task. The results show that: 1) The learned policy can coherently interact with the simulated user to achieve the goal of the task (i.e. learning visual attributes of objects, e.g. colour and shape); and 2) it finds a better trade-off between classifier accuracy and tutoring costs than hand-crafted rule-based policies, including ones with dynamic policies.Comment: 10 pages, RoboNLP Workshop from ACL Conferenc

arXiv.org e-Print Archive

Crossref

Improving Context Modelling in Multimodal Dialogue Generation

Author: Agarwal Shubham
Dusek Ondrej
Konstas Ioannis
Rieser Verena
Publication venue
Publication date: 01/01/2018
Field of study

In this work, we investigate the task of textual response generation in a multimodal task-oriented dialogue system. Our work is based on the recently released Multimodal Dialogue (MMD) dataset (Saha et al., 2017) in the fashion domain. We introduce a multimodal extension to the Hierarchical Recurrent Encoder-Decoder (HRED) model and show that this extension outperforms strong baselines in terms of text-based similarity metrics. We also showcase the shortcomings of current vision and language models by performing an error analysis on our system's output

arXiv.org e-Print Archive

Heriot Watt Pure

Crossref

Designing and Implementing Embodied Agents: Learning from Experience

Author: Heylen D.K.J.
Nijholt A.
Publication venue: AMAAS
Publication date: 01/01/2001
Field of study

In this paper, we provide an overview of part of our experience in designing and implementing some of the embodied agents and talking faces that we have used for our research into human computer interaction. We focus on the techniques that were used and evaluate this with respect to the purpose that the agents and faces were to serve and the costs involved in producing and maintaining the software. We discuss the function of this research and development in relation to the educational programme of our graduate students

CiteSeerX

University of Twente Research Information

Multimodal Interaction in a Haptic Environment

Author: Kole S.
Nijholt A.
Zwiers J.
Publication venue: IEEE Computer Society
Publication date: 01/01/2005
Field of study

In this paper we investigate the introduction of haptics in a multimodal tutoring environment. In this environment a haptic device is used to control a virtual piece of sterile cotton and a virtual injection needle. Speech input and output is provided to interact with a virtual tutor, available as a talking head, and a virtual patient. We introduce the haptic tasks and how different agents in the multi-agent system are made responsible for them. Notes are provided about the way we introduce an affective model in the tutor agent

University of Twente Research Information