Search CORE

4,401 research outputs found

A Survey of Current Datasets for Vision and Language Research

Author: Devlin Jacob
Ferraro Francis
Galley Michel
Huang
Mitchell Margaret
Mostafazadeh Nasrin
Ting-Hao
Vanderwende Lucy
Publication venue
Publication date: 01/01/2015
Field of study

Integrating vision and language has long been a dream in work on artificial intelligence (AI). In the past two years, we have witnessed an explosion of work that brings together vision and language from images to videos and beyond. The available corpora have played a crucial role in advancing this area of research. In this paper, we propose a set of quality metrics for evaluating and analyzing the vision & language datasets and categorize them accordingly. Our analyses show that the most recent datasets have been using more complex language and more abstract concepts, however, there are different strengths and weaknesses in each.Comment: To appear in EMNLP 2015, short proceedings. Dataset analysis and discussion expanded, including an initial examination into reporting bias for one of them. F.F. and N.M. contributed equally to this wor

arXiv.org e-Print Archive

Crossref

Deep Active Learning for Dialogue Generation

Author: Asghar Nabiha
Jiang Xin
Li Hang
Poupart Pascal
Publication venue
Publication date: 01/01/2017
Field of study

We propose an online, end-to-end, neural generative conversational model for open-domain dialogue. It is trained using a unique combination of offline two-phase supervised learning and online human-in-the-loop active learning. While most existing research proposes offline supervision or hand-crafted reward functions for online reinforcement, we devise a novel interactive learning mechanism based on hamming-diverse beam search for response generation and one-character user-feedback at each step. Experiments show that our model inherently promotes the generation of semantically relevant and interesting responses, and can be used to train agents with customized personas, moods and conversational styles.Comment: Accepted at 6th Joint Conference on Lexical and Computational Semantics (*SEM) 2017 (Previously titled "Online Sequence-to-Sequence Active Learning for Open-Domain Dialogue Generation" on ArXiv

arXiv.org e-Print Archive

Crossref

A Neural Network Approach to Context-Sensitive Generation of Conversational Responses

Author: Auli Michael
Brockett Chris
Dolan Bill
Galley Michel
Gao Jianfeng
Ji Yangfeng
Mitchell Margaret
Nie Jian-Yun
Sordoni Alessandro
Publication venue
Publication date: 01/01/2015
Field of study

We present a novel response generation system that can be trained end to end on large quantities of unstructured Twitter conversations. A neural network architecture is used to address sparsity issues that arise when integrating contextual information into classic statistical models, allowing the system to take into account previous dialog utterances. Our dynamic-context generative models show consistent gains over both context-sensitive and non-context-sensitive Machine Translation and Information Retrieval baselines.Comment: A. Sordoni, M. Galley, M. Auli, C. Brockett, Y. Ji, M. Mitchell, J.-Y. Nie, J. Gao, B. Dolan. 2015. A Neural Network Approach to Context-Sensitive Generation of Conversational Responses. In Proc. of NAACL-HLT. Pages 196-20

arXiv.org e-Print Archive

Crossref

How to do things with modals

Author: Mandelkern Matthew
Publication venue
Publication date: 01/01/2019
Field of study

Mind &Language, Volume 35, Issue 1, Page 115-138, February 2020

PhilPapers

Crossref

Oxford University Research Archive

Generating Natural Questions About an Image

Author: Devlin Jacob
He Xiaodong
Misra Ishan
Mitchell Margaret
Mostafazadeh Nasrin
Vanderwende Lucy
Publication venue
Publication date: 01/01/2016
Field of study

There has been an explosion of work in the vision & language community during the past few years from image captioning to video transcription, and answering questions about images. These tasks have focused on literal descriptions of the image. To move beyond the literal, we choose to explore how questions about an image are often directed at commonsense inference and the abstract events evoked by objects in the image. In this paper, we introduce the novel task of Visual Question Generation (VQG), where the system is tasked with asking a natural and engaging question when shown an image. We provide three datasets which cover a variety of images from object-centric to event-centric, with considerably more abstract training data than provided to state-of-the-art captioning systems thus far. We train and test several generative and retrieval models to tackle the task of VQG. Evaluation results show that while such models ask reasonable questions for a variety of images, there is still a wide gap with human performance which motivates further work on connecting images with commonsense knowledge and pragmatics. Our proposed task offers a new challenge to the community which we hope furthers interest in exploring deeper connections between vision & language.Comment: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistic

arXiv.org e-Print Archive

Crossref

Design Knowledge for the Lifecycle Management of Conversational Agents

Author: Böhmann Tilo
Heuer Marvin
Lewandowski Tom
Vogel Pascal
Publication venue: AIS Electronic Library (AISeL)
Publication date: 17/01/2022
Field of study

Organizations spend extensive resources on artificial intelligence (AI) solutions in customer service in order to remain customer-focused and competitive. A rising language-based application of AI emerges in the context of conversational agents (CAs), such as chatbots, which represent increasingly intelligent, autonomous, scalable, and cost-effective service platforms. However, AI-based CAs bring new organizational challenges. They are underrepresented in current research, leading to many unanswered questions and research potential regarding the management of their introduction, operation, and improvement. To address this issue, we provide design knowledge that considers the organizational perspective of CAs. Therefore, we conducted a systematic literature review (SLR) and qualitative interview study to reveal and analyze individual issues and challenges, develop meta-requirements, and finally, use them to create design principles. We contribute to the emerging field of CAs that has previously focused mainly on the individual, behavioral, interactional, or technical design

AIS Electronic Library (AISeL)