MQA: Answering the Question via Robotic Manipulation

Deng, Yuhong; Guo, Di; Guo, Xiaofeng; Liu, Huaping; Sun, Fuchun; Zhang, Naifu

MQA: Answering the Question via Robotic Manipulation

Authors: Yuhong Deng
Di Guo
Xiaofeng Guo
Huaping Liu
Fuchun Sun
Naifu Zhang
Publication date: 12 December 2020
Publisher

Abstract

In this paper, we propose a novel task -- Manipulation Question Answering (MQA), where the robot is required to find the answer to the question by actively exploring the environment via manipulation. A framework consisting of a QA model and a manipulation model is proposed to solve this problem. For the QA model, we adopt the method of Visual Question Answering (VQA). For the manipulation model, a Deep Q Network (DQN) model is proposed to generate manipulations. By manipulating objects, the robot can continuously explore the bin until the answer to the question is found. Besides, a novel dataset for simulation that contains a variety of object models, complicated scenarios and corresponding question-answer pairs is established. Extensive experiments have been conducted to validate the effectiveness of the proposed framework

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2003.04641

Last time updated on 12/10/2020