Search CORE

554 research outputs found

A Quantum Many-body Wave Function Inspired Language Modeling Approach

Author: Blacoe William
Hitchcock Frank L
Hu Baotian
Kim Yoon
Kingma Diederik P
Mikolov Tomas
Sordoni Alessandro
Wan Shengxian
Wang Mengqiu
Yin Wenpeng
Zhang Peng
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 03/09/2018
Field of study

The recently proposed quantum language model (QLM) aimed at a principled approach to modeling term dependency by applying the quantum probability theory. The latest development for a more effective QLM has adopted word embeddings as a kind of global dependency information and integrated the quantum-inspired idea in a neural network architecture. While these quantum-inspired LMs are theoretically more general and also practically effective, they have two major limitations. First, they have not taken into account the interaction among words with multiple meanings, which is common and important in understanding natural language text. Second, the integration of the quantum-inspired LM with the neural network was mainly for effective training of parameters, yet lacking a theoretical foundation accounting for such integration. To address these two issues, in this paper, we propose a Quantum Many-body Wave Function (QMWF) inspired language modeling approach. The QMWF inspired LM can adopt the tensor product to model the aforesaid interaction among words. It also enables us to reveal the inherent necessity of using Convolutional Neural Network (CNN) in QMWF language modeling. Furthermore, our approach delivers a simple algorithm to represent and match text/sentence pairs. Systematic evaluation shows the effectiveness of the proposed QMWF-LM algorithm, in comparison with the state of the art quantum-inspired LMs and a couple of CNN-based methods, on three typical Question Answering (QA) datasets.Comment: 10 pages,4 figures,CIK

arXiv.org e-Print Archive

Crossref

A Novel Framework for Robustness Analysis of Visual QA Models

Author: Alfadly Modar
Dao Cuong Duc
Ghanem Bernard
Huang Jia-Hong
Publication venue
Publication date: 24/12/2018
Field of study

Deep neural networks have been playing an essential role in many computer vision tasks including Visual Question Answering (VQA). Until recently, the study of their accuracy was the main focus of research but now there is a trend toward assessing the robustness of these models against adversarial attacks by evaluating their tolerance to varying noise levels. In VQA, adversarial attacks can target the image and/or the proposed main question and yet there is a lack of proper analysis of the later. In this work, we propose a flexible framework that focuses on the language part of VQA that uses semantically relevant questions, dubbed basic questions, acting as controllable noise to evaluate the robustness of VQA models. We hypothesize that the level of noise is positively correlated to the similarity of a basic question to the main question. Hence, to apply noise on any given main question, we rank a pool of basic questions based on their similarity by casting this ranking task as a LASSO optimization problem. Then, we propose a novel robustness measure, R_score, and two large-scale basic question datasets (BQDs) in order to standardize robustness analysis for VQA models.Comment: Accepted by the Thirty-Third AAAI Conference on Artificial Intelligence, (AAAI-19), as an oral pape

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

CNM: An Interpretable Complex-valued Network for Matching

Author: Li Qiuchi
Melucci Massimo
Wang Benyou
Publication venue
Publication date: 10/04/2019
Field of study

This paper seeks to model human language by the mathematical framework of quantum physics. With the well-designed mathematical formulations in quantum physics, this framework unifies different linguistic units in a single complex-valued vector space, e.g. words as particles in quantum states and sentences as mixed systems. A complex-valued network is built to implement this framework for semantic matching. With well-constrained complex-valued components, the network admits interpretations to explicit physical meanings. The proposed complex-valued network for matching (CNM) achieves comparable performances to strong CNN and RNN baselines on two benchmarking question answering (QA) datasets

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università di Padova