Fine-grained Linguistic Evaluation of Question Answering Systems

Authors: Sarra El Ayari, Brigitte Grau, Anne-Laure Ligozat
Publication date: 19 May 2010
Publisher: HAL CCSD

Abstract

Question answering systems are complex systems that rely on natural language processing. Evaluation campaigns are organized to rank such systems by their final results (the number of correct answers). Teams that want to carry out a diagnostic evaluation, however, need to examine their systems' results more precisely, and no tools or methods exist to do so systematically. We present REVISE, a tool for glass box evaluation based on the diagnosis of question answering system results.