Search CORE

1,787 research outputs found

The Production of Speech Corpora

Author: Baumann Angela
Draxler Christoph
Ellbogen Tania
Schiel Florian
Steffen Alexander
Publication venue
Publication date: 21/03/2012
Field of study

Automatic Construction of Evaluation Suites for Natural Language Generation Datasets

Author: Dhole Kaustubh
Gangal Varun Prashant
Gehrmann Sebastian
Kale Mihir
Mahamood Saad
Mille Simon
Miltenburg Emiel van
Perez-Beltrachini Laura
Publication venue
Publication date: 01/01/2021
Field of study

Tilburg University Repository

IMAGINE Final Report

Author: Arana C
Dattani I
Pick R
Recio I
Schmidt P
Publication venue: s.n.
Publication date: 01/09/2003
Field of study

Southampton (e-Prints Soton)

Automatic Construction of Evaluation Suites for Natural Language Generation Datasets

Author: Dhole Kaustubh D.
Gangal Varun
Gehrmann Sebastian
Kale Mihir
Mahamood Saad
Mille Simon
Perez-Beltrachini Laura
van Miltenburg Emiel
Publication venue
Publication date: 01/01/2021
Field of study

Machine learning approaches applied to NLP are often evaluated by summarizing their performance in a single number, for example accuracy. Since most test sets are constructed as an i.i.d. sample from the overall data, this approach overly simplifies the complexity of language and encourages overfitting to the head of the data distribution. As such, rare language phenomena or text about underrepresented groups are not equally included in the evaluation. To encourage more in-depth model analyses, researchers have proposed the use of multiple test sets, also called challenge sets, that assess specific capabilities of a model. In this paper, we develop a framework based on this idea which is able to generate controlled perturbations and identify subsets in text-to-scalar, text-to-text, or data-to-text settings. By applying this framework to the GEM generation benchmark, we propose an evaluation suite made of 80 challenge sets, demonstrate the kinds of analyses that it enables and shed light onto the limits of current generation models

arXiv.org e-Print Archive

Tilburg University Repository