English recipe flow graph corpus

Carroll, John; Mori, Shinsuke; Yamakata, Yoko

English recipe flow graph corpus

Authors: John Carroll
Shinsuke Mori
Yoko Yamakata
Publication date: 15 May 2020
Publisher: European Language Resources Association (ELRA)

Abstract

We present an annotated corpus of English cooking recipe procedures, and describe and evaluate computational methods for learning these annotations. The corpus consists of 300 recipes written by members of the public, which we have annotated with domain-specific linguistic and semantic structure. Each recipe is annotated with (1) `recipe named entities' (r-NEs) specific to the recipe domain, and (2) a flow graph representing in detail the sequencing of steps, and interactions between cooking tools, food ingredients and the products of intermediate steps. For these two kinds of annotations, inter-annotator agreement ranges from 82.3 to 90.5 F1, indicating that our annotation scheme is appropriate and consistent. We experiment with producing these annotations automatically. For r-NE tagging we train a deep neural network NER tool; to compute flow graphs we train a dependency-style parsing procedure which we apply to the entire sequence of r-NEs in a recipe.In evaluations, our systems achieve 71.1 to 87.5 F1, demonstrating that our annotation scheme is learnable

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Sustaining member

Sussex Research Online

oai:figshare.com:article/23476...

Last time updated on 05/12/2023