University of Sussex
Browse
2020.lrec-1.638.pdf (276.29 kB)

English recipe flow graph corpus

Download (276.29 kB)
conference contribution
posted on 2023-06-09, 21:16 authored by Yoko Yamakata, Shinsuke Mori, John Carroll
We present an annotated corpus of English cooking recipe procedures, and describe and evaluate computational methods for learning these annotations. The corpus consists of 300 recipes written by members of the public, which we have annotated with domain-specific linguistic and semantic structure. Each recipe is annotated with (1) `recipe named entities' (r-NEs) specific to the recipe domain, and (2) a flow graph representing in detail the sequencing of steps, and interactions between cooking tools, food ingredients and the products of intermediate steps. For these two kinds of annotations, inter-annotator agreement ranges from 82.3 to 90.5 F1, indicating that our annotation scheme is appropriate and consistent. We experiment with producing these annotations automatically. For r-NE tagging we train a deep neural network NER tool; to compute flow graphs we train a dependency-style parsing procedure which we apply to the entire sequence of r-NEs in a recipe.In evaluations, our systems achieve 71.1 to 87.5 F1, demonstrating that our annotation scheme is learnable.

History

Publication status

  • Published

File Version

  • Published version

Journal

Proceedings of the 12th Language Resources and Evaluation Conference

Publisher

European Language Resources Association (ELRA)

Page range

5187-5194

Pages

8.0

Event name

12th Language Resources and Evaluation Conference

Event location

Marseille, France

Event type

conference

Event date

11th - 16th May 2020

Department affiliated with

  • Informatics Publications

Full text available

  • Yes

Peer reviewed?

  • Yes

Legacy Posted Date

2020-06-08

First Open Access (FOA) Date

2020-06-08

First Compliant Deposit (FCD) Date

2020-06-08

Usage metrics

    University of Sussex (Publications)

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC