A comparison of cooking recipe named entities between Japanese and English

Yamakata, Yoko, Carroll, John and Mori, Shinsuke (2017) A comparison of cooking recipe named entities between Japanese and English. Published in: Proceedings of the 9th Workshop on Multimedia for Cooking and Eating Activities (CEA2017); Melbourne, Australia; 20 August 2017. Association for Computing Machinery ISBN 9781450352673 (Accepted)

[img] PDF - Accepted Version
Restricted to SRO admin only until 20 September 2017.

Download (147kB)

Abstract

In this paper, we analyze the structural differences between the instructional text in Japanese and English cooking recipes. First, we constructed an English recipe corpus of 100 recipes, designed to be comparable to an existing Japanese recipe corpus. We annotated recipe named entities (r-NEs) in the English corpus according to guidelines previously defined for Japanese. We trained a state-of-art NE recognizer, PWNER, on the English r-NEs, and achieved very similar accuracy and coverage to previous results for the Japanese corpus, thus demonstrating the quality and consistency of the annotations. Second, we compared the r-NEs annotated in the Japanese and English corpora, and uncovered lexical, semantic, and underlying structural differences between Japanese and English recipes. We discuss reasons for these differences, which have significant implications for cross-language retrieval and automatic translation of recipes.

Item Type: Conference Proceedings
Keywords: Cooking recipe named entity, NER, Japanese and English Comparison
Schools and Departments: School of Engineering and Informatics > Informatics
Research Centres and Groups: Data Science Research Group
Subjects: Q Science > QA Mathematics > QA0075 Electronic computers. Computer science
Related URLs:
Depositing User: John Carroll
Date Deposited: 05 Jul 2017 13:36
Last Modified: 11 Jul 2017 09:57
URI: http://sro.sussex.ac.uk/id/eprint/69061

View download statistics for this item

📧 Request an update