Title :
Data collection and language understanding of food descriptions
Author :
Korpusik, Mandy ; Schmidt, Nicole ; Drexler, Jennifer ; Cyphers, Scott ; Glass, James
Author_Institution :
MIT Comput. Sci. & Artificial Intell. Lab., Cambridge, MA, USA
Abstract :
This paper presents initial data collection and language understanding experiments conducted as part of a larger effort to create a nutrition dialogue system that automatically extracts food concepts from a user´s spoken meal description. We first summarize the data collection and annotation of food descriptions performed via Amazon Mechanical Turk. We then present semantic labeling experiments using a semi-Markov conditional random field (CRF) that obtains an F1 test score of 85.1. Finally, we report food segmentation experiments that explored three methods for associating foods with their corresponding attributes: a generative Markov model, transformation-based learning, and a CRF classifier. The CRF performed best, achieving an F1 test score of 87.1.
Keywords :
Markov processes; data handling; health care; information retrieval; interactive systems; learning (artificial intelligence); pattern classification; speech-based user interfaces; Amazon Mechanical Turk; CRF classifier; F1 test score; data collection; food concept extraction; food description annotation; food segmentation experiments; generative Markov model; language understanding; nutrition dialogue system; semantic labeling experiments; semi Markov conditional random field; spoken meal description; transformation-based learning; Dairy products; Data collection; Data models; Labeling; Markov processes; Semantics; Tagging; CRF; Data collection; Markov model; Semantic tagging; Transformation-based learning;
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
DOI :
10.1109/SLT.2014.7078635