• DocumentCode
    3585088
  • Title

    Data collection and language understanding of food descriptions

  • Author

    Korpusik, Mandy ; Schmidt, Nicole ; Drexler, Jennifer ; Cyphers, Scott ; Glass, James

  • Author_Institution
    MIT Comput. Sci. & Artificial Intell. Lab., Cambridge, MA, USA
  • fYear
    2014
  • Firstpage
    560
  • Lastpage
    565
  • Abstract
    This paper presents initial data collection and language understanding experiments conducted as part of a larger effort to create a nutrition dialogue system that automatically extracts food concepts from a user´s spoken meal description. We first summarize the data collection and annotation of food descriptions performed via Amazon Mechanical Turk. We then present semantic labeling experiments using a semi-Markov conditional random field (CRF) that obtains an F1 test score of 85.1. Finally, we report food segmentation experiments that explored three methods for associating foods with their corresponding attributes: a generative Markov model, transformation-based learning, and a CRF classifier. The CRF performed best, achieving an F1 test score of 87.1.
  • Keywords
    Markov processes; data handling; health care; information retrieval; interactive systems; learning (artificial intelligence); pattern classification; speech-based user interfaces; Amazon Mechanical Turk; CRF classifier; F1 test score; data collection; food concept extraction; food description annotation; food segmentation experiments; generative Markov model; language understanding; nutrition dialogue system; semantic labeling experiments; semi Markov conditional random field; spoken meal description; transformation-based learning; Dairy products; Data collection; Data models; Labeling; Markov processes; Semantics; Tagging; CRF; Data collection; Markov model; Semantic tagging; Transformation-based learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2014 IEEE
  • Type

    conf

  • DOI
    10.1109/SLT.2014.7078635
  • Filename
    7078635