• DocumentCode
    2418993
  • Title

    Data collection and normalization for building the Scenario-Based Lexical Knowledge Resource of a text-to-scene conversion system

  • Author

    Rouhizadeh, Masoud ; Bowler, Margit ; Sproat, Richard ; Coyne, Bob

  • Author_Institution
    Center for Spoken Language Understanding, Oregon Health & Sci. Univ., Portland, OR, USA
  • fYear
    2010
  • fDate
    9-10 Dec. 2010
  • Firstpage
    25
  • Lastpage
    30
  • Abstract
    WordsEye is a system for converting from English text into three-dimensional graphical scenes that represent that text. It works by performing syntactic and semantic analyses on the input text, producing a description of the arrangement of objects in a scene. At the core of WordsEye is the Scenario-Based Lexical Knowledge Resource (SBLR), a unified knowledge base and representational system for expressing lexical and real-world knowledge needed to depict scenes from text. This paper explores information collection methods for building the SBLR, using Amazon´s Mechanical Turk (AMT) and manual normalization of raw AMT data. The paper follows with manual review of existing relations in the SBLR and classification of the AMT data into existing and new semantic relations. Since manual annotation is a time-consuming and expensive approach, we also explored the use of automatic normalization of AMT data through log-odds and log-likelihood ratios extracted from the English Gigaword corpus, as well as through WordNet similarity measures.
  • Keywords
    knowledge representation; multimedia computing; natural language interfaces; natural language processing; natural scenes; text analysis; Amazon mechanical turk; WordsEye; data collection; data normalization; scenario based lexical knowledge resource; text to scene conversion system; three dimensional graphical scene; Animation; Data mining; Libraries; Manuals; Natural languages; Semantics; Three dimensional displays;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Media Adaptation and Personalization (SMAP), 2010 5th International Workshop on
  • Conference_Location
    Limmassol
  • Print_ISBN
    978-1-4244-8603-8
  • Electronic_ISBN
    978-1-4244-8601-4
  • Type

    conf

  • DOI
    10.1109/SMAP.2010.5706851
  • Filename
    5706851