• DocumentCode
    589181
  • Title
    Accurate Product Name Recognition from User Generated Content
  • Author
    Wu, Sen ; Fang, Zhanpeng ; Tang, Jie
  • fYear
    2012
  • fDate
    10-10 Dec. 2012
  • Firstpage
    874
  • Lastpage
    877
  • Abstract
    This paper presents the solution of the team "ISSSID" for the Consumer Products Contest #1 (CPROD1) of ICDM 2012. The contest provides a dataset including hundreds of thousands of text items, a product catalog with over fifteen million products, and hundreds of manually annotated product mentions. The goal of the competition is to automatically recognize product mentions in the textual content and to disambiguate which product(s) in the product catalog each mention refers to. We propose a hybrid approach that combines the results of several separately trained recognition models. Specifically, the approach uses a standard matching model, a rule template model, and a conditional random field model, and combines their results using a blending model. The proposed approach achieves the best performance in the contest.
  • Keywords
    Catalogs; Data mining; Educational institutions; Lead; Semantics; Standards; Training data; CPROD1; Named Entity Recognition; Natural Language Processing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels, Belgium
  • Print_ISBN
    978-1-4673-5164-5
  • Type
    conf
  • DOI
    10.1109/ICDMW.2012.129
  • Filename
    6406534