• DocumentCode
    566754
  • Title

    A product named entity normalization method based on entity relations

  • Author

    Sun, Chengjie ; Lin, Lei ; Liu, Ming ; Liu, Bingquan ; Sha, Xuejun

  • Author_Institution
    Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
  • Volume
    1
  • fYear
    2012
  • fDate
    26-28 June 2012
  • Firstpage
    166
  • Lastpage
    169
  • Abstract
    With the popularity and prosperity of e-commerce, text mining technologies for e-commerce information processing have been become more and more important. Product named entity normalization technology plays a vital role for the performance of e-commerce information processing because it can resolve the ambiguities of product named entities which is caused by the rich aliases and the complex structures of product names. This work proposed a relation based method for product named entity normalization. The proposed method first detected the relations between entities, and then used the relations to inference the full form of an entity. After that the similarities between the target entity with full form and the entries in a dictionary were calculated. The corresponding identifier of the most similar entries in the dictionary was chosen as the normalization result for the target entity. When calculating the similarity between two entities, the structures of the two entities were considered. Experiments on an annotated corpus consisting of web documents related to electronic product showed promising results of the proposed method, which achieved an accuracy of 88.09%.
  • Keywords
    Internet; data mining; electronic commerce; text analysis; Web documents; dictionary; e-commerce information processing; electronic product; entity relation; popularity; product named entity normalization method; product named entity normalization technology; product names; prosperity; text mining technology; Accuracy; Computational linguistics; Data mining; Dictionaries; Feature extraction; Joining processes; Knowledge based systems; entity similarity; product named entity normalization; relation extraction; semi-supervised learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science and Digital Content Technology (ICIDT), 2012 8th International Conference on
  • Conference_Location
    Jeju
  • Print_ISBN
    978-1-4673-1288-2
  • Type

    conf

  • Filename
    6269249