DocumentCode :
566754
Title :
A product named entity normalization method based on entity relations
Author :
Sun, Chengjie ; Lin, Lei ; Liu, Ming ; Liu, Bingquan ; Sha, Xuejun
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
Volume :
1
fYear :
2012
fDate :
26-28 June 2012
Firstpage :
166
Lastpage :
169
Abstract :
With the popularity and prosperity of e-commerce, text mining technologies for e-commerce information processing have been become more and more important. Product named entity normalization technology plays a vital role for the performance of e-commerce information processing because it can resolve the ambiguities of product named entities which is caused by the rich aliases and the complex structures of product names. This work proposed a relation based method for product named entity normalization. The proposed method first detected the relations between entities, and then used the relations to inference the full form of an entity. After that the similarities between the target entity with full form and the entries in a dictionary were calculated. The corresponding identifier of the most similar entries in the dictionary was chosen as the normalization result for the target entity. When calculating the similarity between two entities, the structures of the two entities were considered. Experiments on an annotated corpus consisting of web documents related to electronic product showed promising results of the proposed method, which achieved an accuracy of 88.09%.
Keywords :
Internet; data mining; electronic commerce; text analysis; Web documents; dictionary; e-commerce information processing; electronic product; entity relation; popularity; product named entity normalization method; product named entity normalization technology; product names; prosperity; text mining technology; Accuracy; Computational linguistics; Data mining; Dictionaries; Feature extraction; Joining processes; Knowledge based systems; entity similarity; product named entity normalization; relation extraction; semi-supervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science and Digital Content Technology (ICIDT), 2012 8th International Conference on
Conference_Location :
Jeju
Print_ISBN :
978-1-4673-1288-2
Type :
conf
Filename :
6269249
Link To Document :
بازگشت