Title :
The research of label-mapping-based entity attribute extraction
Author :
Liu, Huilin ; Chen, Chen ; Zhang, Liwei ; Wang, Guoren
Author_Institution :
Coll. of Inf. Sci. & Eng., Northeastern Univ., Shenyang, China
Abstract :
With the rapid development of new media, such as computer and Internet, extract valuable entity attribute information from Web text can be significant. Aiming at this problem, this paper puts forward SALmap, this model calls seed method at first, which will create common candidate attribute label sets by defining data format rules. Then we construct the mapping relationship between the attributes and the labels using attribute value information and the maximum entropy model, and label the entity instance as well. Finally, hidden Markov model is applied to the relevant entity attribute extraction. Experiments prove SALmap model can significantly improve the precision and performance of entity attribute extraction.
Keywords :
Internet; data mining; entropy; hidden Markov models; information retrieval; set theory; SALmap; Web text; attribute value information; data format rule; entity attribute information; hidden Markov model; mapping relationship; maximum entropy model; Artificial neural networks; Books; Computational modeling; Entropy; Hidden Markov models; Information filters; attribute extraction; attribute values; hidden Markov model; label mapping; maximum entropy model;
Conference_Titel :
Progress in Informatics and Computing (PIC), 2010 IEEE International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-6788-4
DOI :
10.1109/PIC.2010.5687859