DocumentCode :
2181848
Title :
Enhancing Object Distinction Utilizing Probabilistic Topic Model
Author :
Yumin Zhu ; Qingzhong Li
Author_Institution :
Sch. of Comput. Sci. & Technol., Shandong Univ., Jinan, China
fYear :
2013
fDate :
16-19 Dec. 2013
Firstpage :
177
Lastpage :
182
Abstract :
We develop a novel method for enhancing object distinction problem, which is to distinct different objects with identical names. The traditional approach is based on the attributes or contexts of the objects, not using the information of some attribute which may propagate between the objects and the attributes. To this end, we propose an approach which uses probabilistic topic model to improve the accuracy. We first model the attribute which has unstructured information and get its topics which may propagate to object as a new attribute, and then calculate the similarity values of the context attributes of the two objects with identical names, then we use these similarity values to build a decision tree model based on the training set. For the problem of object distinction for people, we combine the affiliation similarity with other context attributes similarity to judge whether the two people who share the same name correspond to the same people in real life. Experiments show that our method using Probabilistic Topic Model can achieve high accuracy.
Keywords :
Big Data; data analysis; decision trees; probability; big data; context attribute similarity values; decision tree model; enhancing object distinction problem; identical named objects; object attribute; object context; probabilistic topic model; unstructured information; Accuracy; Context; Context modeling; Decision trees; Probabilistic logic; Probability distribution; Training; Latent Dirichlet Allocation; context attribute; decision tree; object distinction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Cloud Computing and Big Data (CloudCom-Asia), 2013 International Conference on
Conference_Location :
Fuzhou
Print_ISBN :
978-1-4799-2829-3
Type :
conf
DOI :
10.1109/CLOUDCOM-ASIA.2013.61
Filename :
6820990
Link To Document :
بازگشت