Title :
Mining Latent Associations of Objects Using a Typed Mixture Model--A Case Study on Expert/Expertise Mining
Author :
Bao, Shenghua ; Cao, Yunbo ; Liu, Bing ; Yu, Yong ; Li, Hang
Author_Institution :
Comput. Sci. & Eng., Shanghai Jiao Tong Univ., Shanghai
Abstract :
This paper studies the problem of discovering latent associations among objects in text documents. Specifically, given two sets of objects and various types of co-occurrence data concerning the objects existing in texts, we aim to discover the hidden or latent associative relationships between the two sets of objects. Existing methods are not directly applicable as they are unable to consider all this information. For example, the probabilistic mixture model called Separable Mixture Model (SMM) proposed by Hofmann can use only one type of co-occurrences to mine latent associations. This paper proposes a more general probabilistic mixture model called the Typed Separable Mixture Model (TSMM), which is able to use all types of co-occurrences within a single framework. Experimental results based on the expert/expertise mining task show that TSMM outperforms SMM significantly.
Keywords :
data mining; document handling; probability; co-occurrence data; expert mining; expertise mining; latent associations discovery; latent associations mining; latent associative relationships; probabilistic mixture model; text documents; typed mixture model; typed separable mixture model; Asia; Collaboration; Computer science; Data mining; Information filtering; Information filters; Information retrieval; Natural language processing;
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2701-7
DOI :
10.1109/ICDM.2006.109