Title :
A practical approach to topic detection based on credible association rule mining
Author :
Lihua Wu ; Bo Xiao ; Zhiqing Lin ; Yueming Lu
Author_Institution :
Beijing Univ. of Posts & Telecommun., Beijing, China
Abstract :
Topic detection is to develop automatic methods to identify topically related documents within a stream of data; many approaches have been developed to classify documents with predefined knowledge. This paper presents a new approach for topic detection and tracking based on credible association rule (CAR). This paper considers topic detection without any prior knowledge of category structure or possible categories. Topic features are selected primarily based on CAR. Results on the test set show a marginal improvement by using CAR and its maximal cliques mining algorithm. The CAR maximal cliques mining algorithm is now applied on real topic detection and tracking system which gives us a lot of experience in adjusting and refining the algorithm. This algorithm also presents many useful interface extensions for other modules of the system to use.
Keywords :
data mining; CAR maximal clique mining algorithm; category structure; credible association rule mining; interface extensions; topic detection and tracking system; Accuracy; Adaptation models; Algorithm design and analysis; Buildings; Classification algorithms; Dictionaries; Feature extraction; Credible association rule; Feature selection; Maximal cliques mining; Quasi maximal cliques mining; Topic detection;
Conference_Titel :
Network Infrastructure and Digital Content (IC-NIDC), 2012 3rd IEEE International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4673-2201-0
DOI :
10.1109/ICNIDC.2012.6418749