Title :
Multi-level topic detection algorithm for Netnews Specials
Author :
Yu Peng ; Zhiqing Lin ; Bo Xiao ; Chuang Zhang
Author_Institution :
PRIS laboratory, Beijing University of Posts and Telecommunications BUPT, China
Abstract :
This paper investigates the topic detection method in Netnews Specials Detection (NSD). We found that when the traditional clustering algorithms are used in NSD, the same topic is usually split into several pieces and the result is not satisfying. So a new algorithm is proposed which uses a multi-level model, better suited for NSD. Firstly, such algorithm elevates the accuracy of single-layer clustering by introducing hot search words, a selective dictionary, and an advanced weight formula. Secondly, the multiple-level model not only avoids the problem of topic over-split but also establishes a structure for Netnews Specials, which lays the foundation for quick viewing, positioning and retrieval. Experimental results show that the algorithm in the real test corpus have high accuracy, doing a better job than the traditional clustering method.
Keywords :
Algorithm design and analysis; Clustering algorithms; Lead; Natural Language Processing (NLP); Netnews Specials; Vector Space Model (VSM); topic detection model;
Conference_Titel :
Conference Anthology, IEEE
Conference_Location :
China
DOI :
10.1109/ANTHOLOGY.2013.6784968