DocumentCode :
2319257
Title :
Detecting hot topics in technology news streams
Author :
You, Bo ; Liu, Ming ; Liu, Bing-quan ; Wang, Xiao-long
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., Harbin, China
Volume :
5
fYear :
2012
fDate :
15-17 July 2012
Firstpage :
1968
Lastpage :
1974
Abstract :
Detecting hot topics with a fine granularity in technology news streams is an interesting and important problem given the large amount of reports and a relatively narrow range of topics. In this paper, a three-phase method is proposed. In the first phase, the document topic distribution vector is generated and keywords are extracted for each document using topic model pachinko allocation. In the second phase, the documents are clustered based on the document topic distribution vector obtained from the previous phase using affinity propagation. And in the last phase, actual events denoted by combinations of keywords within each cluster are found out using frequent pattern mining algorithms. We evaluate our approach on a collection of technology news reports from various sites in a fixed time period. T he results show that this method is effective.
Keywords :
data mining; document handling; information resources; affinity propagation; document topic distribution vector; hot topics detection; pattern mining; technology news streams; topic model pachinko allocation; Abstracts; Merging; Document clustering; Frequent pattern mining; Hot topic; Technology news streams; Topic model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics (ICMLC), 2012 International Conference on
Conference_Location :
Xian
ISSN :
2160-133X
Print_ISBN :
978-1-4673-1484-8
Type :
conf
DOI :
10.1109/ICMLC.2012.6359678
Filename :
6359678
Link To Document :
بازگشت