DocumentCode :
673200
Title :
Hot topic extraction based on frequency, position, scattering and topical weight for time sliced news documents
Author :
Jahnavi, Y. ; Radhika, Y.
Author_Institution :
Comput. Sci. & Eng. Dept., GITAM Univ., Hyderabad, India
fYear :
2013
fDate :
21-22 Sept. 2013
Firstpage :
1
Lastpage :
6
Abstract :
Internet based news documents are the basic information transmission media. In such a case detecting hot topics and tracking the event development is most important. However, it is almost impossible to view all the generated topics, due to its large amount of size. Therefore it is necessary to rank the topics. The topic ranking should be done on the importance basis. But this importance is determined by how frequently a topic appears and this importance varies in different time slots. For extracting hot topics, most of the text mining approaches with vector space model need to determine the weighting of the feature terms. Existing traditional algorithms can´t achieve high accuracy for retrieving hot terms, because they have not considered position, scattering and topicality. This paper presents an innovative and effective hot term extraction by considering position, scattering and topicality of terms along with frequency.
Keywords :
Internet; data mining; document handling; information retrieval; text analysis; Internet based news documents; event development tracking; hot term retrieval; hot topic extraction; information transmission media; innovative hot term extraction; text mining approach; time sliced news documents; topic frequency; topic position; topic scattering; topical weight; vector space model; Algorithm design and analysis; Feature extraction; Scattering; Text mining; Time-frequency analysis; Vectors; TF-PDF; Term Weighting; Text Mining; Vector Space Model (VSM);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Computing Technologies (ICACT), 2013 15th International Conference on
Conference_Location :
Rajampet
Print_ISBN :
978-1-4673-2816-6
Type :
conf
DOI :
10.1109/ICACT.2013.6710495
Filename :
6710495
Link To Document :
بازگشت