DocumentCode :
1927569
Title :
Topic Detection from Blog Documents Using Users´ Interests
Author :
Sekiguchi, Y. ; Kawashima, H. ; Okuda, H. ; Oku, M.
Author_Institution :
NTT Corporation, Japan
fYear :
2006
fDate :
10-12 May 2006
Firstpage :
108
Lastpage :
108
Abstract :
In this paper, we describe a method to detect topic words from blog documents. We define "topic words" as words frequently used by people who share the same interests. In this method, each blogger’s interests are extracted from each blog site, and interest similarities between bloggers are calculated. Unusual words that are used by bloggers who have a high level of similarity are then extracted as topic words. We evaluated the precision of this method using blog documents, and the results show that the proposed method is superior (by 4.4 %) to the traditional TF-IDF method in terms of precision.
Keywords :
Conference management; Data mining; Databases; Frequency conversion; Information services; Internet; Laboratories; Web services; Web sites;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Mobile Data Management, 2006. MDM 2006. 7th International Conference on
Conference_Location :
Nara, Japan
ISSN :
1551-6245
Print_ISBN :
0-7695-2526-1
Type :
conf
DOI :
10.1109/MDM.2006.153
Filename :
1630644
Link To Document :
بازگشت