DocumentCode :
2551624
Title :
Topic Detection and Extraction in Chat
Author :
Adams, Paige H. ; Martell, Craig H.
Author_Institution :
Dept. of Comput. Sci., Naval Postgrad. Sch., Monterey, CA
fYear :
2008
fDate :
4-7 Aug. 2008
Firstpage :
581
Lastpage :
588
Abstract :
Internet-based Chat environments such as Internet relay Chat and instant messaging pose a challenge for data mining and information retrieval systems due to the multi-threaded, overlapping nature of the dialog and the nonstandard usage of language. In this paper we present preliminary methods of topic detection and topic thread extraction that augment a typical TF-IDF-based vector space model approach with temporal relationship information between posts of the Chat dialog combined with WordNet hypernym augmentation. We show results that promise better performance than using only a TF-IDF bag-of-words vector space model.
Keywords :
Internet; computer mediated communication; feature extraction; information retrieval; Chat extraction; Internet relay Chat; Internet-based Chat environments; WordNet hypernym augmentation; information retrieval systems; instant messaging; temporal relationship information; thread extraction; vector space model approach; Computer mediated communication; Computer science; Data mining; Facsimile; Frequency; Information retrieval; Internet; Relays; Telephony; Yarn; chat; thread extraction; topic detection; vector space model;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Semantic Computing, 2008 IEEE International Conference on
Conference_Location :
Santa Clara, CA
Print_ISBN :
978-0-7695-3279-0
Electronic_ISBN :
978-0-7695-3279-0
Type :
conf
DOI :
10.1109/ICSC.2008.61
Filename :
4597251
Link To Document :
بازگشت