DocumentCode :
3071671
Title :
Using Speech Acts to Categorize Email and Identify Email Genres
Author :
Goldstein, Jade ; Sabin, Roberta Evans
Author_Institution :
U.S. Department of Defense
Volume :
3
fYear :
2006
fDate :
04-07 Jan. 2006
Abstract :
We define genres of email as well as a subset of "speech acts" relevant to email enhanced for email specific discourse. After creating a ground truth set of emails based on these email acts, we compare the performance of two classifiers (Random Forests and SVM-light) in identifying the primary communicative intent of the email and its corresponding genre. We experiment with using feature sets derived from two verb lexicons as well as a feature set containing selected characteristics of email. Results show better classifier accuracy using the verb lexicon with the smaller number of classes over the larger, and that using part of speech tagging to focus on selecting only verbs, causes a slight drop in performance. Using the email characteristics set alone results in better performance than either of the verb lexicons alone, but the best results are obtained using a combination of the smaller verb lexicon and the email characteristics set.
Keywords :
Availability; Computer science; Educational institutions; Electronic mail; Internet; Law; Legal factors; Speech analysis; Tagging; Writing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
System Sciences, 2006. HICSS '06. Proceedings of the 39th Annual Hawaii International Conference on
ISSN :
1530-1605
Print_ISBN :
0-7695-2507-5
Type :
conf
DOI :
10.1109/HICSS.2006.528
Filename :
1579389
Link To Document :
بازگشت