DocumentCode :
1816467
Title :
PICCIL: Interactive Learning to Support Log File Categorization
Author :
Loewenstern, David ; Ma, Sheng ; Salahshour, Abdi
Author_Institution :
IBM TJ. Watson Res. & AC, Hawthorne, NY
fYear :
2005
fDate :
13-16 June 2005
Firstpage :
311
Lastpage :
312
Abstract :
Motivated by the real-world application of categorizing system log messages into defined situation categories, this paper describes an interactive text categorization method, PICCIL, that leverages supervised machine learning to reduce the burden of assigning categories to documents in large finite data sets but, by coupling human expertise to the machine learning, does so without sacrificing accuracy. PICCIL uses keywords and keyword rules both to preclassify documents and to assist in the manual process of grouping and reviewing documents. The reviewed documents, in turn, are used to refine the keyword rules iteratively to improve subsequent grouping and document review. We apply PICCIL to the problem of assigning semantic situation labels to the entries of a catalog of log events to support on-line labeling of log events
Keywords :
cataloguing; classification; document handling; file organisation; learning (artificial intelligence); message passing; PICCIL; category assignment; document classification; document grouping; finite data set; interactive learning; interactive text categorization; keyword rules; log file categorization; supervised machine learning; system log message categorization; Clustering methods; Humans; Labeling; Machine learning; Supervised learning; Text categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Autonomic Computing, 2005. ICAC 2005. Proceedings. Second International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7965-2276-9
Type :
conf
DOI :
10.1109/ICAC.2005.46
Filename :
1498078
Link To Document :
بازگشت