DocumentCode :
3340016
Title :
Classification models for outbreak detection in oil and gas pollution area
Author :
Bakar, Afarulrazi Abu ; Idris, Norisma ; Hamdan, Abdul Razak ; Othman, Zulkifli ; Nazari, Mohd Zakri Ahmad ; Zainudin, S.
Author_Institution :
Center for Artificial Intell. Technol., Univ. Kebangsaan, Bangi, Malaysia
fYear :
2011
fDate :
17-19 July 2011
Firstpage :
1
Lastpage :
6
Abstract :
This study aim to investigate the data mining task and techniques specifically sequential pattern mining on the outbreak detection in oil and gas pollution area. The sequential pattern mining can be treated as a classification problem if enough data for certain sequence of time is available, as association problem if large number of related attributes are available, or can be seen as the deviation detection problem if the available data contain only few rare pattern or outliers. In this paper, the classification technique, decision tree is used for classification, and association rules mining is used for the outbreak detection task in oil and gas air dataset. The study found that unsupervised clustering using K-Means algorithm potentially obtain the rarely patterns of data distributing on several groups of pollutants and the average levels of supervised classification using the decision tree is a bit higher than the levels of association rules mining classification and appropriately used to classify the data by contaminants. Association rules mining on the other hand produce several sequences rules of contaminants. This study has high potential in producing quality rules for outbreak detection.
Keywords :
data mining; decision trees; environmental science computing; oil pollution; pattern classification; pattern clustering; pollution measurement; K-means algorithm; association rules mining classification; contaminants; data mining tasks; data mining techniques; decision tree; gas pollution outbreak detection; oil pollution outbreak detection; pattern classification; pollutants; sequential pattern mining; unsupervised clustering; Accuracy; Association rules; Classification algorithms; Diseases; Pollution; Training; Outbreak detection; association and sequential patterns; decision tree;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Electrical Engineering and Informatics (ICEEI), 2011 International Conference on
Conference_Location :
Bandung
ISSN :
2155-6822
Print_ISBN :
978-1-4577-0753-7
Type :
conf
DOI :
10.1109/ICEEI.2011.6021832
Filename :
6021832
Link To Document :
بازگشت