DocumentCode :
2526292
Title :
An extended ID3 decision tree algorithm for spatial data
Author :
Sitanggang, Imas Sukaesih ; Yaakob, Razali ; Mustapha, Norwati ; Nuruddin, Ahmad Ainuddin B
Author_Institution :
Fac. of Comput. Sci. & Inf. Technol., Univ. Putra Malaysia, Serdang, Malaysia
fYear :
2011
fDate :
June 29 2011-July 1 2011
Firstpage :
48
Lastpage :
53
Abstract :
Utilizing data mining tasks such as classification on spatial data is more complex than those on non-spatial data. It is because spatial data mining algorithms have to consider not only objects of interest itself but also neighbours of the objects in order to extract useful and interesting patterns. One of classification algorithms namely the ID3 algorithm which originally designed for a non-spatial dataset has been improved by other researchers in the previous work to construct a spatial decision tree from a spatial dataset containing polygon features only. The objective of this paper is to propose a new spatial decision tree algorithm based on the ID3 algorithm for discrete features represented in points, lines and polygons. As in the ID3 algorithm that use information gain in the attribute selection, the proposed algorithm uses the spatial information gain to choose the best splitting layer from a set of explanatory layers. The new formula for spatial information gain is proposed using spatial measures for point, line and polygon features. Empirical result demonstrates that the proposed algorithm can be used to join two spatial objects in constructing spatial decision trees on small spatial dataset. The proposed algorithm has been applied to the real spatial dataset consisting of point and polygon features. The result is a spatial decision tree with 138 leaves and the accuracy is 74.72%.
Keywords :
data mining; decision trees; feature extraction; pattern classification; terrain mapping; visual databases; attribute selection; discrete feature representation; explanatory layer; extended ID3 decision tree algorithm; nonspatial dataset; pattern extraction; polygon features; spatial data classification; spatial data mining algorithm; spatial information gain; splitting layer; Classification algorithms; Data mining; Decision trees; Partitioning algorithms; Prediction algorithms; Rivers; Spatial databases; ID3 algorithm; spatial decision tree; spatial information gain; spatial measure; spatial relation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Spatial Data Mining and Geographical Knowledge Services (ICSDM), 2011 IEEE International Conference on
Conference_Location :
Fuzhou
Print_ISBN :
978-1-4244-8352-5
Type :
conf
DOI :
10.1109/ICSDM.2011.5969003
Filename :
5969003
Link To Document :
بازگشت