DocumentCode
2712533
Title
An application of spatial decision tree for classification of air pollution index
Author
Zhao, Minyue ; Li, Xiang
Author_Institution
Key Lab. of Geographic Inf. Sci., East China Normal Univ., Shanghai, China
fYear
2011
fDate
24-26 June 2011
Firstpage
1
Lastpage
6
Abstract
A decision tree is an analysis skill and a classification algorithm, whose basic principle is the combination of probability theory and an analysis tool of tree shapes. It derives a hierarchy of partition rules with respect to a target attribute of a large dataset. Nowadays, concrete coordinates exist in lots of datasets, which leads to the spatial distribution of datasets. However, conventional decision tree does not take the spatial distribution of records in the dataset into account, which makes it inadequate to deal with the geographical datasets. A number of new approaches to the analysis of geographical data have been proposed in recent years. In the purpose of evaluating the application of a spatial entropy-based decision tree, a spatial entropy-based decision tree that employed to classify the air pollution index (API) is presented in this paper. A spatial decision tree differs from a conventional tree in the way that it considers the spatial autocorrelation phenomena in the classification process. At each level of a spatial decision tree, the supporting attribute that gives the maximum spatial information gain is selected as a node. A case study oriented to the classification of API, whose study area is main cities in China, deals with the norms of the API, including density of total suspended particulate, density of SO2, density of NO2, and etc. After the process of data processing, and graphical analysis, it demonstrates a tree shape of the classification of the API and a map of the spatial distribution of the target attribute´s categories, which illustrate the practicability of spatial decision tree.
Keywords
air pollution; atmospheric composition; atmospheric techniques; decision trees; entropy; probability; China; NO2; SO2; air pollution index; data processing; geographical data; geographical datasets; graphical analysis; maximum spatial information gain; partition rules; probability theory; spatial autocorrelation phenomena; spatial distribution; spatial entropy-based decision tree; total suspended particulate; tree shapes; Cities and towns; Classification algorithms; Data mining; Decision trees; Entropy; Green products; Spatial databases; API; spatial decision tree; spatial entropy;
fLanguage
English
Publisher
ieee
Conference_Titel
Geoinformatics, 2011 19th International Conference on
Conference_Location
Shanghai
ISSN
2161-024X
Print_ISBN
978-1-61284-849-5
Type
conf
DOI
10.1109/GeoInformatics.2011.5981071
Filename
5981071
Link To Document