• DocumentCode
    2712533
  • Title

    An application of spatial decision tree for classification of air pollution index

  • Author

    Zhao, Minyue ; Li, Xiang

  • Author_Institution
    Key Lab. of Geographic Inf. Sci., East China Normal Univ., Shanghai, China
  • fYear
    2011
  • fDate
    24-26 June 2011
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    A decision tree is an analysis skill and a classification algorithm, whose basic principle is the combination of probability theory and an analysis tool of tree shapes. It derives a hierarchy of partition rules with respect to a target attribute of a large dataset. Nowadays, concrete coordinates exist in lots of datasets, which leads to the spatial distribution of datasets. However, conventional decision tree does not take the spatial distribution of records in the dataset into account, which makes it inadequate to deal with the geographical datasets. A number of new approaches to the analysis of geographical data have been proposed in recent years. In the purpose of evaluating the application of a spatial entropy-based decision tree, a spatial entropy-based decision tree that employed to classify the air pollution index (API) is presented in this paper. A spatial decision tree differs from a conventional tree in the way that it considers the spatial autocorrelation phenomena in the classification process. At each level of a spatial decision tree, the supporting attribute that gives the maximum spatial information gain is selected as a node. A case study oriented to the classification of API, whose study area is main cities in China, deals with the norms of the API, including density of total suspended particulate, density of SO2, density of NO2, and etc. After the process of data processing, and graphical analysis, it demonstrates a tree shape of the classification of the API and a map of the spatial distribution of the target attribute´s categories, which illustrate the practicability of spatial decision tree.
  • Keywords
    air pollution; atmospheric composition; atmospheric techniques; decision trees; entropy; probability; China; NO2; SO2; air pollution index; data processing; geographical data; geographical datasets; graphical analysis; maximum spatial information gain; partition rules; probability theory; spatial autocorrelation phenomena; spatial distribution; spatial entropy-based decision tree; total suspended particulate; tree shapes; Cities and towns; Classification algorithms; Data mining; Decision trees; Entropy; Green products; Spatial databases; API; spatial decision tree; spatial entropy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Geoinformatics, 2011 19th International Conference on
  • Conference_Location
    Shanghai
  • ISSN
    2161-024X
  • Print_ISBN
    978-1-61284-849-5
  • Type

    conf

  • DOI
    10.1109/GeoInformatics.2011.5981071
  • Filename
    5981071