• DocumentCode
    2147016
  • Title

    Mining a tea insect pests database

  • Author

    Samanta, Ranjit Kumar ; Ghosh, Indradeep

  • Author_Institution
    Dept. of Comput. Sci. & Applic., Univ. of North Bengal, Siliguri, India
  • fYear
    2012
  • fDate
    30-31 March 2012
  • Firstpage
    56
  • Lastpage
    60
  • Abstract
    Data mining techniques are being applied successfully in wide varieties of databases in order to extract useful information. This paper applies data mining techniques on a new tea insect pests database created on the basis of data available from different tea gardens of North Bengal districts of India. We describe different issues related to the development of a good data mining model in the present context. We propose a novel multiple imputation - reduced dimension - clustering approach. A bootstrap-based EMB algorithm performing multiple imputation for missing values; EM- clustering technique; Id3 and C4.5 for tree based classifications have been deployed in the study. The performance of the model is found satisfactory.
  • Keywords
    agricultural products; data mining; database management systems; pattern clustering; tree data structures; C4.5; EM-clustering; Id3; India; North Bengal districts; bootstrap-based EMB algorithm; clustering approach; data mining techniques; multiple imputation approach; reduced dimension approach; tea gardens; tea insect pests database mining; tree based classifications; Analytical models; Companies; Entropy; Image color analysis; Insects; Mathematics; Medical services; Classification; Clustering; Data mining; Missing data; Reduction; Tea insect pests;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Emerging Trends and Applications in Computer Science (NCETACS), 2012 3rd National Conference on
  • Conference_Location
    Shillong
  • Print_ISBN
    978-1-4577-0749-0
  • Type

    conf

  • DOI
    10.1109/NCETACS.2012.6203298
  • Filename
    6203298