Title :
Mining a tea insect pests database
Author :
Samanta, Ranjit Kumar ; Ghosh, Indradeep
Author_Institution :
Dept. of Comput. Sci. & Applic., Univ. of North Bengal, Siliguri, India
Abstract :
Data mining techniques are being applied successfully in wide varieties of databases in order to extract useful information. This paper applies data mining techniques on a new tea insect pests database created on the basis of data available from different tea gardens of North Bengal districts of India. We describe different issues related to the development of a good data mining model in the present context. We propose a novel multiple imputation - reduced dimension - clustering approach. A bootstrap-based EMB algorithm performing multiple imputation for missing values; EM- clustering technique; Id3 and C4.5 for tree based classifications have been deployed in the study. The performance of the model is found satisfactory.
Keywords :
agricultural products; data mining; database management systems; pattern clustering; tree data structures; C4.5; EM-clustering; Id3; India; North Bengal districts; bootstrap-based EMB algorithm; clustering approach; data mining techniques; multiple imputation approach; reduced dimension approach; tea gardens; tea insect pests database mining; tree based classifications; Analytical models; Companies; Entropy; Image color analysis; Insects; Mathematics; Medical services; Classification; Clustering; Data mining; Missing data; Reduction; Tea insect pests;
Conference_Titel :
Emerging Trends and Applications in Computer Science (NCETACS), 2012 3rd National Conference on
Conference_Location :
Shillong
Print_ISBN :
978-1-4577-0749-0
DOI :
10.1109/NCETACS.2012.6203298