DocumentCode :
1723999
Title :
Classification and metaclassification in large scale data mining application for estimation of software projects
Author :
Dzega, Dorota ; Pietruszkiewicz, Wieslaw
Author_Institution :
Fac. of Econ. & Inf. Technol., West Pomeranian Bus. Sch., Szczecin, Poland
fYear :
2010
Firstpage :
1
Lastpage :
6
Abstract :
In this article we present an application of Artificial Intelligence to the estimation of software projects. The research presented herein was based on several methods of classification and metaclassification. Due to the increasing significance of Open Source, we selected projects hosted on the leading platform for Open Source projects, Sourceforge.net. In the first part of the article, we describe the steps of data extraction, which was a large-scale task because the data source contained tens of tables and hundreds of fields that were originally gathered for use by a web-based project management system. Therefore, extraction of meaningful data required analysis of the database structure and transformation of sets of records into four datasets. These datasets were used to predict four factors important to project management, i.e. skills, time, costs and effectiveness. Later, we present the results of experiments that were performed using the C4.5, RandomTree and CART algorithms. In the final part of this article, we describe how boosting and bagging metaclassifiers were applied to improve the results, and we analyse the influence of their parameters on generalization ability and prediction accuracy.
Keywords :
Web services; artificial intelligence; data analysis; data mining; pattern classification; project management; public domain software; software management; tree data structures; CART algorithms; Open Source software; Web based system; data extraction; data metaclassification; databases structure; software project estimation; Accuracy; Bagging; Decision trees; Estimation; Software; Classification; Metaclassification; Software estimation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Cybernetic Intelligent Systems (CIS), 2010 IEEE 9th International Conference on
Conference_Location :
Reading
Print_ISBN :
978-1-4244-9023-3
Electronic_ISBN :
978-1-4244-9024-0
Type :
conf
DOI :
10.1109/UKRICIS.2010.5898136
Filename :
5898136