• DocumentCode
    1796488
  • Title

    A machine learning technique for predicting the productivity of practitioners from individually developed software projects

  • Author

    Lopez-Martin, Cuauhtemoc ; Chavoya, Arturo ; Meda-Campana, Maria Elena

  • Author_Institution
    Inf. Syst. Dept. CUCEA, Univ. de Guadalajara Guadalajara, Jalisco, Mexico
  • fYear
    2014
  • fDate
    June 30 2014-July 2 2014
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Context: Productivity management of software developers is a challenge in Information and Communication Technology. Predictions of productivity can be useful to determine corrective actions and to assist managers in evaluating improvement alternatives. Productivity prediction models have been based on statistical regressions, statistical time series, fuzzy logic, and machine learning. Goal: To propose a machine learning model termed general regression neural network (GRNN) for predicting the productivity of software practitioners. Hypothesis: Prediction accuracy of a GRNN is better than a statistical regression model when these two models are applied for predicting productivity of software practitioners who have individually developed their software projects. Method: A sample obtained from 396 software projects developed between the years 2005 and 2011 by 99 practitioners was used for training the models, whereas a sample of 60 projects developed by 15 practitioners in the first months of 2012 was used for testing the models. All projects were developed based upon a disciplined development process within a controlled environment. The accuracy of the GRNN was compared against that of a multiple regression model (MLR). The criteria for evaluating the accuracy of these two models were the Magnitude of Error Relative to the estimate and a t-paired statistical test. Results: Prediction accuracy of an GRNN was statistically better than that of an MLR model at the 99% confidence level. Conclusion: An GRNN could be applied for predicting the productivity of practitioners when New and Changed lines of code, reused code, and programming language experience of practitioners are used as independent variables.
  • Keywords
    learning (artificial intelligence); neural nets; project management; regression analysis; software development management; statistical testing; time series; GRNN prediction accuracy; code reusage; error relative magnitude; fuzzy logic; general regression neural network; information-and-communication technology; machine learning technique; productivity management; productivity prediction models; programming language; software projects; statistical regression model; statistical time series; t-paired statistical test; Accuracy; Mathematical model; Neural networks; Predictive models; Productivity; Software; Testing; Software engineers productivity; general regression neural network; multiple statistical regression; software prediction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2014 15th IEEE/ACIS International Conference on
  • Conference_Location
    Las Vegas, NV
  • Type

    conf

  • DOI
    10.1109/SNPD.2014.6888690
  • Filename
    6888690