• DocumentCode
    729517
  • Title

    Systematic mapping study of missing values techniques in software engineering data

  • Author

    Idri, Ali ; Abnane, Ibtissam ; Abran, Alain

  • Author_Institution
    Software Project Manage. Res. Team, Mohamed V Univ., Rabat, Morocco
  • fYear
    2015
  • fDate
    1-3 June 2015
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Missing Values (MV) present a serious problem facing research in software engineering (SE) which is mainly based on statistical and/or data mining analysis of SE data. The simple method of dealing with MV is to ignore data with missing observations. This leads to losing valuable information and then obtaining biased results. Therefore, various techniques have been developed to deal adequately with MV, especially those based on imputation methods. In this paper, a systematic mapping study was carried out to summarize the existing techniques dealing with MV in SE datasets and to classify the selected studies according to six classification criteria: research type, research approach, MV technique, MV type, data types and MV objective. Publication channels and trends were also identified. As results, 35 papers concerning MV treatments of SE data were selected. This study shows an increasing interest in machine learning (ML) techniques especially the K-nearest neighbor algorithm (KNN) to deal with MV in SE datasets and found that most of the MV techniques are used to serve software development effort estimation techniques.
  • Keywords
    data mining; learning (artificial intelligence); software engineering; K-nearest neighbor algorithm; KNN; MV objective; MV technique; MV type; SE datasets; data mining analysis; data types; machine learning techniques; missing values techniques; research approach; research type; software development effort estimation techniques; software engineering data; systematic mapping study; Conferences; Data mining; Market research; Quality assessment; Software; Software engineering; Systematics; Machine learning; Missing values; Software engineering data; Systematic mapping study;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Engineering, Artificial Intelligence, Networking and Parallel/Distributed Computing (SNPD), 2015 16th IEEE/ACIS International Conference on
  • Conference_Location
    Takamatsu
  • Type

    conf

  • DOI
    10.1109/SNPD.2015.7176280
  • Filename
    7176280