• DocumentCode
    2923919
  • Title

    A Hybrid Approach to Cleansing Software Measurement Data

  • Author

    Khoshgoftaar, Taghi M. ; Van Hulse, Jason ; Seiffert, Chris

  • Author_Institution
    Florida Atlantic Univ., Boca Raton, FL
  • fYear
    2006
  • fDate
    Nov. 2006
  • Firstpage
    713
  • Lastpage
    722
  • Abstract
    Data is extremely important in empirical software engineering. Techniques that provide insight into potential anomalies or inaccuracies in a dataset are becoming an increasingly important way for a data analyst to cope with flawed data. We present a novel hybrid procedure for quantitative outcome correction along with controlled experiments using a real-world software measurement dataset to demonstrate the usefulness of our technique. Instances that are deemed to be noisy relative to the dependent variable, which represents the number of faults recorded in the program module, are cleansed by replacing the original value with a more appropriate alternative value
  • Keywords
    data integrity; software quality; data analyst; dataset anomalies; dataset inaccuracies; empirical software engineering; flawed data; program module; quantitative outcome correction; real-world software measurement dataset; software measurement data cleansing; Application software; Data analysis; Data mining; Databases; Information systems; Machine learning; Mathematical model; Software engineering; Software measurement; Software quality;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence, 2006. ICTAI '06. 18th IEEE International Conference on
  • Conference_Location
    Arlington, VA
  • ISSN
    1082-3409
  • Print_ISBN
    0-7695-2728-0
  • Type

    conf

  • DOI
    10.1109/ICTAI.2006.11
  • Filename
    4031964