DocumentCode
2923919
Title
A Hybrid Approach to Cleansing Software Measurement Data
Author
Khoshgoftaar, Taghi M. ; Van Hulse, Jason ; Seiffert, Chris
Author_Institution
Florida Atlantic Univ., Boca Raton, FL
fYear
2006
fDate
Nov. 2006
Firstpage
713
Lastpage
722
Abstract
Data is extremely important in empirical software engineering. Techniques that provide insight into potential anomalies or inaccuracies in a dataset are becoming an increasingly important way for a data analyst to cope with flawed data. We present a novel hybrid procedure for quantitative outcome correction along with controlled experiments using a real-world software measurement dataset to demonstrate the usefulness of our technique. Instances that are deemed to be noisy relative to the dependent variable, which represents the number of faults recorded in the program module, are cleansed by replacing the original value with a more appropriate alternative value
Keywords
data integrity; software quality; data analyst; dataset anomalies; dataset inaccuracies; empirical software engineering; flawed data; program module; quantitative outcome correction; real-world software measurement dataset; software measurement data cleansing; Application software; Data analysis; Data mining; Databases; Information systems; Machine learning; Mathematical model; Software engineering; Software measurement; Software quality;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence, 2006. ICTAI '06. 18th IEEE International Conference on
Conference_Location
Arlington, VA
ISSN
1082-3409
Print_ISBN
0-7695-2728-0
Type
conf
DOI
10.1109/ICTAI.2006.11
Filename
4031964
Link To Document