Title :
A Scalable Big Data Test Framework
Author :
Nan Li ; Escalona, Anthony ; Yun Guo ; Offutt, Jeff
Abstract :
This paper identifies three problems when testing software that uses Hadoop-based big data techniques. First, processing big data takes a long time. Second, big data is transferred and transformed among many services. Do we need to validate the data at every transition point? Third, how should we validate the transferred and transformed data? We are developing a novel big data test framework to address these problems. The test framework generates a small and representative data set from an original large data set using input space partition testing. Using this data set for development and testing would not hinder the continuous integration and delivery when using agile processes. The test framework also accesses and validates data at various transition points when data is transferred and transformed.
Keywords :
Big Data; parallel processing; program testing; Hadoop-based big data techniques; big data processing; big data test framework; data validation; input space partition testing; software testing; Big data; Clinical trials; Data mining; Data transfer; Databases; Software; Testing;
Conference_Titel :
Software Testing, Verification and Validation (ICST), 2015 IEEE 8th International Conference on
Conference_Location :
Graz
DOI :
10.1109/ICST.2015.7102619