Title :
An analytical approach for data preprocessing
Author :
Sreenivas, P. ; Srikrishna, C.V.
Author_Institution :
Dept. of MCA, PES Inst. of Technol., Bangalore, India
Abstract :
This paper revisits the preprocessing technique of Data Mining. A sequential flow diagram is proposed for different databases and data sources which are addressed through analysed framework. Through a case study and calculating cyclomatic complexity of different sequences of preprocessing the appropriateness and efficiency of proposed method is evaluated. It has been observed that right selection of an appropriate sequence in cleaning improves the data mining process by saving time taken for each step.
Keywords :
data mining; flowcharting; analytical approach; cyclomatic complexity calculation; data cleaning; data mining process improvement; data preprocessing; data sources; sequence selection; sequential flow diagram; Algorithm design and analysis; Cleaning; Data mining; Data preprocessing; Multimedia communication; Spatial databases; Data Mining; Data Preprocessing; Single source and Multi source problems; Web-log data; multimedia data; spatial data; stemming; textual data; time-series data; tokenization;
Conference_Titel :
Emerging Trends in Communication, Control, Signal Processing & Computing Applications (C2SPCA), 2013 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4799-1082-3
DOI :
10.1109/C2SPCA.2013.6749435