Title :
Scalable classification over SQL databases
Author :
Chaudhuri, Surajit ; Fayyad, Usama ; Bernhardt, Jeff
Author_Institution :
Microsoft Corp., Redmond, WA, USA
Abstract :
We identify data-intensive operations that are common to classifiers and develop a middleware that decomposes and schedules these operations efficiently using a backend SQL database. Our approach has the added advantage of not requiring any specialized physical data organization. We demonstrate the scalability characteristics of our enhanced client with experiments on Microsoft SQL Server 7.0 by varying data size, number of attributes and characteristics of decision trees
Keywords :
SQL; client-server systems; decision trees; relational databases; Microsoft SQL Server 7; SQL databases; backend SQL database; data size; data-intensive operations; decision trees; enhanced client; middleware; scalability characteristics; scalable classification; Classification algorithms; Classification tree analysis; Data mining; Databases; Decision trees; File servers; Middleware; Predictive models; Probability; Statistics;
Conference_Titel :
Data Engineering, 1999. Proceedings., 15th International Conference on
Conference_Location :
Sydney, NSW
Print_ISBN :
0-7695-0071-4
DOI :
10.1109/ICDE.1999.754963