Title :
Data mining: a tightly-coupled implementation on a parallel database server
Author :
Sousa, Mauro ; Mattoso, Marta ; Ebrecken, N.F.F.
Author_Institution :
COPPE, Fed. Univ. of Rio de Janeiro, Brazil
Abstract :
Due to the increasingly difficulty of discovering patterns in real-world databases using only conventional OLAP tools, an automated process such as data mining is currently essential. As data mining over large data sets can take a prohibitive amount of time related to the computational complexity of the algorithms, parallel processing has often been used as a solution. However, when data does not fit in memory, some solutions do not apply and a database system may be required rather than flat files. Most implementations use a database system loosely-coupled with the data mining algorithms. We address the data consuming activities through parallel processing and data fragmentation on the database server, providing a tight integration with data mining techniques. Experimental results show that the potential benefits of this integration were obtained, despite the difficulties of processing a complex application
Keywords :
file servers; knowledge acquisition; parallel programming; very large databases; OLAP tools; computational complexity; data fragmentation; data mining; parallel database server; parallel processing; Computational complexity; Concurrent computing; Data mining; Database systems; Decision trees; Machine learning algorithms; Parallel processing; Pattern analysis; Sampling methods; Spatial databases;
Conference_Titel :
Database and Expert Systems Applications, 1998. Proceedings. Ninth International Workshop on
Conference_Location :
Vienna
Print_ISBN :
0-8186-8353-8
DOI :
10.1109/DEXA.1998.707486