Title :
Using cluster computing to support automatic and dynamic database clustering
Author :
Guinepain, Sylvain ; Gruenwald, Le
Author_Institution :
Sch. of Comput. Sci., Univ. of Oklahoma Norman, Norman, OK
fDate :
Sept. 29 2008-Oct. 1 2008
Abstract :
Query response time is the number one metrics when it comes to database performance. Because of data proliferation, efficient access methods and data storage techniques have become increasingly critical to maintain an acceptable query response time. Retrieving data from disk is several orders of magnitude slower than retrieving it from memory, it is easy to see the direct correlation between query response time and the number of disk I/Os. One of the common ways to reduce disk I/Os and therefore improve query response time is database clustering, which is a process that partitions the database vertically (attribute clustering) and/or horizontally (record clustering). A clustering is optimized for a given set of queries. However in dynamic systems the queries change with time, the clustering in place becomes obsolete, and the database needs to be re-clustered dynamically. This paper presents an efficient algorithm for attribute clustering that dynamically and automatically generates attribute clusters based on closed item sets mined from the attributes sets found in the queries running against the database. The paper then discusses how this algorithm can be implemented using the cluster computing paradigm to reduce query response time even further through parallelism and data redundancy.
Keywords :
data handling; pattern clustering; query processing; attribute clustering; cluster computing; data parallelism; data proliferation; data redundancy; data retrieval; data storage; dynamic database clustering; query response; record clustering; Clustering algorithms; Computer science; Concurrent computing; Delay; Humans; Information retrieval; Memory; Operating systems; Parallel processing; Transaction databases;
Conference_Titel :
Cluster Computing, 2008 IEEE International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4244-2639-3
Electronic_ISBN :
1552-5244
DOI :
10.1109/CLUSTR.2008.4663800