Title :
Partitioning techniques for fine-grained indexing
Author :
Wu, Eugene ; Madden, Samuel
Author_Institution :
CSAIL, MIT, Cambridge, MA, USA
Abstract :
Many data-intensive websites use databases that grow much faster than the rate that users access the data. Such growing datasets lead to ever-increasing space and performance overheads for maintaining and accessing indexes. Furthermore, there is often considerable skew with popular users and recent data accessed much more frequently. These observations led us to design Shinobi, a system which uses horizontal partitioning as a mechanism for improving query performance to cluster the physical data, and increasing insert performance by only indexing data that is frequently accessed. We present database design algorithms that optimally partition tables, drop indexes from partitions that are infrequently queried, and maintain these partitions as workloads change. We show a 60× performance improvement over traditionally indexed tables using a real-world query workload derived from a traffic monitoring application.
Keywords :
database indexing; pattern clustering; query processing; Shinobi system; data cluster; data-intensive Web sites; database design algorithm; fine-grained indexing; horizontal partitioning mechanism; insert performance; partition tables; query performance; query workload; Data models; Indexing; Optimization; Partitioning algorithms; Random access memory;
Conference_Titel :
Data Engineering (ICDE), 2011 IEEE 27th International Conference on
Conference_Location :
Hannover
Print_ISBN :
978-1-4244-8959-6
Electronic_ISBN :
1063-6382
DOI :
10.1109/ICDE.2011.5767830