Title :
Load Balancing Query Processing in Metric-Space Similarity Search
Author :
Gil-Costa, Veronica ; Marin, Mauricio
Author_Institution :
Yahoo! Res. Latin America, Univ. of San Luis Argentina, San Luis Argentina, Argentina
Abstract :
Metric-space similarity search has been proven suitable for searching large collections of complex objects such as images. A number of distributed index data structures and respective parallel query processing algorithms have been proposed for clusters of distributed memory processors. Previous work has shown that best performance is achieved when using global indexing as opposed to local indexing. However global indexing is prone to performance degradation when query load becomes unbalanced across processors. This paper proposes a query scheduling algorithm that solves this problem. It adaptively load balances processing of user queries that are dynamically skewed towards particular sections of the distributed index. Sections highly hit by queries can be kept replicated. Experimental results show that with 1%-10% replication performance improves significantly (e.g., 35%) under skewed work-loads.
Keywords :
data structures; distributed databases; distributed memory systems; indexing; query processing; resource allocation; search engines; complex objects; distributed index data structures; distributed memory processors; global indexing; load balancing query processing; metric-space similarity search; parallel query processing algorithms; query load; query scheduling algorithm; skewed work-loads; user queries; Clustering algorithms; Indexing; Program processors; Query processing; Scheduling algorithms; Synchronization; distributed metric space search; load balance;
Conference_Titel :
Cluster, Cloud and Grid Computing (CCGrid), 2012 12th IEEE/ACM International Symposium on
Conference_Location :
Ottawa, ON
Print_ISBN :
978-1-4673-1395-7
DOI :
10.1109/CCGrid.2012.30