DocumentCode :
3122592
Title :
On Efficient Query Processing of Stream Counts on the Cell Processor
Author :
Thomas, Dina ; Bordawekar, Rajesh ; Aggarwal, Charu C. ; Yu, Philip S.
Author_Institution :
Stanford Univ., Palo Alto, CA
fYear :
2009
fDate :
March 29 2009-April 2 2009
Firstpage :
748
Lastpage :
759
Abstract :
In recent years, the sketch-based technique has been presented as an effective method for counting stream items on processors with limited storage and processing capabilities, such as the network processors. In this paper, we examine the implementation of a sketch-based counting algorithm on the heterogeneous multi-core Cell processor. Like the network processors, the Cell also contains on-chip special processors with limited local memories. These special processors enable parallel processing of stream items using short-vector data-parallel (SIMD) operations. We demonstrate that the inaccuracies of the estimates computed by straightforward adaptations of current sketch-based counting approaches are exacerbated by increased inaccuracies in approximating counts of low frequency items, and by the inherent space limitations of the Cell processor. To address these concerns, we implement a sketch-based counting algorithm, FCM, that is specifically adapted for the Cell processor architecture. FCM incorporates novel capabilities for improving estimation accuracy using limited space by dynamically identifying low- and high-frequency stream items, and using a variable number of hash functions per item as determined by an item´s current frequency phase. We experimentshortally demonstrate that with similar space consumption, FCM computes better frequency estimates of both the low- and high-frequency items than a naive parallelization of an existing stream counting algorithm. Using FCM as the kernel, our parallel algorithm is able to scale the over all performance linearly as well as improve the estimate accuracy as the number of processors is increased. Thus, this work demonstrates the importance of adapting the algorithm to the specifics of the underlying architecture.
Keywords :
microprocessor chips; query processing; multicore Cell processor; network processors; on-chip special processors; parallel algorithm; query processing; short-vector data-parallel operations; sketch-based counting algorithm; Acceleration; Algorithm design and analysis; Computer architecture; Concurrent computing; Coprocessors; Data engineering; Frequency estimation; Multicore processing; Query processing; USA Councils; Cell; Counting; Stream Processing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
ISSN :
1084-4627
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
Type :
conf
DOI :
10.1109/ICDE.2009.35
Filename :
4812451
Link To Document :
بازگشت