DocumentCode :
1853634
Title :
Parallel Approximate Multi-Pattern Matching on Heterogeneous Cluster Systems
Author :
Zhong, Cheng ; Fan, Zeng ; Su, Defu
Author_Institution :
Sch. of Comput. & Electron. & Inf., Guangxi Univ., Nanning
fYear :
2008
fDate :
1-4 Dec. 2008
Firstpage :
74
Lastpage :
79
Abstract :
For the given multiple patterns and a text string, firstly, a perfect hash function is constructed, the patterns are transformed into the unique pairs of integer values in parallel by the perfect hash function, the corresponding integer values are stored in a global hash table, and a recursion expression for computing hash function value of the signatures of each sub-string of text is also proposed. Secondly, based on divisible load principle, a linear programming model for the optimal text distribution strategy is created and a parallel approximate multi-pattern matching algorithm allowing one error is presented on the heterogeneous cluster system which processors have different computing speeds and distinct communication capabilities and different memory sizes by taking into account computation and communication startup time and using the assigned processor distribution order. The experimental results on the cluster system of heterogeneous personal computers show that the presented parallel algorithm is averagely 25% faster than that one using the even text distribution strategy, and it obtains a nearly linear speedup and good scalability.
Keywords :
file organisation; linear programming; pattern clustering; pattern matching; text analysis; global hash table; hash function; heterogeneous cluster systems; linear programming; optimal text distribution; parallel approximate multipattern matching; recursion expression; text string; Application software; Clustering algorithms; Computer errors; Concurrent computing; Distributed computing; Distribution strategy; Linear programming; Microcomputers; Parallel algorithms; Pattern matching; approximate multi-pattern matching; divisible load; hashing; heterogeneous cluster systems; parallel algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Parallel and Distributed Computing, Applications and Technologies, 2008. PDCAT 2008. Ninth International Conference on
Conference_Location :
Otago
Print_ISBN :
978-0-7695-3443-5
Type :
conf
DOI :
10.1109/PDCAT.2008.23
Filename :
4710965
Link To Document :
بازگشت