DocumentCode :
3168965
Title :
Clustering oriented hashing based multiple string pattern matching algorithm
Author :
Kanuga, Punit
fYear :
2015
fDate :
19-20 March 2015
Firstpage :
1
Lastpage :
6
Abstract :
Multiple string pattern matching aims at searching all occurrence of pattern set P in large data set (or text) T. Previous paper presented a new hashing based adaptive algorithm which perform search in order O(n/k) where n is length of T and k is the minimum length among all patterns in set P. This paper presents an idea of clustering the pattern set prior to searching phase. Implementation shows search speed up by more than 400%. This case study consists of search set with more than 420,000 characters. It also identifies various factors which effects search time and establishes mathematical relationship among them. It helps to theoretically determine the speed up in search time in comparison to that taken by using non-clustering approach. Experimental result shows that practical results are synchronous with respect to theoretical predictions hence proves accuracy of the mathematical establishment.
Keywords :
computational complexity; file organisation; pattern clustering; string matching; clustering oriented hashing; hashing based adaptive algorithm; multiple string pattern matching algorithm; pattern clustering; Accuracy; Algorithm design and analysis; Approximation algorithms; Clustering algorithms; Computers; Mathematical model; Pattern matching; Accuracy Factor; Clustering; ConPair; ConPair Repeatation Ratio; Multiple Pattern Matching; Redundancy Check; Speed Up Ratio;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Circuit, Power and Computing Technologies (ICCPCT), 2015 International Conference on
Conference_Location :
Nagercoil
Type :
conf
DOI :
10.1109/ICCPCT.2015.7159288
Filename :
7159288
Link To Document :
بازگشت