DocumentCode :
1941423
Title :
An efficient regular expressions compression algorithm from a new perspective
Author :
Liu, Tingwen ; Yang, Yifu ; Liu, Yanbing ; Sun, Yong ; Guo, Li
fYear :
2011
fDate :
10-15 April 2011
Firstpage :
2129
Lastpage :
2137
Abstract :
Deep packet inspection plays a increasingly important role in network security devices and applications, which use more regular expressions to depict patterns. DFA engine is usually used as a classical representation for regular expressions to perform pattern matching, because it only need O(1) time to process one input character. However, DFAs of regular expression sets require large amount of memory, which limits the practical application of regular expressions in high-speed networks. Some compression algorithms have been proposed to address this issue in recent literatures. In this paper, we reconsider this problem from a new perspective, namely observing the characteristic of transition distribution inside each state, which is different from previous algorithms that observe transition characteristic among states. Furthermore, we introduce a new compression algorithm which can reduce 95% memory usage of DFA stably without significant impact on matching speed. Moreover, our work is orthogonal to previous compression algorithms, such as D2FA, δFA. Our experiment results show that applying our work to them will have several times memory reduction, and matching speed of up to dozens of times comparing with original δFA in software implementation.
Keywords :
data compression; deterministic automata; finite automata; pattern matching; security of data; DFA engine; deep packet inspection; deterministic finite automata; network security; pattern matching; regular expression compression algorithm; Arrays; Clustering algorithms; Compression algorithms; Doped fiber amplifiers; Memory management; Pattern matching; Sparse matrices;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
INFOCOM, 2011 Proceedings IEEE
Conference_Location :
Shanghai
ISSN :
0743-166X
Print_ISBN :
978-1-4244-9919-9
Type :
conf
DOI :
10.1109/INFCOM.2011.5935024
Filename :
5935024
Link To Document :
بازگشت