Title :
An efficient regular expressions compression algorithm from a new perspective
Author :
Liu, Tingwen ; Yang, Yifu ; Liu, Yanbing ; Sun, Yong ; Guo, Li
Abstract :
Deep packet inspection plays a increasingly important role in network security devices and applications, which use more regular expressions to depict patterns. DFA engine is usually used as a classical representation for regular expressions to perform pattern matching, because it only need O(1) time to process one input character. However, DFAs of regular expression sets require large amount of memory, which limits the practical application of regular expressions in high-speed networks. Some compression algorithms have been proposed to address this issue in recent literatures. In this paper, we reconsider this problem from a new perspective, namely observing the characteristic of transition distribution inside each state, which is different from previous algorithms that observe transition characteristic among states. Furthermore, we introduce a new compression algorithm which can reduce 95% memory usage of DFA stably without significant impact on matching speed. Moreover, our work is orthogonal to previous compression algorithms, such as D2FA, δFA. Our experiment results show that applying our work to them will have several times memory reduction, and matching speed of up to dozens of times comparing with original δFA in software implementation.
Keywords :
data compression; deterministic automata; finite automata; pattern matching; security of data; DFA engine; deep packet inspection; deterministic finite automata; network security; pattern matching; regular expression compression algorithm; Arrays; Clustering algorithms; Compression algorithms; Doped fiber amplifiers; Memory management; Pattern matching; Sparse matrices;
Conference_Titel :
INFOCOM, 2011 Proceedings IEEE
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-9919-9
DOI :
10.1109/INFCOM.2011.5935024