مرکز منطقه ای اطلاع رساني علوم و فناوري - Scalable multi-pipeline architecture for high performance multi-pattern string matching

DocumentCode :

2439778

Title :

Scalable multi-pipeline architecture for high performance multi-pattern string matching

Author :

Jiang, Weirong ; Yang, Yi-Hua E. ; Prasanna, Viktor K.

Author_Institution :

Ming Hsieh Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA

fYear :

2010

fDate :

19-23 April 2010

Firstpage :

Lastpage :

Abstract :

Multi-pattern string matching remains a major performance bottleneck in network intrusion detection and anti-virus systems for high-speed deep packet inspection (DPI). Although Aho-Corasick deterministic finite automaton (AC-DFA) based solutions produce deterministic throughput and are widely used in today´s DPI systems such as Snort [1] and ClamAV [2], the high memory requirement of AC-DFA (due to the large number of state transitions in AC-DFA) inhibits efficient hardware implementation to achieve high performance. Some recent work [3], [4] has shown that the AC-DFA can be reduced to a character trie that contains only the forward transitions by incorporating pipelined processing. But they have limitations in either handling long patterns or extensions to support multi-character input per clock cycle to achieve high throughput. This paper generalizes the problem and proves formally that a linear pipeline with H stages can remove all cross transitions to the top H levels of a AC-DFA. A novel and scalable pipeline architecture for memory-efficient multi-pattern string matching is then presented. The architecture can be easily extended to support multi-character input per clock cycle by mapping a compressed AC-DFA [5] onto multiple pipelines. Simulation using Snort and ClamAV pattern sets shows that a 8-stage pipeline can remove more than 99% of the transitions in the original AC-DFA. The implementation on a state-of-the-art field programmable gate array (FPGA) shows that our architecture can store on a single FPGA device the full set of string patterns from the latest Snort rule set. Our FPGA implementation sustains 10+ Gbps throughput, while consuming a small amount of on-chip logic resources. Also desirable scalability is achieved: the increase in resource requirement of our solution is sub-linear with the throughput improvement.

Keywords :

field programmable gate arrays; finite automata; pipeline processing; security of data; string matching; Aho-Corasick deterministic finite automaton; ClamAV; Snort; anti-virus systems; field programmable gate array; high performance multipattern string matching; high-speed deep packet inspection; linear pipeline; memory-efficient multipattern string matching; multicharacter input; network intrusion detection; pipelined processing; scalable multipipeline architecture; scalable pipeline architecture; Automata; Clocks; Field programmable gate arrays; Hardware; Inspection; Intrusion detection; Memory architecture; Pipelines; Programmable logic arrays; Throughput; DFA; Deep packet inspection; FPGA; pipeline; string matching;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Parallel & Distributed Processing (IPDPS), 2010 IEEE International Symposium on

Conference_Location :

Atlanta, GA

ISSN :

1530-2075

Print_ISBN :

978-1-4244-6442-5

Type :

conf

DOI :

10.1109/IPDPS.2010.5470374

Filename :

5470374

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2439778