DocumentCode
45889
Title
Fast Regular Expression Matching Using Small TCAM
Author
Meiners, Chad R. ; Patel, Jatin ; Norige, Eric ; Liu, Alex X. ; Torng, Eric
Author_Institution
Michigan State Univ., East Lansing, MI, USA
Volume
22
Issue
1
fYear
2014
fDate
Feb. 2014
Firstpage
94
Lastpage
109
Abstract
Regular expression (RE) matching is a core component of deep packet inspection in modern networking and security devices. In this paper, we propose the first hardware-based RE matching approach that uses ternary content addressable memory (TCAM), which is available as off-the-shelf chips and has been widely deployed in modern networking devices for tasks such as packet classification. We propose three novel techniques to reduce TCAM space and improve RE matching speed: transition sharing, table consolidation, and variable striding. We tested our techniques on eight real-world RE sets, and our results show that small TCAMs can be used to store large deterministic finite automata (DFAs) and achieve potentially high RE matching throughput. For space, we can store each of the corresponding eight DFAs with 25 000 states in a 0.59-Mb TCAM chip. Using a different TCAM encoding scheme that facilitates processing multiple characters per transition, we can achieve potential RE matching throughput of 10-19 Gb/s for each of the eight DFAs using only a single 2.36-Mb TCAM chip.
Keywords
content-addressable storage; deterministic automata; finite automata; security of data; DFA; TCAM chip; TCAM encoding scheme; bit rate 10 Gbit/s to 19 Gbit/s; deep packet inspection; deterministic finite automata; fast regular expression matching; modern networking device; off-the-shelf chips; packet classification; security device; storage capacity 0.59 Mbit; storage capacity 2.36 Mbit; table consolidation; ternary content addressable memory; transition sharing; variable striding; Automata; Doped fiber amplifiers; Encoding; Inspection; Random access memory; Redundancy; Vegetation; Deep packet inspection; information security; intrusion detection; regular expression matching; security; ternary content addressable memory (TCAM);
fLanguage
English
Journal_Title
Networking, IEEE/ACM Transactions on
Publisher
ieee
ISSN
1063-6692
Type
jour
DOI
10.1109/TNET.2013.2256466
Filename
6560431
Link To Document