• DocumentCode
    45889
  • Title

    Fast Regular Expression Matching Using Small TCAM

  • Author

    Meiners, Chad R. ; Patel, Jatin ; Norige, Eric ; Liu, Alex X. ; Torng, Eric

  • Author_Institution
    Michigan State Univ., East Lansing, MI, USA
  • Volume
    22
  • Issue
    1
  • fYear
    2014
  • fDate
    Feb. 2014
  • Firstpage
    94
  • Lastpage
    109
  • Abstract
    Regular expression (RE) matching is a core component of deep packet inspection in modern networking and security devices. In this paper, we propose the first hardware-based RE matching approach that uses ternary content addressable memory (TCAM), which is available as off-the-shelf chips and has been widely deployed in modern networking devices for tasks such as packet classification. We propose three novel techniques to reduce TCAM space and improve RE matching speed: transition sharing, table consolidation, and variable striding. We tested our techniques on eight real-world RE sets, and our results show that small TCAMs can be used to store large deterministic finite automata (DFAs) and achieve potentially high RE matching throughput. For space, we can store each of the corresponding eight DFAs with 25 000 states in a 0.59-Mb TCAM chip. Using a different TCAM encoding scheme that facilitates processing multiple characters per transition, we can achieve potential RE matching throughput of 10-19 Gb/s for each of the eight DFAs using only a single 2.36-Mb TCAM chip.
  • Keywords
    content-addressable storage; deterministic automata; finite automata; security of data; DFA; TCAM chip; TCAM encoding scheme; bit rate 10 Gbit/s to 19 Gbit/s; deep packet inspection; deterministic finite automata; fast regular expression matching; modern networking device; off-the-shelf chips; packet classification; security device; storage capacity 0.59 Mbit; storage capacity 2.36 Mbit; table consolidation; ternary content addressable memory; transition sharing; variable striding; Automata; Doped fiber amplifiers; Encoding; Inspection; Random access memory; Redundancy; Vegetation; Deep packet inspection; information security; intrusion detection; regular expression matching; security; ternary content addressable memory (TCAM);
  • fLanguage
    English
  • Journal_Title
    Networking, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6692
  • Type

    jour

  • DOI
    10.1109/TNET.2013.2256466
  • Filename
    6560431