• DocumentCode
    2213358
  • Title

    Automation Framework for Large-Scale Regular Expression Matching on FPGA

  • Author

    Ganegedara, Thilan ; Yang, Yi-Hua E. ; Prasanna, Viktor K.

  • Author_Institution
    Dept. of Electr. Eng., Univ. of Southern California, Los Angeles, CA, USA
  • fYear
    2010
  • fDate
    Aug. 31 2010-Sept. 2 2010
  • Firstpage
    50
  • Lastpage
    55
  • Abstract
    We present an extensible automation framework for constructing and optimizing large-scale regular expression matching (REM) circuits on FPGA. Paralleling the technique used by software compilers, we divide our framework into two parts: a frontend that parses each PCRE-formatted regular expression (regex) into a modular non-deterministic finite automaton (RE-NFA), followed by a backend that generates the REM circuit design for a multi-pipeline architecture. With such organization, various pattern and circuit level optimizations can be applied to the frontend and backend, respectively. The multi-pipeline architecture utilizes both logic slices and on-chip BRAM for optimized character matching; in addition, it can be configured at compile-time to produce concurrent matching outputs from multiple RE-NFAs. Our framework prototype handles up to 64k "regular" regexes with arbitrary complexity and number of states, limited only by the hardware resources of the target device. Running on a commodity 2.3 GHz PC (AMD Opteron 1356), it takes less than a minute for the framework to convert ~1800 regexes used by the Snort IDS into RTL-level designs with optimized logic and memory usage. Such an automation framework could be invaluable to REM systems to update regex definitions with minimal human intervention.
  • Keywords
    circuit optimisation; field programmable gate arrays; finite state machines; logic design; parallel architectures; program compilers; FPGA; REM circuit design; RTL level design; character matching; circuit level optimization; concurrent matching; large-scale regular expression matching circuits; logic slices; multipipeline architecture; nondeterministic finite automaton; on-chip BRAM; pattern level optimization; FPGA; NFA; Regular expression; circuit-level optimization; finite state machine; non-deterministic finite automata; pattern-level optimization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Field Programmable Logic and Applications (FPL), 2010 International Conference on
  • Conference_Location
    Milano
  • ISSN
    1946-1488
  • Print_ISBN
    978-1-4244-7842-2
  • Type

    conf

  • DOI
    10.1109/FPL.2010.21
  • Filename
    5694220