• DocumentCode
    2523525
  • Title

    A parallel automaton string matching with pre-hashing and root-indexing techniques for content filtering coprocessor

  • Author

    Tseng, Kuo-Kun ; Lin, Ying-Dar ; Lee, Tsern-Huei ; Lai, Yuan-Cheng

  • Author_Institution
    National Chiao Tung Univ., Taiwan
  • fYear
    2005
  • fDate
    23-25 July 2005
  • Firstpage
    113
  • Lastpage
    118
  • Abstract
    We propose a new parallel automaton string matching approach and its hardware architecture for content filtering coprocessor. This new approach can improve the average matching time of the parallel automaton with pre-hashing and root-indexing techniques. The pre-hashing technique uses a hashing function to verify quickly the text against the partial patterns in the automaton, and the root-indexing technique matches multiple bytes for the root state in one single matching. A popular automaton algorithm, Aho-Corasick (AC) is chosen to be implemented by adding the two techniques; we employ these two techniques in a memory efficient version of AC namely bitmap AC. For the average-case time, our approach improves bitmap AC by 494% and 224% speedup for URL and virus patterns, respectively. Since pre-hashing and root-indexing techniques can be concurrently executed with bitmap AC in the hardware, our proposed approach has the same worst-case time as bitmap AC.
  • Keywords
    automata theory; computer architecture; coprocessors; cryptography; file organisation; parallel algorithms; string matching; Aho-Corasick automaton; bitmap AC; content filtering coprocessor; hardware architecture; hashing function; parallel automaton string matching; prehashing; root-indexing; Automata; Coprocessors; Filtering algorithms; Hardware; Information filtering; Information filters; Intrusion detection; Matched filters; Pattern analysis; Pattern matching;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Application-Specific Systems, Architecture Processors, 2005. ASAP 2005. 16th IEEE International Conference on
  • ISSN
    2160-0511
  • Print_ISBN
    0-7695-2407-9
  • Type

    conf

  • DOI
    10.1109/ASAP.2005.8
  • Filename
    1540374