DocumentCode
625673
Title
Hardware-Accelerated Regular Expression Matching with Overlap Handling on IBM PowerEN Processor
Author
Atasu, Kubilay ; Doerfler, Florian ; van Lunteren, Jan ; Hagleitner, Christoph
Author_Institution
IBM Res. - Zurich, Zurich, Switzerland
fYear
2013
fDate
20-24 May 2013
Firstpage
1254
Lastpage
1265
Abstract
Programmable hardware accelerators for regular expression (regex) matching are evolving into increasingly complex stream processors, which involve multiple state machines that operate in parallel, and specialized post-processors that can process instructions dispatched by the state machines. To improve the speed and the storage-efficiency, complex regexs are decomposed into simpler subexpressions, where each subexpression can fire one or more instructions. Although the impact of regex decompositions on the storage efficiency is well-known, little has been done to address the correctness and completeness. We show that regex decompositions can result in false positives if overlaps between subexpressions are not taken into account. We describe formal methods to recognize various types of subexpression overlaps that can arise in regex decompositions. We also describe efficient post-processing techniques to eliminate the associated false positives. To enable efficient mapping of the decomposed regexs to the postprocessors, we propose integer programming based register allocation methods. Our methods pack narrow variables to reduce the register and instruction usage, and take advantage of multi-register reset instructions to reduce the number of instructions that must be executed in parallel. Experiments on regex sets obtained from open-source and proprietary network intrusion detection systems demonstrate orders of magnitude improvement in the storage efficiency over state-of-the-art.
Keywords
finite state machines; integer programming; microprocessor chips; IBM PowerENTM Processor; complex stream processors; formal methods; hardware accelerated regular expression matching; integer programming; multiple state machines; overlap handling; post-processing techniques; programmable hardware accelerators; register allocation methods; storage efficiency; Automata; Doped fiber amplifiers; Hardware; Pattern matching; Radiation detectors; Registers; Throughput; automata theory; compilers; hardware accelerators; integer programming; regular expression matching;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel & Distributed Processing (IPDPS), 2013 IEEE 27th International Symposium on
Conference_Location
Boston, MA
ISSN
1530-2075
Print_ISBN
978-1-4673-6066-1
Type
conf
DOI
10.1109/IPDPS.2013.54
Filename
6569901
Link To Document