Title :
Compiling text analytics queries to FPGAs
Author :
Polig, Raphael ; Atasu, Kubilay ; Giefers, Heiner ; Chiticariu, Laura
Author_Institution :
IBM Research - Zurich, Rueschlikon, Switzerland
Abstract :
Extracting information from unstructured text data is a compute-intensive task, and the performance of general-purpose processors cannot keep up with the rapid growth of textual data. We therefore discuss the use of FPGAs to perform large-scale text analytics. We present a framework consisting of a compiler and an operator library that generates a Verilog processing pipeline from a text analytics query specified in the Annotation Query Language (AQL). The operator library comprises a set of configurable modules that perform relational and extraction tasks and that the compiler assembles to represent a full annotation operator graph. Leveraging the nature of text processing, we show that most tasks can be performed in an efficient streaming fashion. We evaluate the performance, power consumption, and hardware utilization of our approach for a set of different queries compiled to a Stratix IV FPGA. Measurements show up to a 79-fold improvement in document throughput over a 64-threaded software implementation on a POWER7 server. Moreover, the accelerated system's energy efficiency is up to 85 times better.
Keywords :
field programmable gate arrays; program compilers; query languages; query processing; text analysis; AQL annotation query language; FPGA; POWER7 server; Stratix IV FPGA; Verilog processing pipeline; compiler; configurable modules; extraction task; full annotation operator graph representation; general-purpose processors; information extraction; large scale text analytics; operator library; relational task; text analytics query compilation; text processing; Acceleration; Database languages; Field programmable gate arrays; Hardware; Hardware design languages; Libraries; Software
Conference_Title :
Field Programmable Logic and Applications (FPL), 2014 24th International Conference on
Conference_Location :
Munich
DOI :
10.1109/FPL.2014.6927500