DocumentCode
13899
Title
Giving Text Analytics a Boost
Author
Polig, Raphael ; Atasu, Kubilay ; Chiticariu, Laura ; Hagleitner, Christoph ; Hofstee, H. Peter ; Reiss, Frederick R. ; Zhu, Hengliang ; Sitaridi, Eva
Volume
34
Issue
4
fYear
2014
fDate
July-Aug. 2014
Firstpage
6
Lastpage
14
Abstract
The amount of textual data has reached a new scale and continues to grow at an unprecedented rate. IBM´s SystemT software is a powerful text-analytics system that offers a query-based interface to reveal the valuable information that lies within these mounds of data. However, traditional server architectures are not capable of analyzing so-called big data efficiently, despite the high memory bandwidth that is available. The authors show that by using a streaming hardware accelerator implemented in reconfigurable logic, the throughput rates of the SystemT´s information extraction queries can be improved by an order of magnitude. They also show how such a system can be deployed by extending SystemT´s existing compilation flow and by using a multithreaded communication interface that can efficiently use the accelerator´s bandwidth.
Keywords
Big Data; multi-threading; query processing; reconfigurable architectures; text analysis; user interfaces; Big Data; IBM SystemT software; compilation flow; information extraction query-based interface; multithreaded communication interface; reconfigurable logic; streaming hardware accelerator bandwidth; text-analytics system; textual data; throughput rates; Big data; Computer architecture; Field programmable gate arrays; Instruction sets; Text mining; Text processing; big data; field-programmable gate array; hardware; hardware accelerator; heterogeneous system; text analytics;
fLanguage
English
Journal_Title
Micro, IEEE
Publisher
ieee
ISSN
0272-1732
Type
jour
DOI
10.1109/MM.2014.69
Filename
6871704
Link To Document