• DocumentCode
    13899
  • Title

    Giving Text Analytics a Boost

  • Author

    Polig, Raphael ; Atasu, Kubilay ; Chiticariu, Laura ; Hagleitner, Christoph ; Hofstee, H. Peter ; Reiss, Frederick R. ; Zhu, Hengliang ; Sitaridi, Eva

  • Volume
    34
  • Issue
    4
  • fYear
    2014
  • fDate
    July-Aug. 2014
  • Firstpage
    6
  • Lastpage
    14
  • Abstract
    The amount of textual data has reached a new scale and continues to grow at an unprecedented rate. IBM´s SystemT software is a powerful text-analytics system that offers a query-based interface to reveal the valuable information that lies within these mounds of data. However, traditional server architectures are not capable of analyzing so-called big data efficiently, despite the high memory bandwidth that is available. The authors show that by using a streaming hardware accelerator implemented in reconfigurable logic, the throughput rates of the SystemT´s information extraction queries can be improved by an order of magnitude. They also show how such a system can be deployed by extending SystemT´s existing compilation flow and by using a multithreaded communication interface that can efficiently use the accelerator´s bandwidth.
  • Keywords
    Big Data; multi-threading; query processing; reconfigurable architectures; text analysis; user interfaces; Big Data; IBM SystemT software; compilation flow; information extraction query-based interface; multithreaded communication interface; reconfigurable logic; streaming hardware accelerator bandwidth; text-analytics system; textual data; throughput rates; Big data; Computer architecture; Field programmable gate arrays; Instruction sets; Text mining; Text processing; big data; field-programmable gate array; hardware; hardware accelerator; heterogeneous system; text analytics;
  • fLanguage
    English
  • Journal_Title
    Micro, IEEE
  • Publisher
    ieee
  • ISSN
    0272-1732
  • Type

    jour

  • DOI
    10.1109/MM.2014.69
  • Filename
    6871704