• DocumentCode
    1869172
  • Title

    A novel approach for indexing Arabic documents through GPU computing

  • Author

    Sophoclis, Nermine N. ; Abdeen, M. ; El-Horbaty, El-Sayed M. ; Yagoub, M.

  • Author_Institution
    Ain Shams Univ., Cairo, Egypt
  • fYear
    2012
  • fDate
    April 29 2012-May 2 2012
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In contrast to English search engines, Arabic search engines did not have their fair share in modern studies despite the continuous growth of Arabic Internet users and data. Towards bridging the gap, this paper presents a novel indexing algorithm customized for Arabic documents. Our algorithm exploits the characteristics of the Arabic language to enhance indexing and lookup. Additionally, the algorithm utilizes the highly parallel architecture of the graphics processing unit to speed-up the indexing. Finally, we discuss some of the synchronization challenges we faced and the techniques we used to overcome them. The preliminary tests of our GPU-accelerated Arabic indexer show promising speed-up factors.
  • Keywords
    Internet; graphics processing units; indexing; natural language processing; search engines; synchronisation; text analysis; Arabic Internet data; Arabic Internet users; Arabic document indexing; Arabic language characteristics; Arabic search engine; GPU computing; GPU-accelerated Arabic indexer; graphics processing unit; indexing algorithm; indexing enhancement; indexing speed-up; lookup enhancement; parallel architecture; synchronization challenge; Graphics processing unit; Indexing; Internet; Kernel; Search engines; Synchronization; Arabic Indexer; Distributed/Parallel Information Retrieval; GPGPU; GPU synchronization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Electrical & Computer Engineering (CCECE), 2012 25th IEEE Canadian Conference on
  • Conference_Location
    Montreal, QC
  • ISSN
    0840-7789
  • Print_ISBN
    978-1-4673-1431-2
  • Electronic_ISBN
    0840-7789
  • Type

    conf

  • DOI
    10.1109/CCECE.2012.6334963
  • Filename
    6334963