• DocumentCode
    573181
  • Title

    Hierarchical scheme for Arabic text recognition

  • Author

    Asi, Abedelkadir ; El-Sana, Jihad ; Märgner, Volker

  • Author_Institution
    Dept. of Comput. Sci., Ben-Gurion Univ. of the Negev, Beer-Sheva, Israel
  • fYear
    2012
  • fDate
    2-5 July 2012
  • Firstpage
    1266
  • Lastpage
    1271
  • Abstract
    The holistic approach for word recognition has become widely accepted in Arabic text recognition research. However, the large search space limits this approach to domains with small vocabularies. In this work, we present a novel approach to generate hierarchical representation for the shapes of Arabic continuous sub-words. The top levels of the hierarchy include the coarse representations of sub-words, and the low levels include the fine representations. The construction of the hierarchy is performed bottom up; the shapes at each level are simplified and classified to generate the next level. The search for an appropriate match is performed top down; at each level it traverses sub-trees, whose roots have the highest match rate.
  • Keywords
    character recognition; natural language processing; text analysis; text detection; Arabic text recognition; coarse representations; fine representations; large search space; sub-trees; the hierarchy include; word recognition; Character recognition; Feature extraction; Handwriting recognition; Hidden Markov models; Shape; Skeleton; Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science, Signal Processing and their Applications (ISSPA), 2012 11th International Conference on
  • Conference_Location
    Montreal, QC
  • Print_ISBN
    978-1-4673-0381-1
  • Electronic_ISBN
    978-1-4673-0380-4
  • Type

    conf

  • DOI
    10.1109/ISSPA.2012.6310486
  • Filename
    6310486