  • DocumentCode
    2013697
  • Title
    Appearance Based Models in Document Script Identification
  • Author
    Vikram, T.N. ; Guru, D.S.
  • Author_Institution
    Univ. of Mysore, Mysore
  • Volume
    2
  • fYear
    2007
  • fDate
    23-26 Sept. 2007
  • Firstpage
    709
  • Lastpage
    713
  • Abstract
    In this paper we employ appearance-based models for document script identification, at both the paragraph and the word level. Extensive experimentation shows that the models are robust enough to handle highly confusing scripts and that their performance does not degrade drastically even in the presence of noise. We also attempt generic script identification covering both Asian and European scripts, using a dataset of twenty different languages.
  • Keywords
    document image processing; natural language processing; Asian scripts; European scripts; appearance based models; confusing scripts; document script identification; generic script identification; Automation; Character recognition; Computer science; Covariance matrix; Degradation; Europe; Information management; Noise robustness; Principal component analysis; Sorting
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
  • Conference_Location
    Parana
  • ISSN
    1520-5363
  • Print_ISBN
    978-0-7695-2822-9
  • Type
    conf
  • DOI
    10.1109/ICDAR.2007.4377007
  • Filename
    4377007