• DocumentCode
    265988
  • Title

    A fully visual based business document classification system

  • Author

    Infantino, Ignazio ; Maniscalco, Umberto ; Stabile, Dario ; Vella, Filippo

  • Author_Institution
    Inst. of High-Performance Comput. & Networking - ICAR, Palermo, Italy
  • fYear
    2014
  • fDate
    27-29 Aug. 2014
  • Firstpage
    339
  • Lastpage
    344
  • Abstract
    A fully visual approach for business documents classification is presented. The paper describes how SURF visual features, extracted from the documents, can be usefully used for business document recognition and their classification. Some of the extracted features are used to compute a prototype aiming at speed up the comparison of a document class while obtaining the best recognition rate. Moreover, we can determine which features are relevant and we can select zones of interest in the documents. Experimental setup has been performed on a set of real business documents of different typologies and companies. We tested also the robustness of our approach adding artificial defects and noise to the original documents and classifying them taking into account exclusively visual and graphical features. The capability of documents classification without any kind of text analysis has the great advantage to make the system totally independent from the idiom.
  • Keywords
    business data processing; document image processing; feature extraction; image recognition; text analysis; SURF visual features; artificial defects; business document recognition; extracted features; fully visual based business document classification system; graphical features; recognition rate; text analysis; typologies; visual features; Companies; Feature extraction; Layout; Prototypes; Robustness; Visualization; document classification; image features extraction; image processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Science and Information Conference (SAI), 2014
  • Conference_Location
    London
  • Print_ISBN
    978-0-9893-1933-1
  • Type

    conf

  • DOI
    10.1109/SAI.2014.6918208
  • Filename
    6918208