• DocumentCode
    2526700
  • Title

    A novel method for text page up/down orientation detection based on punctuation marks

  • Author

    Zhu, Min ; Liao, Ying Han ; Deng, Xue

  • Author_Institution
    Comput. Centre, East China Normal Univ., Shanghai, China
  • fYear
    2012
  • fDate
    28-30 May 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    In this paper, we propose a novel method to determine upside whether a scanned text document is right side up or down. The text documents discussed here are limited to English, Chinese and Japanese where we find that the punctuation much marks located on the bottom of the text line have a more frequent occurrence than those on the top. Thus, by calculating the the number of punctuation marks on the bottom and top, the orientation of documents image can be detected. The experimental results demonstrate the effectiveness of the proposed method on 683 Chinese, English and Japanese document images. In the text only documents, 98% accuracy of orientation detection is achieved on the documents in three languages with higher performance in Chinese office document image. And even in office documents including tables and pictures and without text segmentation, 87.11% accuracy could be achieved in English documents, 88.52% in Chinese documents and 83.89% in Japanese documents.
  • Keywords
    document image processing; natural languages; text detection; Chinese document; English document; Japanese document; document image orientation; punctuation marks; scanned text document; text page up/down orientation detection; Accuracy; Algorithm design and analysis; Colon; Conferences; Feature extraction; Image segmentation; Noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Cognitive Information Processing (CIP), 2012 3rd International Workshop on
  • Conference_Location
    Baiona
  • Print_ISBN
    978-1-4673-1877-8
  • Type

    conf

  • DOI
    10.1109/CIP.2012.6232919
  • Filename
    6232919