DocumentCode
2526700
Title
A novel method for text page up/down orientation detection based on punctuation marks
Author
Zhu, Min ; Liao, Ying Han ; Deng, Xue
Author_Institution
Comput. Centre, East China Normal Univ., Shanghai, China
fYear
2012
fDate
28-30 May 2012
Firstpage
1
Lastpage
6
Abstract
In this paper, we propose a novel method to determine upside whether a scanned text document is right side up or down. The text documents discussed here are limited to English, Chinese and Japanese where we find that the punctuation much marks located on the bottom of the text line have a more frequent occurrence than those on the top. Thus, by calculating the the number of punctuation marks on the bottom and top, the orientation of documents image can be detected. The experimental results demonstrate the effectiveness of the proposed method on 683 Chinese, English and Japanese document images. In the text only documents, 98% accuracy of orientation detection is achieved on the documents in three languages with higher performance in Chinese office document image. And even in office documents including tables and pictures and without text segmentation, 87.11% accuracy could be achieved in English documents, 88.52% in Chinese documents and 83.89% in Japanese documents.
Keywords
document image processing; natural languages; text detection; Chinese document; English document; Japanese document; document image orientation; punctuation marks; scanned text document; text page up/down orientation detection; Accuracy; Algorithm design and analysis; Colon; Conferences; Feature extraction; Image segmentation; Noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Cognitive Information Processing (CIP), 2012 3rd International Workshop on
Conference_Location
Baiona
Print_ISBN
978-1-4673-1877-8
Type
conf
DOI
10.1109/CIP.2012.6232919
Filename
6232919
Link To Document