DocumentCode
2626333
Title
A novel approach for blind source separation of mixed document images in Farsi scanned documents
Author
Ghanbarloo, Hossein ; Razzazi, Farbod ; Alirezaee, Shahpour
Author_Institution
Sci. & Res. Branch, Dept. of Electr. Eng., Islamic Azad Univ., Tehran, Iran
fYear
2010
fDate
26-28 Aug. 2010
Firstpage
133
Lastpage
140
Abstract
In the field of mixed scanned documents separation, various studies have been carried out to reduce one (or more) unwanted artifacts from the document. Most of the approaches are based on comparison of the front and back sides of the documents. In some cases, it has been suggested to analyze the colored images, however, because of the calculation complexity of the approaches, they are not very applicable in practical applications. Furthermore none of them are tested on Farsi documents. In this paper, an applicable approach to large size images is presented which is based on image block segmentation (mosaicing). The advantages of this approach are less memory usage, combining of simultaneous and ordinal blind source separation methods in order to increase their efficiency, reducing calculation complexity of the algorithm into twenty percents of the basic algorithm, and high stability in noisy images. In noiseless conditions, the average signal to noise ratio of the output images is obtained 29.25 db. Furthermore, all of these cases have been tested on Farsi official documents. By applying the suggested ideas, considerable accuracy is achieved in the results, at minimum time. In addition, various parameters of the proposed algorithm (e.g. the size of each block, appropriate initial point, and number of iterations) were optimized.
Keywords
blind source separation; computational complexity; document image processing; image colour analysis; image segmentation; Farsi scanned documents; blind source separation; calculation complexity; image block segmentation; image color analysis; image mosaicing; mixed document images; mixed scanned documents separation; Computational complexity; Equations; Image reconstruction; Mathematical model; Signal to noise ratio; Source separation; Blind source Separation; Independent component Analysis; background removing; component; feed-through; formatting; scanned documents; show-through;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Computer Communication and Processing (ICCP), 2010 IEEE International Conference on
Conference_Location
Cluj-Napoca
Print_ISBN
978-1-4244-8228-3
Electronic_ISBN
978-1-4244-8230-6
Type
conf
DOI
10.1109/ICCP.2010.5606449
Filename
5606449
Link To Document