DocumentCode :
2401521
Title :
Word segmentation in a document image using spectral partitionin
Author :
Manikandan, V. ; Venkatachalam, V. ; Kirthiga, M. ; Harini, K. ; Devarajan, N.
Author_Institution :
Dept. of Electr. & Electron. Eng., Coimbatore Inst. of Technol., Coimbatore, India
fYear :
2010
fDate :
28-29 Dec. 2010
Firstpage :
1
Lastpage :
4
Abstract :
State of art document segmentation algorithms employ adhoc solutions which use some document properties and iteratively segment the document image. These solutions need to be adapted frequently and sometimes fail to perform well for complex scripts. This calls for a generalized solution that achieves a one shot segmentation that is globally optimal. This paper describes one such solution based on the optimization problem of spectral partitioning which makes the decision of proper segmentation based on the spectral properties of the pair wise similarity matrix. The solution described in the paper is shown to be general, global and closed form. The claims have been demonstrated on 142 page images from a Telugu book, in a script set in both poetry and prose layouts. This particular class of scripts has been proved to be challenging for the existing state of the art algorithms, where the proposed solution achieves significant results.
Keywords :
document image processing; image segmentation; optimisation; spectral analysis; word processing; Telugu book; document image segmentation; optimization problem; pair wise similarity matrix; spectral partition; word segmentation; Algorithm design and analysis; Image segmentation; Laplace equations; Layout; Optimization; Partitioning algorithms; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence and Computing Research (ICCIC), 2010 IEEE International Conference on
Conference_Location :
Coimbatore
Print_ISBN :
978-1-4244-5965-0
Electronic_ISBN :
978-1-4244-5967-4
Type :
conf
DOI :
10.1109/ICCIC.2010.5705792
Filename :
5705792
Link To Document :
بازگشت