Title :
Layout analysis of book pages
Author :
Green, Ron ; Oliver, Chad
Author_Institution :
Dept. of Comput. Sci. & Software Eng., Univ. of Canterbury, Christchurch, New Zealand
Abstract :
A method is proposed for analysing the geometric and logical structure of pages in a typical single-column book. A Gaussian blur combined with thresholding is used to form connected components which nominally represent words. A bottom-up nearest-neighbour approach is used to find textual lines, and a manually-defined line length parameter is used to remove marginal noise and find the page frame. A state machine is used to group lines and label them according to function. The proposed method is able to correctly segment and label 99.82% of all targeted features in a set of 196 sample pages. Of the sixteen errors encountered in the sample pages, eleven are instances where adjacent lines have been merged together, four are instances where paragraphs have been split in half, and the remaining error was caused by a header element being detected as part of the body text.
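The first step of the abstract (Gaussian blur, then thresholding, then connected components that nominally represent words) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, the toy image, the value sigma = 1.0 and the threshold 0.3 are all assumptions chosen for illustration, and ink pixels are assumed to be 1.0 on a 0.0 background.

```python
import math

def gaussian_kernel(sigma):
    """Normalised 1-D Gaussian kernel truncated at about 3 sigma (assumed cutoff)."""
    radius = max(1, math.ceil(3 * sigma))
    weights = [math.exp(-(i * i) / (2.0 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]

def blur_rows(image, kernel):
    """Convolve each row with the kernel; pixels outside the image count as 0."""
    radius = len(kernel) // 2
    width = len(image[0])
    out = []
    for row in image:
        new_row = []
        for x in range(width):
            acc = 0.0
            for k, w in enumerate(kernel):
                xx = x + k - radius
                if 0 <= xx < width:
                    acc += w * row[xx]
            new_row.append(acc)
        out.append(new_row)
    return out

def transpose(image):
    return [list(col) for col in zip(*image)]

def gaussian_blur(image, sigma):
    """Separable Gaussian blur: filter the rows, then the columns."""
    kernel = gaussian_kernel(sigma)
    horizontal = blur_rows(image, kernel)
    return transpose(blur_rows(transpose(horizontal), kernel))

def threshold(image, level):
    """Binary mask: True where the pixel value exceeds the level."""
    return [[value > level for value in row] for row in image]

def label_components(mask):
    """Bounding boxes (x0, y0, x1, y1) of 8-connected regions, via flood fill."""
    height, width = len(mask), len(mask[0])
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            if not mask[y][x] or seen[y][x]:
                continue
            seen[y][x] = True
            stack = [(x, y)]
            x0 = x1 = x
            y0 = y1 = y
            while stack:
                cx, cy = stack.pop()
                x0, x1 = min(x0, cx), max(x1, cx)
                y0, y1 = min(y0, cy), max(y1, cy)
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        nx, ny = cx + dx, cy + dy
                        if (0 <= nx < width and 0 <= ny < height
                                and mask[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            stack.append((nx, ny))
            boxes.append((x0, y0, x1, y1))
    return boxes

def word_boxes(image, sigma, level):
    """Blur, threshold, then label: each box nominally covers one word."""
    return label_components(threshold(gaussian_blur(image, sigma), level))

# Toy text line (hypothetical data): letters are 2 px wide with 1 px gaps,
# and the two "words" are separated by a 6 px gap.
LINE = "..##.##.##......##.##.."
BLANK = "." * len(LINE)
IMAGE = [[1.0 if ch == "#" else 0.0 for ch in row]
         for row in (BLANK, LINE, LINE, LINE, BLANK)]

letters = label_components(threshold(IMAGE, 0.3))   # no blur: one box per letter
words = word_boxes(IMAGE, sigma=1.0, level=0.3)     # blur bridges the letter gaps
print(len(letters), len(words))
```

The blur spreads ink across the narrow gaps between letters but not across the wider gaps between words, so thresholding the blurred image merges the letters of each word into one component. In practice sigma and the threshold would have to be tuned to the scan resolution and font size.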
Keywords :
Gaussian processes; digital preservation; finite state machines; geometry; image denoising; optical character recognition; text analysis; Gaussian blur; body text; book page layout analysis; bottom-up nearest-neighbour approach; header element; logical page structure; manually-defined line length parameter; marginal noise removal; page geometric structure; single-column book; state machine; textual line finding; Algorithm design and analysis; Clustering algorithms; Image segmentation; Kernel; Layout; Noise; Sections; Geometric Layout; Logical Layout; OCR Preprocessing; Skew Detection; Structure Detection;
Conference_Title :
Image and Vision Computing New Zealand (IVCNZ), 2013 28th International Conference of
Conference_Location :
Wellington
Print_ISBN :
978-1-4799-0882-0
DOI :
10.1109/IVCNZ.2013.6727002