DocumentCode :
2630831
Title :
Language-free layout analysis
Author :
Ittner, D.J. ; Baird, H.S.
Author_Institution :
AT&T Bell Lab., Murray Hill, NJ, USA
fYear :
1993
fDate :
20-22 Oct 1993
Firstpage :
336
Lastpage :
340
Abstract :
A system for isolating blocks, lines, words, and symbols within images of machine-printed textual documents that is, to a large existent, independent of language and writing system is described. This is achieved by exploiting a small number of nearly universal typesetting and layout conventions. The system does not require prior knowledge of page orientation (module 90°), and copes well with nonzero skew and shear angles (within 10°). Also it locates blocks of text without reliance on detailed a priori layout models, and in spite of unknown or mixed horizontal and vertical text-line orientations. Within blocks, it infers text-line orientation and isolates lines, without knowledge of the language, symbol set, text sizes, or the number of text lines. Segmentation into words and symbols, and determination of reading order, normally require some knowledge of the language: this is held to minimum by relying on shape-driven algorithms. The underlying algorithms are based on Fourier theory, digital signal processing, computational geometry, and statistical decision theory. Most of the computation occurs within algorithms that possess unambiguous semantics (that is, heuristics are kept to a minimum). The effectiveness of the method on English, Japanese, Hebrew, Thai, and Korean documents is discussed
Keywords :
computational geometry; document handling; image segmentation; Fourier theory; computational geometry; digital signal processing; language free layout analysis; layout conventions; machine-printed textual documents; nearly universal typesetting; page orientation; reading order; shape-driven algorithms; statistical decision theory; symbol set; text lines; text sizes; unambiguous semantics; vertical text-line orientations; Asia; Decision theory; Digital signal processing; Graphics; Natural languages; Runtime; Signal processing algorithms; Testing; Typesetting; Writing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
Type :
conf
DOI :
10.1109/ICDAR.1993.395720
Filename :
395720
Link To Document :
بازگشت