Title :
Connected pattern segmentation and title grouping in newspaper images
Abstract :
This paper presents an algorithm that performs automated segmentation and classification of newspaper images. The algorithm discusses a technique for segmenting components that are connected to other components and presents another technique to correctly group titles and subtitles. The algorithm uses a bottom-up approach to initially segment the image, classify patterns and extract text lines. The classified patterns are then merged into complete regions. The algorithm is tested on a set of complex newspaper images taken from the First International Newspaper Segmentation Contest, and the results are compared with the contest results.
Keywords :
document image processing; image classification; image segmentation; publishing; First International Newspaper Segmentation Contest; automated image classification; automated image segmentation; bottom-up method; complex newspaper images; newspaper image classification; pattern classification; pattern segmentation; subtitle grouping; text line extraction; title grouping; Flowcharts; Graphics; Image analysis; Image converters; Image segmentation; Information technology; Pattern classification; Smoothing methods; Testing; Text analysis;
Conference_Titel :
Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
Print_ISBN :
0-7695-2128-2
DOI :
10.1109/ICPR.2004.1334135