Title : 
Connected pattern segmentation and title grouping in newspaper images
         
        
        
        
        
        
        
            Abstract : 
This paper presents an algorithm that performs automated segmentation and classification of newspaper images. The algorithm discusses a technique for segmenting components that are connected to other components and presents another technique to correctly group titles and subtitles. The algorithm uses a bottom-up approach to initially segment the image, classify patterns and extract text lines. The classified patterns are then merged into complete regions. The algorithm is tested on a set of complex newspaper images taken from the First International Newspaper Segmentation Contest, and the results are compared with the contest results.
         
        
            Keywords : 
document image processing; image classification; image segmentation; publishing; First International Newspaper Segmentation Contest; automated image classification; automated image segmentation; bottom-up method; complex newspaper images; newspaper image classification; pattern classification; pattern segmentation; subtitle grouping; text line extraction; title grouping; Flowcharts; Graphics; Image analysis; Image converters; Image segmentation; Information technology; Pattern classification; Smoothing methods; Testing; Text analysis;
         
        
        
        
            Conference_Titel : 
Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
         
        
        
            Print_ISBN : 
0-7695-2128-2
         
        
        
            DOI : 
10.1109/ICPR.2004.1334135