• DocumentCode
    3047023
  • Title

    Automatic text segmentation from complex background

  • Author

    Ye, Qixiang ; Gao, Wen ; Huang, Qingming

  • Author_Institution
    Graduate Sch. of Chinese Acad. of Sci., China
  • Volume
    5
  • fYear
    2004
  • fDate
    24-27 Oct. 2004
  • Firstpage
    2905
  • Abstract
    In this paper, we proposed an automatic method to segment text from complex background for recognition task. First, a rule-based sampling method is proposed to get portion of the text pixels. Then, the sampled pixels are used for training Gaussian mixture models of intensity and hue components in HSI color space. Finally, the trained GMMs together with the spatial connectivity information are used for segment all of text pixels form their background. We used the word recognition rate to evaluate the segmentation result. Experiments results show that the proposed algorithm can work fully automatically and performs much better than the traditional methods.
  • Keywords
    Gaussian processes; document image processing; image colour analysis; image recognition; image resolution; image sampling; image segmentation; Gaussian mixture model; HSI color space; automatic text segmentation; complex background; rule-based sampling method; spatial connectivity information; text pixel; word recognition rate; Clustering algorithms; Colored noise; Computers; Content based retrieval; Image recognition; Image retrieval; Image segmentation; Sampling methods; Text recognition; Videos;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing, 2004. ICIP '04. 2004 International Conference on
  • ISSN
    1522-4880
  • Print_ISBN
    0-7803-8554-3
  • Type

    conf

  • DOI
    10.1109/ICIP.2004.1421720
  • Filename
    1421720