• DocumentCode
    3021618
  • Title

    Independent component analysis segmentation algorithm

  • Author

    Chen, Yan ; Leedham, Graham

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore
  • fYear
    2005
  • fDate
    29 Aug.-1 Sept. 2005
  • Firstpage
    680
  • Abstract
    In this paper we propose and investigate a new segmentation algorithm called the ICA (independent component analysis) segmentation algorithm and compare it against other existing overlapping strokes segmentation algorithms. The ICA segmentation algorithm converts the original touching or overlapping word components into a blind source matrix and then calculates the weighted value matrix before the values are re-evaluated using a fast ICA model. The readjusted weighted value matrix is applied to the blind source matrix to separate the word components. The algorithm has been evaluated on 30 overlapped document images from the CEDAR letter database and another 30 degraded historical document images, which containing many different kinds of overlapping and touching words in adjacent lines. Quantitative analysis of the results by measuring text recall, and qualitative assessment of processed document image quality is reported. The ICA segmentation algorithm is demonstrated to be effective at resolving the problem in varying forms of overlapping or touching text lines.
  • Keywords
    document image processing; image segmentation; independent component analysis; blind source matrix; document images; independent component analysis segmentation; overlapping strokes segmentation; weighted value matrix; Cities and towns; Degradation; Flowcharts; Image analysis; Image databases; Image quality; Image segmentation; Independent component analysis; Matrix converters; Spatial databases;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
  • ISSN
    1520-5263
  • Print_ISBN
    0-7695-2420-6
  • Type

    conf

  • DOI
    10.1109/ICDAR.2005.140
  • Filename
    1575631