• DocumentCode
    2628601
  • Title

    Segmentation and classification for mixed text/image documents using neural network

  • Author

    Imade, Shinichi ; Tatsuta, Seiji ; Wada, Toshiaki

  • fYear
    1993
  • fDate
    20-22 Oct 1993
  • Firstpage
    930
  • Lastpage
    934
  • Abstract
    A segmentation and classification method for separating a document image into printed character, handwritten character, photograph, and painted image regions is presented. A document image is segmented into rectangular areas. Each of which contains a cluster of image elements. A layered feed-forward neural network is then used to classify each segmented area using the histograms of gradient vector directions and luminance levels. A high classification performance was obtained, even with a small number of training samples. It is confirmed that the histograms of gradient vector directions and luminance levels are significantly effective features for the classification of the four kinds of image regions. Increasing the number of the discrimination areas improves the classification performance sufficiently even using a small number of training samples for the neural network
  • Keywords
    Data compression; High speed optical techniques; Histograms; Image coding; Image converters; Image segmentation; Image storage; Neural networks; Optical imaging; Pixel;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
  • Conference_Location
    Tsukuba Science City
  • Print_ISBN
    0-8186-4960-7
  • Type

    conf

  • DOI
    10.1109/ICDAR.1993.395584
  • Filename
    395584