• DocumentCode
    419524
  • Title

    Neural network-based proper names extraction in fax images

  • Author

    Azzabou, Noura ; Likforman-Sulem, Laurence

  • Author_Institution
    Ecole Nat. Superieure des Telecommun., CNRS, Paris, France
  • Volume
    1
  • fYear
    2004
  • fDate
    23-26 Aug. 2004
  • Firstpage
    421
  • Abstract
    In this paper, we are interested in the sender´s name extraction in fax cover pages through a machine learning scheme. For this purpose, two analysis methods are implemented to work in parallel. The first one is based on image document analysis (OCR recognition, physical block selection), the other on text analysis (word feature extraction, local grammar rules). Our main contribution consisted in introducing a neural network to find an optimal combination of the two approaches. Tests carried on real fax images show that the neural network improves performance compared to an empirical combination function and to each method used separately.
  • Keywords
    document image processing; facsimile; feature extraction; learning (artificial intelligence); multilayer perceptrons; optical character recognition; text analysis; word processing; OCR; fax cover pages; image document analysis; local grammar rules; machine learning; neural network; physical block selection; proper name extraction; real fax images; senders name extraction; text analysis; word feature extraction; Data mining; Electronic mail; Feature extraction; Image analysis; Intelligent networks; Machine learning; Neural networks; Optical character recognition software; Text analysis; Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-2128-2
  • Type

    conf

  • DOI
    10.1109/ICPR.2004.1334144
  • Filename
    1334144