Title :
Neural network-based proper names extraction in fax images
Author :
Azzabou, Noura ; Likforman-Sulem, Laurence
Author_Institution :
Ecole Nat. Superieure des Telecommun., CNRS, Paris, France
Abstract :
This paper addresses the extraction of the sender's name from fax cover pages through a machine learning scheme. Two analysis methods are implemented to work in parallel: the first is based on image document analysis (OCR, physical block selection), the second on text analysis (word feature extraction, local grammar rules). Our main contribution is the introduction of a neural network to find an optimal combination of the two approaches. Tests carried out on real fax images show that the neural network improves performance compared with an empirical combination function and with each method used separately.
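The combination step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the network architecture, the input features, or the form of the empirical combination function. The sketch assumes that each candidate word receives one score from the image analysis and one from the text analysis, uses a fixed weighted average as a stand-in for the empirical baseline, and trains a small one-hidden-layer perceptron on invented toy data. All names (`empirical_combination`, `TinyMLP`, `TOY_DATA`) are hypothetical.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def empirical_combination(image_score, text_score, weight=0.5):
    """Assumed baseline: fixed weighted average of the two method scores."""
    return weight * image_score + (1.0 - weight) * text_score


class TinyMLP:
    """One-hidden-layer perceptron: (image_score, text_score) -> score in (0, 1)."""

    def __init__(self, hidden=3, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-1, 1) for _ in range(2)] for _ in range(hidden)]
        self.b1 = [0.0] * hidden
        self.w2 = [rng.uniform(-1, 1) for _ in range(hidden)]
        self.b2 = 0.0

    def forward(self, x):
        # Hidden activations, then a single sigmoid output unit.
        h = [sigmoid(sum(w * xi for w, xi in zip(row, x)) + b)
             for row, b in zip(self.w1, self.b1)]
        y = sigmoid(sum(w * hj for w, hj in zip(self.w2, h)) + self.b2)
        return h, y

    def predict(self, x):
        return self.forward(x)[1]

    def loss(self, data):
        # Mean binary cross-entropy over the data set.
        eps = 1e-12
        total = 0.0
        for x, t in data:
            y = self.predict(x)
            total -= t * math.log(y + eps) + (1 - t) * math.log(1 - y + eps)
        return total / len(data)

    def train(self, data, epochs=500, lr=0.5):
        # Plain stochastic gradient descent with backpropagation.
        for _ in range(epochs):
            for x, t in data:
                h, y = self.forward(x)
                dy = y - t  # cross-entropy gradient at the output pre-activation
                for j, hj in enumerate(h):
                    dh = dy * self.w2[j] * hj * (1 - hj)  # uses w2 before its update
                    self.w2[j] -= lr * dy * hj
                    for i in range(2):
                        self.w1[j][i] -= lr * dh * x[i]
                    self.b1[j] -= lr * dh
                self.b2 -= lr * dy


# Invented toy data: ((image_score, text_score), 1 if the word is the sender's name).
TOY_DATA = [
    ((0.9, 0.8), 1), ((0.8, 0.3), 1), ((0.7, 0.6), 1),
    ((0.2, 0.9), 0), ((0.1, 0.2), 0), ((0.3, 0.4), 0),
]

if __name__ == "__main__":
    net = TinyMLP(seed=0)
    net.train(TOY_DATA)
    for x, t in TOY_DATA:
        print(x, t, round(empirical_combination(*x), 2), round(net.predict(x), 2))
```

In this toy setting the fixed average cannot adapt if one method is more reliable than the other, whereas the trained network learns the weighting from labeled examples; whether this mirrors the gain reported in the paper depends on details the abstract does not state.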
Keywords :
document image processing; facsimile; feature extraction; learning (artificial intelligence); multilayer perceptrons; optical character recognition; text analysis; word processing; OCR; fax cover pages; image document analysis; local grammar rules; machine learning; neural network; physical block selection; proper name extraction; real fax images; senders name extraction; text analysis; word feature extraction; Data mining; Electronic mail; Feature extraction; Image analysis; Intelligent networks; Machine learning; Neural networks; Optical character recognition software; Text analysis; Text recognition;
Conference_Title :
Pattern Recognition, 2004. ICPR 2004. Proceedings of the 17th International Conference on
Print_ISBN :
0-7695-2128-2
DOI :
10.1109/ICPR.2004.1334144