Title of article :
An OCR Free Method for Word Spotting in Printed Documents: the Evaluation of Different Feature Sets
Author/Authors :
Rios, Israel Pontifical Catholic University of Parana, Brazil , Britto Jr, Alceu de Souza Pontifical Catholic University of Parana, Brazil , Koerich, Alessandro Lameiras Pontifical Catholic University of Parana, Brazil , Oliveira, Luis Eduardo Soares Federal University of Parana, Brazil
Abstract :
An OCR free word spotting method is developed and evaluated under a strong experimental protocol. Different feature sets are evaluated under the same experimental conditions. In addition, a tuning process in the document segmentation step is proposed which provides a significant reduction in terms of processing time. For this purpose, a complete OCRfree method for word spotting in printed documents was implemented, and a document database containing document images and their corresponding ground truth text files was created. A strong experimental protocol based on 800 document images allows us to compare the results of the three feature sets used to represent the word image.
Keywords :
word spotting , document retrieval , word recognition
Journal title :
Journal of J.UCS (Journal of Universal Computer Science)
Journal title :
Journal of J.UCS (Journal of Universal Computer Science)