Title :
Study of embedded font context and kernel space methods for improved videotext recognition
Author :
Aradhye, Hrishikesh ; Dorai, Chitra ; Shim, Jae-Chang
Author_Institution :
Ohio State Univ., Columbus, OH, USA
Abstract :
Videotext refers to text superimposed on video frames. A videotext based multimedia description scheme has been adopted in the MPEG-7 standard. A study of published work in the area of videotext extraction and recognition reveals that, despite previous interest, a reliable general purpose video character recognition (VCR) system is yet to be developed. In our research and development of a character recognition algorithm designed specifically for the low resolution output from automatic videotext extractors, we observed that raw VCR accuracies obtained using various classifiers including kernel space methods such as SVMs, are inadequate for accurate video annotation and browsing. Intelligent post-processing mechanisms that are supported by general data characteristics of the domain are hence, required for performance improvement. We describe one such method, referred to as the font context analysis, which works independently of the raw character recognition technique. As a result, it can be easily implemented in conjunction with other VCR algorithms being developed elsewhere, and offer the same performance gains. Experimental results on various video streams show notable improvements in recognition rates with our system incorporating a SVM-based character recognition mechanism and font context analysis
Keywords :
character recognition; feature extraction; image classification; learning automata; telecommunication standards; video signal processing; video tape recorders; viewdata; MPEG-7 standard; SVM-based character recognition; automatic videotext extractors; character recognition algorithm; data characteristics; embedded font context; font context analysis; general purpose VCR system; intelligent postprocessing; kernel space methods; low resolution output; recognition rates; support vector machine based classifier; video annotation; video browsing; video classifiers; video frames; video streams; videotext based multimedia description; videotext extraction; videotext recognition; Algorithm design and analysis; Character recognition; Data mining; Image analysis; Kernel; Performance analysis; Streaming media; Text recognition; Video compression; Video recording;
Conference_Titel :
Image Processing, 2001. Proceedings. 2001 International Conference on
Conference_Location :
Thessaloniki
Print_ISBN :
0-7803-6725-1
DOI :
10.1109/ICIP.2001.958621