DocumentCode
1834117
Title
Augmented edit distance based temporal contiguity analysis for improved videotext recognition
Author
Aradhye, Hrishikesh ; Dorai, Chitra
Author_Institution
Ohio State Univ., Columbus, OH, USA
fYear
2001
fDate
2001
Firstpage
275
Lastpage
280
Abstract
Videotext refers to text superimposed on video frames and it enables automatic content annotation and indexing of large video and image collections. Its importance is underscored by the fact that a videotext-based multimedia description scheme has recently been adopted into the MPEG-7 standard. A study of published work in the area of automatic videotext extraction and recognition reveals that, despite recent interest, a reliable general purpose video character recognition (VCR) system is yet to be developed. In our development of a VCR system designed specifically to handle the low resolution output from videotext extractors, we observed that raw VCR accuracies obtained using various classifiers including kernel space methods such as SVM, are inadequate for accurate video annotation. We propose an intelligent postprocessing mechanism that is supported by general data characteristics of this domain for VCR performance improvement. We describe temporal contiguity analysis, which works independently of the raw character recognition technique and works well even for moving videotext. This novel mechanism can be easily implemented in conjunction with VCR algorithms being developed elsewhere to offer the same performance gains. Experimental results on various video streams show notable improvements in recognition rates with our system incorporating a SVM-based recognition engine and temporal contiguity analysis
Keywords
character recognition; database indexing; feature extraction; learning automata; multimedia databases; very large databases; video databases; viewdata; visual databases; MPEG-7 standard; SVM; augmented edit distance; automatic content annotation; character recognition; image collections; indexing; intelligent postprocessing; large video collections; low resolution output; multimedia description scheme; temporal contiguity analysis; video character recognition; video frames; videotext extractors; videotext recognition; Character recognition; Data mining; Indexing; Kernel; MPEG 7 Standard; Performance gain; Streaming media; Support vector machine classification; Support vector machines; Video recording;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Signal Processing, 2001 IEEE Fourth Workshop on
Conference_Location
Cannes
Print_ISBN
0-7803-7025-2
Type
conf
DOI
10.1109/MMSP.2001.962746
Filename
962746
Link To Document