• DocumentCode
    1834117
  • Title

    Augmented edit distance based temporal contiguity analysis for improved videotext recognition

  • Author

    Aradhye, Hrishikesh ; Dorai, Chitra

  • Author_Institution
    Ohio State Univ., Columbus, OH, USA
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    275
  • Lastpage
    280
  • Abstract
    Videotext refers to text superimposed on video frames and it enables automatic content annotation and indexing of large video and image collections. Its importance is underscored by the fact that a videotext-based multimedia description scheme has recently been adopted into the MPEG-7 standard. A study of published work in the area of automatic videotext extraction and recognition reveals that, despite recent interest, a reliable general purpose video character recognition (VCR) system is yet to be developed. In our development of a VCR system designed specifically to handle the low resolution output from videotext extractors, we observed that raw VCR accuracies obtained using various classifiers including kernel space methods such as SVM, are inadequate for accurate video annotation. We propose an intelligent postprocessing mechanism that is supported by general data characteristics of this domain for VCR performance improvement. We describe temporal contiguity analysis, which works independently of the raw character recognition technique and works well even for moving videotext. This novel mechanism can be easily implemented in conjunction with VCR algorithms being developed elsewhere to offer the same performance gains. Experimental results on various video streams show notable improvements in recognition rates with our system incorporating a SVM-based recognition engine and temporal contiguity analysis
  • Keywords
    character recognition; database indexing; feature extraction; learning automata; multimedia databases; very large databases; video databases; viewdata; visual databases; MPEG-7 standard; SVM; augmented edit distance; automatic content annotation; character recognition; image collections; indexing; intelligent postprocessing; large video collections; low resolution output; multimedia description scheme; temporal contiguity analysis; video character recognition; video frames; videotext extractors; videotext recognition; Character recognition; Data mining; Indexing; Kernel; MPEG 7 Standard; Performance gain; Streaming media; Support vector machine classification; Support vector machines; Video recording;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Signal Processing, 2001 IEEE Fourth Workshop on
  • Conference_Location
    Cannes
  • Print_ISBN
    0-7803-7025-2
  • Type

    conf

  • DOI
    10.1109/MMSP.2001.962746
  • Filename
    962746