• DocumentCode
    2708972
  • Title

    Automatic text extraction from video for content-based annotation and retrieval

  • Author

    Shim, Jae-Chang ; Dorai, Chitra ; Bolle, Ruud

  • Author_Institution
    Andong Nat. Univ., South Korea
  • Volume
    1
  • fYear
    1998
  • fDate
    16-20 Aug 1998
  • Firstpage
    618
  • Abstract
    Efficient content-based retrieval of image and video databases is an important application due to rapid proliferation of digital video data on the Internet and corporate intranets. Text either embedded or superimposed within video frames is very useful for describing the contents of the frames, as it enables both keyword and free-text based search, automatic video logging, and video cataloging. We have developed a scheme for automatically extracting text from digital images and videos for content annotation and retrieval. We present our approach to robust text extraction from video frames, which can handle complex image backgrounds, deal with different font sizes, font styles, and font appearances such as normal and inverse video. Our algorithm results in segmented characters that can be directly processed by an OCR system to produce ASCII text. Results from our experiments with over 5000 frames obtained from twelve MPEG video streams demonstrate the good performance of our system in terms of text identification accuracy and computational efficiency
  • Keywords
    cataloguing; image segmentation; information retrieval; optical character recognition; video coding; video databases; ASCII text; OCR system; automatic text extraction; automatic video logging; complex image backgrounds; computational efficiency; content-based annotation; content-based retrieval; digital images; digital video; font appearances; font sizes; font styles; free-text based search; inverse video; keyword based search; segmented characters; text identification accuracy; video cataloging; video databases; Content based retrieval; Data mining; Digital images; Image databases; Image retrieval; Image segmentation; Information retrieval; Internet; Optical character recognition software; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 1998. Proceedings. Fourteenth International Conference on
  • Conference_Location
    Brisbane, Qld.
  • ISSN
    1051-4651
  • Print_ISBN
    0-8186-8512-3
  • Type

    conf

  • DOI
    10.1109/ICPR.1998.711219
  • Filename
    711219