• DocumentCode
    2384319
  • Title

    A unified text extraction method for instructional videos

  • Author

    Tang, Lijun ; Kender, John R.

  • Author_Institution
    Dept. of Comput. Sci., Columbia Univ., New York, NY, USA
  • Volume
    3
  • fYear
    2005
  • fDate
    11-14 Sept. 2005
  • Abstract
    Videotext can be an efficient semantic index and summary for instructional videos. However, videotext usually appears in different visual formats: handwritten slides, electronic slides, book pages, web pages, handwriting on chalkboard, etc. We propose a unified approach to handle all these kinds of videotext in three steps. First, we detect still video segments by analyzing motion energy patterns in instructional videos, and construct a quality-enhanced candidate text frame for each still video segment. Then, we use a trained SVM classifier to verify the candidate text frames, as well as to segment the text region and individual text blocks from the verified frames. Finally, we filter redundant text frames with similar text content by a Hausdorff distance-based image comparison algorithm. The resulting text frames are automatically organized into HTML and PDF documents to serve as an imagery summarization of the instructional videos. We show the application of our method to 75 instructional videos of five different courses, and discuss its applications.
  • Keywords
    support vector machines; video signal processing; viewdata; HTML documents; Hausdorff distance; PDF documents; SVM classifier; image comparison algorithm; imagery summarization; instructional videos; motion energy patterns; quality-enhanced candidate text frame; redundant text frames filter; semantic index; unified text extraction method; video segments detection; videotext; Books; Filters; Image segmentation; Motion analysis; Motion detection; Pattern analysis; Support vector machine classification; Support vector machines; Videos; Web pages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing, 2005. ICIP 2005. IEEE International Conference on
  • Print_ISBN
    0-7803-9134-9
  • Type

    conf

  • DOI
    10.1109/ICIP.2005.1530617
  • Filename
    1530617