• DocumentCode
    454826
  • Title

    Text Detection, Localization and Segmentation in Compressed Videos

  • Author

    Qian, Xueming ; Liu, Guizhong

  • Author_Institution
    Sch. of Electron. & Inf. Eng., Xi´´an Jiaotong Univ.
  • Volume
    2
  • fYear
    2006
  • fDate
    14-19 May 2006
  • Abstract
    Video text information plays an important role in semantic-based video analysis, indexing and retrieval. Video texts are closely related to the content of a video. Text-based video analysis, browsing and retrieval are usually carried out in the following for steps: video text detection, localization, segmentation and recognition. Videos are commonly stored in compressed formats where MPEG coding techniques are adopted. In this paper, a DCT coefficient based multilingual video text detection and localization scheme for compressed videos is proposed. Candidate text blocks are detected in terms of block texture constraint. An adaptive method for the horizontal and vertical aligned text lines determination is then designed according to the run length of the horizontal and vertical block numbers. The remaining block regions are further verified by local block texture constraints. And the text block region can be localized by virtue of the horizontal and vertical block texture projections. Finally, a foreground and background integrated (FBI) video text segmentation approach is adopted in this paper to eliminate the complex background in text regions. The final experimental results show the effectiveness of our methods
  • Keywords
    data compression; discrete cosine transforms; image texture; object detection; text analysis; video coding; DCT; MPEG coding techniques; block texture constraint; compressed videos; foreground and background integrated; indexing; retrieval; semantic-based video analysis; text detection; vertical aligned text lines determination; video text information; video text localization; video text segmentation; Asia; Discrete cosine transforms; Image edge detection; Indexing; Information analysis; Information retrieval; Layout; Optical character recognition software; Speech analysis; Videos;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
  • Conference_Location
    Toulouse
  • ISSN
    1520-6149
  • Print_ISBN
    1-4244-0469-X
  • Type

    conf

  • DOI
    10.1109/ICASSP.2006.1660360
  • Filename
    1660360