• DocumentCode
    692185
  • Title

    A novel approach for detection and localization of caption in video based on pixel pairs

  • Author

    Boaz, Too Kipyego ; Prabhakar, C.J.

  • Author_Institution
    Dept. of Comput. Sci., Kuvempu Univ., Shimoga, India
  • fYear
    2013
  • fDate
    27-28 Sept. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper proposed a method for localization of caption text, particularly Kannada caption text from video sequence. The proposed method uses multiple frames integration approach based on three characteristics namely character location, edge distribution and pixel contrast. The novelty of our approach lies on the pairing of pixels by finding double edge maps of the original and its rotated image. To highlight the text area based on the video temporal information, a Roberts´ edge detector is applied to reference gray scale frames of both original and rotated images, followed by combination of the two edge maps and later on a multiple frame integration based on the above three characteristics by employing logical AND operator that keeps only pixels that are invariant among the frames. The morphological operations are applied on the edge map to connect the text characters and discard nontext components. The result is then smoothed and overlaid on one of the original reference images to extract text candidate block. The experimental results carried out on sample video data of Kannada TV channel (Commercial) show that the proposed approach achieves a high precision and recall rate.
  • Keywords
    edge detection; text analysis; text detection; video signal processing; Kannada TV channel show; Kannada caption text; Roberts edge detector; character location; double edge maps; edge distribution; gray scale frames; logical AND operator; morphological operations; multiple frames integration approach; nontext components; original reference images; pixel contrast; pixel pairs; text candidate block extraction; video caption detection; video caption localization; video sequence; video temporal information; Caption text; Double edge; Text detection; Text localization;
  • fLanguage
    English
  • Publisher
    iet
  • Conference_Titel
    Research & Technology in the Coming Decades (CRT 2013), National Conference on Challenges in
  • Conference_Location
    Ujire
  • Electronic_ISBN
    978-1-84919-868-4
  • Type

    conf

  • DOI
    10.1049/cp.2013.2488
  • Filename
    6851535