• DocumentCode
    2142667
  • Title
    A New Gradient Based Character Segmentation Method for Video Text Recognition
  • Author
    Shivakumara, Palaiahnakote ; Bhowmick, Souvik ; Su, Bolan ; Tan, Chew Lim ; Pal, Umapada
  • Author_Institution
    Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    126
  • Lastpage
    130
  • Abstract
    Current OCR engines cannot segment words and characters from video images because of the complex backgrounds and low resolution of such images. To improve accuracy, this paper presents a new gradient-based method for segmenting words and characters from text lines of any orientation in video frames prior to recognition. We propose a Max-Min clustering concept to obtain the text cluster from the normalized absolute gradient feature matrix of the video text line image. The union of the text cluster with the output of the Canny operation on the input video text line is proposed to restore missing text candidates. A run-length algorithm is then applied to the text candidate image to identify word gaps. We propose a new idea for segmenting characters from the restored word image, based on the fact that the text height difference at a character boundary column is smaller than that at the other columns of the word image. We have conducted experiments on a large dataset at two levels (word and character) in terms of recall, precision and F-measure. Our experimental setup involves 3527 English and Chinese characters, selected from the TRECVID databases of 2005 and 2006.
  • Keywords
    image segmentation; object recognition; pattern clustering; text analysis; video signal processing; Canny operation; absolute gradient feature matrix; gradient based character segmentation method; max-min clustering concept; run length algorithm; text candidate image; text cluster; video text recognition; word gap identification; Accuracy; Character recognition; Image recognition; Image segmentation; Optical character recognition software; Text recognition; Gradient features; Video character extraction; Video character recognition; Video document analysis; Word segmentation;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type
    conf
  • DOI
    10.1109/ICDAR.2011.34
  • Filename
    6065289
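
The two steps the abstract describes most concretely, Max-Min clustering of the gradient matrix and run-length detection of word gaps, can be sketched as follows. This is a minimal illustration rather than the authors' implementation: the nearest-extreme assignment rule, the function names max_min_cluster and word_gaps, and the min_gap threshold are all assumptions, since the abstract gives no such details. The Canny union and the height-difference character segmentation are not covered.

```python
def max_min_cluster(grad):
    """Turn a normalized gradient matrix into a binary text-candidate mask.

    Assumption: each value joins whichever extreme (the matrix maximum or
    minimum) it lies closer to, and values nearer the maximum count as text.
    """
    flat = [v for row in grad for v in row]
    hi, lo = max(flat), min(flat)
    return [[1 if abs(v - hi) < abs(v - lo) else 0 for v in row]
            for row in grad]


def word_gaps(mask, min_gap):
    """Return (start, end) column ranges of empty runs at least min_gap wide.

    Leading and trailing empty runs are skipped because they are margins,
    not gaps between words. min_gap is a hypothetical threshold.
    """
    width = len(mask[0])
    empty = [all(row[c] == 0 for row in mask) for c in range(width)]
    gaps, start, seen_text = [], None, False
    for c, is_empty in enumerate(empty):
        if is_empty:
            # Open a run only once some text has been seen to its left.
            if start is None and seen_text:
                start = c
        else:
            # A text column closes the current run, if any.
            if start is not None and c - start >= min_gap:
                gaps.append((start, c - 1))
            start = None
            seen_text = True
    return gaps


if __name__ == "__main__":
    # Toy 2x7 "text line": two strokes separated by three empty columns.
    grad = [
        [0.9, 0.8, 0.1, 0.0, 0.1, 0.7, 1.0],
        [0.1, 0.9, 0.0, 0.1, 0.0, 0.8, 0.2],
    ]
    mask = max_min_cluster(grad)
    print(mask)
    print(word_gaps(mask, min_gap=2))
```

In practice the gap threshold would have to be derived from the data, for example from the distribution of run lengths within a text line, which this sketch leaves to the caller.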