• DocumentCode
    2954978
  • Title

    A New Method for Word Segmentation from Arbitrarily-Oriented Video Text Lines

  • Author

    Sharma, Neelam ; Shivakumara, Palaiahnakote ; Pal, Umapada ; Blumenstein, Michael ; Tan, Chew Lim

  • Author_Institution
    Griffith Univ., Griffith, QLD, Australia
  • fYear
    2012
  • fDate
    3-5 Dec. 2012
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    Word segmentation has become a research topic to improve OCR accuracy for video text recognition, because a video text line suffers from arbitrary orientation, complex background and low resolution. Therefore, for word segmentation from arbitrarily-oriented video text lines, in this paper, we extract four new gradient directional features for each Canny edge pixel of the input text line image to produce four respective pixel candidate images. The union of four pixel candidate images is performed to obtain a text candidate image. The sequence of the components in the text candidate image according to the text line is determined using nearest neighbor criteria. Then we propose a two-stage method for segmenting words. In the first stage, for the distances between the components, we apply K-means clustering with K=2 to get probable word and non-word spacing clusters. The words are segmented based on probable word spacing and all other components are passed to the second stage for segmenting correct words. For each segmented and un-segmented words passed to the second stage, the method repeats all the steps until the K-means clustering step to find probable word and non-word spacing clusters. Then the method considers cluster nature, height and width of the components to identify the correct word spacing. The method is tested extensively on video curved text lines, non-horizontal straight lines, horizontal straight lines and text lines from the ICDAR-2003 competition data. Experimental results and a comparative study shows the results are encouraging and promising.
  • Keywords
    edge detection; feature extraction; gradient methods; image resolution; image segmentation; pattern clustering; text analysis; text detection; video signal processing; word processing; Canny edge pixel; ICDAR-2003 competition data; K-means clustering; OCR accuracy improvement; arbitrarily-oriented video text lines; complex background; gradient directional feature extraction; low resolution video text line; nearest neighbor criteria; probable nonword spacing clusters; probable word spacing clusters; segmented words; two-stage method; unsegmented words; video text recognition; word segmentation; word spacing identification; Feature extraction; Image edge detection; Image resolution; Image segmentation; Optical character recognition software; Text recognition; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Digital Image Computing Techniques and Applications (DICTA), 2012 International Conference on
  • Conference_Location
    Fremantle, WA
  • Print_ISBN
    978-1-4673-2180-8
  • Electronic_ISBN
    978-1-4673-2179-2
  • Type

    conf

  • DOI
    10.1109/DICTA.2012.6411703
  • Filename
    6411703