• DocumentCode
    3486795
  • Title

    A New Method for Character Segmentation from Multi-oriented Video Words

  • Author

    Sharma, Neelam ; Shivakumara, Palaiahnakote ; Pal, Umapada ; Blumenstein, Michael ; Chew Lim Tan

  • Author_Institution
    Griffith Univ., Gold Coast, NSW, Australia
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    413
  • Lastpage
    417
  • Abstract
    This paper presents a two-stage method for multi-oriented video character segmentation. Words segmented from video text lines are considered for character segmentation in the present work. Words can contain isolated or non-touching characters, as well as touching characters. Therefore, the character segmentation problem can be viewed as a two stage problem. In the first stage, text cluster is identified and isolated (non-touching) characters are segmented. The orientation of each word is computed and the segmentation paths are found in the direction perpendicular to the orientation. Candidate segmentation points computed using the top distance profile are used to find the segmentation path between the characters considering the background cluster. In the second stage, the segmentation results are verified and a check is performed to ascertain whether the word component contains touching characters or not. The average width of the components is used to find the touching character components. For segmentation of the touching characters, segmentation points are then found using average stroke width information, along with the top and bottom distance profiles. The proposed method was tested on a large dataset and was evaluated in terms of precision, recall and f-measure. A comparative study with existing methods reveals the superiority of the proposed method.
  • Keywords
    image segmentation; optical character recognition; text detection; video signal processing; average stroke width information; background cluster; bottom distance profiles; candidate segmentation points; isolated character segmentation; multioriented video character segmentation; multioriented video words; nontouching characters; text cluster identification; top distance profile; touching character segmentation; two-stage method; video text lines; Character recognition; Educational institutions; Electronic mail; Image resolution; Image segmentation; Noise measurement; Optical character recognition software; Multi-oriented Document Processing; Piece-wise Linear Segmentation Line (PLSL); Video Character Recognition; Video Character Segmentation; Video Document Analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.90
  • Filename
    6628655