DocumentCode
3486795
Title
A New Method for Character Segmentation from Multi-oriented Video Words
Author
Sharma, Neelam ; Shivakumara, Palaiahnakote ; Pal, Umapada ; Blumenstein, Michael ; Chew Lim Tan
Author_Institution
Griffith Univ., Gold Coast, NSW, Australia
fYear
2013
fDate
25-28 Aug. 2013
Firstpage
413
Lastpage
417
Abstract
This paper presents a two-stage method for multi-oriented video character segmentation. Words segmented from video text lines are considered for character segmentation in the present work. Words can contain isolated or non-touching characters, as well as touching characters. Therefore, the character segmentation problem can be viewed as a two stage problem. In the first stage, text cluster is identified and isolated (non-touching) characters are segmented. The orientation of each word is computed and the segmentation paths are found in the direction perpendicular to the orientation. Candidate segmentation points computed using the top distance profile are used to find the segmentation path between the characters considering the background cluster. In the second stage, the segmentation results are verified and a check is performed to ascertain whether the word component contains touching characters or not. The average width of the components is used to find the touching character components. For segmentation of the touching characters, segmentation points are then found using average stroke width information, along with the top and bottom distance profiles. The proposed method was tested on a large dataset and was evaluated in terms of precision, recall and f-measure. A comparative study with existing methods reveals the superiority of the proposed method.
Keywords
image segmentation; optical character recognition; text detection; video signal processing; average stroke width information; background cluster; bottom distance profiles; candidate segmentation points; isolated character segmentation; multioriented video character segmentation; multioriented video words; nontouching characters; text cluster identification; top distance profile; touching character segmentation; two-stage method; video text lines; Character recognition; Educational institutions; Electronic mail; Image resolution; Image segmentation; Noise measurement; Optical character recognition software; Multi-oriented Document Processing; Piece-wise Linear Segmentation Line (PLSL); Video Character Recognition; Video Character Segmentation; Video Document Analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location
Washington, DC
ISSN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2013.90
Filename
6628655
Link To Document