DocumentCode :
3487379
Title :
Recognition of Video Text through Temporal Integration
Author :
Phan, Trung Quy ; Shivakumara, Palaiahnakote ; Tong Lu ; Tan, Chew Lim
Author_Institution :
Sch. of Comput., Nat. Univ. of Singapore, Singapore, Singapore
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
589
Lastpage :
593
Abstract :
This paper presents a method for temporal integration, which can be used to improve the recognition accuracy of video texts. Given a word detected in a video frame, we use a combination of Stroke Width Transform and SIFT (Scale Invariant Feature Transform) to track it both backward and forward in time. The text instances within the word´s frame span are then extracted and aligned at pixel level. In the second step, we integrate these instances into a text probability map. By thresholding this map, we obtain an initial binarization of the word. In the final step, the shapes of the characters are refined using the intensity values. This helps to preserve the distinctive character features (e.g., sharp edges and holes), which are useful for OCR engines to distinguish between the different character classes. Experiments on English and German videos show that the proposed method outperforms existing ones in terms of recognition accuracy.
Keywords :
optical character recognition; probability; text detection; transforms; video signal processing; English videos; German videos; OCR engines; SIFT; intensity values; scale invariant feature transform; stroke width transform; temporal integration; text probability map; video frame; video text recognition; word initial binarization; Accuracy; Character recognition; Engines; Feature extraction; Optical character recognition software; Shape; Text recognition; SIFT; Stroke Width Transform; multiple frame integration; temporal integration; text binarization; text enhancement; text probability; text tracking; video text recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.122
Filename :
6628687
Link To Document :
بازگشت