DocumentCode
454826
Title
Text Detection, Localization and Segmentation in Compressed Videos
Author
Qian, Xueming ; Liu, Guizhong
Author_Institution
Sch. of Electron. & Inf. Eng., Xi'an Jiaotong Univ.
Volume
2
fYear
2006
fDate
14-19 May 2006
Abstract
Video text information plays an important role in semantic-based video analysis, indexing and retrieval, because video text is closely related to the content of a video. Text-based video analysis, browsing and retrieval are usually carried out in the following four steps: video text detection, localization, segmentation and recognition. Videos are commonly stored in compressed formats in which MPEG coding techniques are adopted. In this paper, a DCT-coefficient-based multilingual video text detection and localization scheme for compressed videos is proposed. Candidate text blocks are detected using a block texture constraint. An adaptive method for determining horizontally and vertically aligned text lines is then designed according to the run length of the horizontal and vertical block numbers. The remaining block regions are further verified by local block texture constraints, and the text block regions are localized by means of the horizontal and vertical block texture projections. Finally, a foreground and background integrated (FBI) video text segmentation approach is adopted to eliminate the complex background in text regions. The experimental results show the effectiveness of the proposed methods.
Keywords
data compression; discrete cosine transforms; image texture; object detection; text analysis; video coding; DCT; MPEG coding techniques; block texture constraint; compressed videos; foreground and background integrated; indexing; retrieval; semantic-based video analysis; text detection; vertical aligned text lines determination; video text information; video text localization; video text segmentation; Asia; Discrete cosine transforms; Image edge detection; Indexing; Information analysis; Information retrieval; Layout; Optical character recognition software; Speech analysis; Videos;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location
Toulouse
ISSN
1520-6149
Print_ISBN
1-4244-0469-X
Type
conf
DOI
10.1109/ICASSP.2006.1660360
Filename
1660360