DocumentCode :
166182
Title :
Devanagari text extraction from natural scene images
Author :
Raj, Hrishav ; Ghosh, Rajesh
Author_Institution :
Comput. Sci. & Eng. Dept., NIT Patna, Patna, India
fYear :
2014
fDate :
24-27 Sept. 2014
Firstpage :
513
Lastpage :
517
Abstract :
In natural scene images, text provides vital clues for many image-processing applications, including assisted navigation, content-based image retrieval, automatic geocoding, and scene understanding. Locating text against a multicolored, complex background is difficult because of non-uniform illumination, the complexity of the background, and variation in the size, font, and line orientation of the text. In this paper we propose a novel approach for extracting Devanagari text from natural scene images captured by a camera. The scheme is based on the analysis of connected components (CCs). The presence of a head line is characteristic of this script, and the scheme uses mathematical morphological operations to extract the head lines. The extracted text can then be passed to an optical character reader for recognition or to a text-to-speech engine. The binarization of scene images was also studied, and the adaptive thresholding approach was observed to be effective. The algorithm was tested on Devanagari text contained in a collection of 100 scene images.
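The three operations named in the abstract (adaptive thresholding, morphological extraction of the head line, and connected-component analysis) can be illustrated on a toy image. The following is a minimal pure-Python sketch, not the authors' implementation: the function names, the parameters `window`, `offset` and `min_run`, and the toy arrays are all assumptions made for illustration, and the paper's actual structuring elements and thresholds are not given in this record.

```python
from collections import deque

# Illustrative sketch only; names and parameter values are assumptions,
# not taken from the paper.

def adaptive_threshold(gray, window=3, offset=10):
    """Mark a pixel as text (1) if it is darker than its local mean minus offset."""
    h, w = len(gray), len(gray[0])
    r = window // 2
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Local neighbourhood, clipped at the image border.
            vals = [gray[j][i]
                    for j in range(max(0, y - r), min(h, y + r + 1))
                    for i in range(max(0, x - r), min(w, x + r + 1))]
            local_mean = sum(vals) / len(vals)
            out[y][x] = 1 if gray[y][x] < local_mean - offset else 0
    return out

def horizontal_opening(binary, min_run):
    """Morphological opening with a 1 x min_run horizontal line element.

    Equivalent formulation: keep only foreground pixels that lie in a
    horizontal run of at least min_run consecutive foreground pixels.
    """
    out = [[0] * len(row) for row in binary]
    for y, row in enumerate(binary):
        x = 0
        while x < len(row):
            if row[x]:
                start = x
                while x < len(row) and row[x]:
                    x += 1
                if x - start >= min_run:
                    for i in range(start, x):
                        out[y][i] = 1
            else:
                x += 1
    return out

def connected_components(binary):
    """Return the 4-connected foreground regions as lists of (row, col) pixels."""
    h, w = len(binary), len(binary[0])
    seen = [[False] * w for _ in range(h)]
    comps = []
    for y in range(h):
        for x in range(w):
            if binary[y][x] and not seen[y][x]:
                seen[y][x] = True
                queue = deque([(y, x)])
                comp = []
                while queue:
                    cy, cx = queue.popleft()
                    comp.append((cy, cx))
                    for ny, nx in ((cy - 1, cx), (cy + 1, cx),
                                   (cy, cx - 1), (cy, cx + 1)):
                        if (0 <= ny < h and 0 <= nx < w
                                and binary[ny][nx] and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
                comps.append(comp)
    return comps

# Toy grayscale patch: one dark pixel on a bright background.
gray = [[200, 200, 200],
        [200,  50, 200],
        [200, 200, 200]]
binary_patch = adaptive_threshold(gray)

# Toy binary "word": a head line on row 1 with two vertical strokes below it.
word = [
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 1, 1, 1, 1, 0],
    [0, 0, 1, 0, 0, 1, 0, 0, 0],
    [0, 0, 1, 0, 0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0],
]
headline = horizontal_opening(word, min_run=5)
# Subtract the head line to leave only the strokes hanging from it.
strokes = [[p & (1 - hl) for p, hl in zip(prow, hrow)]
           for prow, hrow in zip(word, headline)]
word_components = connected_components(word)
stroke_components = connected_components(strokes)
```

In this toy case the long horizontal run survives the opening while the one-pixel-wide strokes do not, so the whole word forms a single connected component and the strokes separate once the head line is subtracted. On real scene images a library routine would normally replace these loops.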
Keywords :
image colour analysis; image segmentation; mathematical morphology; natural scenes; optical character recognition; text detection; Devanagari text extraction; adaptive thresholding approach; assisted navigation content based image retrieval; automatic geocoding; extracted text recognition; image processing; mathematical morphological operations; multicolored complex background; natural scene images; optical character reader; scenic image binarization; text-to-speech engine; Cameras; Feature extraction; Image edge detection; Morphological operations; Signal processing algorithms; Text recognition; Connected Components; Extracted Text; Morphological opening; Region based; erosion and dilation;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Advances in Computing, Communications and Informatics (ICACCI), 2014 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4799-3078-4
Type :
conf
DOI :
10.1109/ICACCI.2014.6968472
Filename :
6968472