DocumentCode
166182
Title
Devanagari text extraction from natural scene images
Author
Raj, Hrishav ; Ghosh, Rajesh
Author_Institution
Comput. Sci. & Eng. Dept., NIT Patna, Patna, India
fYear
2014
fDate
24-27 Sept. 2014
Firstpage
513
Lastpage
517
Abstract
In scenic images, information in the form of text provides vital clues for most applications based on image processing. These include assisted navigation content based image retrieval, automatic geocoding and understanding the scene. But in a multicolored complex background, it is quite a daunting task to locate the text. This task is daunting because of non-uniformity in illumination, complexity of the backdrop, and differences in the size font & line-orientation of the text. We propose a novel approach for Devanagari text extraction from natural scene images in this paper. We can use a text-to-speech engine or Optical Character Reader to recognize the extracted text. The basis of our scheme is to analyze the CCs. This is done to extract Devanagari text from scenic images captured by camera. The presence of head line is unique to this script. Our scheme makes use of mathematical morphological operations to extract the headlines. Also the binarization of scenic images was studied. Here the effectiveness of the adaptive thresholding approach was observed. The algorithm was tested on Devanagari text contained within a collection of 100 scenic images.
Keywords
image colour analysis; image segmentation; mathematical morphology; natural scenes; optical character recognition; text detection; Devanagari text extraction; adaptive thresholding approach; assisted navigation content based image retrieval; automatic geocoding; extracted text recognition; image processing; mathematical morphological operations; multicolored complex background; natural scene images; optical character reader; scenic image binarization; text-to-speech engine; Cameras; Feature extraction; Image edge detection; Morphological operations; Signal processing algorithms; Text recognition; Connected Components; Extracted Text; Morphological opening; Region based; erosion and dilation;
fLanguage
English
Publisher
ieee
Conference_Titel
Advances in Computing, Communications and Informatics (ICACCI, 2014 International Conference on
Conference_Location
New Delhi
Print_ISBN
978-1-4799-3078-4
Type
conf
DOI
10.1109/ICACCI.2014.6968472
Filename
6968472
Link To Document