DocumentCode :
591962
Title :
Multi-lingual City Name Recognition for Indian Postal Automation
Author :
Pal, Umapada ; Roy, R.K. ; Kimura, Fumitaka
Author_Institution :
Comput. Vision & Pattern Recognition Unit, Indian Stat. Inst., Kolkata, India
fYear :
2012
fDate :
18-20 Sept. 2012
Firstpage :
169
Lastpage :
173
Abstract :
Under three-language formula, the destination address block of postal document of an Indian state is generally written in three languages: English, Hindi and the State official language. From the statistical analysis we found that 12.37%, 76.32% and 10.21% postal documents are written in Bangla, English and Devanagari script, respectively. Because of inter-mixing of these scripts in postal address writings, it is very difficult to identify the script by which a city name is written. To avoid such script identification difficulties, in this paper we proposed a lexicon-driven method for multi-lingual (English, Hindi and Bangla) city name recognition for Indian postal automation. In the proposed scheme, at first, to take care of slanted handwriting of different individuals a slant correction technique is performed. Next, a water reservoir concept is applied to pre-segment the slant corrected city names into possible primitive components (characters or its parts). Pre-segmented components of a city name are then merged into possible characters to get the best city name using the lexicon information. In order to merge these primitive components into characters and to find optimum character segmentation, dynamic programming (DP) is applied using total likelihood of the characters of a city name as an objective function. We tested our system on 16132 Indian trilingual city names and 92.25% overall recognition accuracy was obtained.
Keywords :
dynamic programming; handwritten character recognition; image segmentation; postal services; statistical analysis; Devanagari script; English language; Hindi language; Indian postal automation; State official language; character segmentation; destination address block; dynamic programming; lexicon-driven method; multilingual city name recognition; objective function; postal address writing; postal document; recognition accuracy; script identification; slant correction technique; slant presegmentation; slanted handwriting; statistical analysis; three-language formula; water reservoir concept; Automation; Cavity resonators; Cities and towns; Dynamic programming; Feature extraction; Handwriting recognition; Image segmentation; Handwritten character recognition; Indian Script; Indian postal automation; city name recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
Conference_Location :
Bari
Print_ISBN :
978-1-4673-2262-1
Type :
conf
DOI :
10.1109/ICFHR.2012.238
Filename :
6424387
Link To Document :
بازگشت