Improving Open-Vocabulary Scene Text Recognition

Author

Feild, Jacqueline L. ; Learned-Miller, Erik G.

Author_Institution

Dept. of Comput. Sci., Univ. of Massachusetts Amherst, Amherst, MA, USA

fYear

2013

fDate

25-28 Aug. 2013

Firstpage

604

Lastpage

608

Abstract

This paper presents a system for open-vocabulary text recognition in images of natural scenes. First, we describe a novel technique for text segmentation that models smooth color changes across images. We combine this with a recognition component based on a conditional random field with histogram of oriented gradients descriptors and incorporate language information from a lexicon to improve recognition performance. Many existing techniques for this problem use language information from a standard lexicon, but these may not include many of the words found in images of the environment, such as storefront signs and street signs. We avoid this limitation by incorporating language information from a large web-based lexicon of around 13.5 million words. This lexicon contains words encountered during a crawl of the web, so it is likely to contain proper nouns, like business names and street names. We show that our text segmentation method allows for better recognition performance than the current state-of-the-art text segmentation method. We also evaluate this full system on two standard data sets, ICDAR 2003 and ICDAR 2011, and show an increase in word recognition performance compared to the current state-of-the-art methods.

Keywords

Internet; gradient methods; image colour analysis; image segmentation; natural language processing; text analysis; vocabulary; Web based lexicon; Web crawler; business names; conditional random field; histogram of oriented gradients descriptors; language information; natural scenes; open vocabulary scene text recognition; proper nouns; smooth color models; standard lexicon; storefront signs; street names; street signs; text segmentation; Accuracy; Error correction; Image color analysis; Image recognition; Image segmentation; Standards; Text recognition; scene text recognition; scene text segmentation;

fLanguage

English

Publisher

ieee

Conference_Titel

Document Analysis and Recognition (ICDAR), 2013 12th International Conference on

Conference_Location

Washington, DC

ISSN

1520-5363

Type

conf

DOI

10.1109/ICDAR.2013.125

Filename

6628690