مرکز منطقه ای اطلاع رساني علوم و فناوري - Top-down and bottom-up cues for scene text recognition

DocumentCode :

2717396

Title :

Top-down and bottom-up cues for scene text recognition

Author :

Mishra, Anand ; Alahari, Karteek ; Jawahar, C.V.

Author_Institution :

CVIT, IIIT Hyderabad, Hyderabad, India

fYear :

2012

fDate :

16-21 June 2012

Firstpage :

2687

Lastpage :

2694

Abstract :

Scene text recognition has gained significant attention from the computer vision community in recent years. Recognizing such text is a challenging problem, even more so than the recognition of scanned documents. In this work, we focus on the problem of recognizing text extracted from street images. We present a framework that exploits both bottom-up and top-down cues. The bottom-up cues are derived from individual character detections from the image. We build a Conditional Random Field model on these detections to jointly model the strength of the detections and the interactions between them. We impose top-down cues obtained from a lexicon-based prior, i.e. language statistics, on the model. The optimal word represented by the text image is obtained by minimizing the energy function corresponding to the random field model. We show significant improvements in accuracies on two challenging public datasets, namely Street View Text (over 15%) and ICDAR 2003 (nearly 10%).

Keywords :

character recognition; image recognition; random processes; text detection; ICDAR 2003; bottom-up cues; character detection; computer vision community; conditional random field model; energy function; language statistics; lexicon-based prior; optimal word; scanned document recognition; scene text recognition; street image; street view text; text image; top-down cues; Accuracy; Character recognition; Image edge detection; Optical character recognition software; Support vector machines; Text recognition;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on

Conference_Location :

Providence, RI

ISSN :

1063-6919

Print_ISBN :

978-1-4673-1226-4

Electronic_ISBN :

1063-6919

Type :

conf

DOI :

10.1109/CVPR.2012.6247990

Filename :

6247990

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2717396