Title :
End-to-end text recognition with convolutional neural networks
Author :
Tao Wang ; Wu, D.J. ; Coates, Andrew ; Ng, A.Y.
Author_Institution :
Stanford Univ., Stanford, CA, USA
Abstract :
Full end-to-end text recognition in natural images is a challenging problem that has received much attention recently. Traditional systems in this area have relied on elaborate models incorporating carefully hand-engineered features or large amounts of prior knowledge. In this paper, we take a different route and combine the representational power of large, multilayer neural networks together with recent developments in unsupervised feature learning, which allows us to use a common framework to train highly-accurate text detector and character recognizer modules. Then, using only simple off-the-shelf methods, we integrate these two modules into a full end-to-end, lexicon-driven, scene text recognition system that achieves state-of-the-art performance on standard benchmarks, namely Street View Text and ICDAR 2003.
Keywords :
feature extraction; handwritten character recognition; multilayer perceptrons; natural scenes; text detection; unsupervised learning; ICDAR 2003; Street View Text; character recognizer module; convolutional neural network; end-to-end text recognition; lexicon driven recognition system; multilayer neural network; natural image processing; off-the-shelf method; scene text recognition system; text detection; unsupervised feature learning; Accuracy; Benchmark testing; Character recognition; Detectors; Neural networks; Standards; Text recognition;
Conference_Titel :
Pattern Recognition (ICPR), 2012 21st International Conference on
Conference_Location :
Tsukuba
Print_ISBN :
978-1-4673-2216-4