DocumentCode
3422167
Title
PhotoOCR: Reading Text in Uncontrolled Conditions
Author
Bissacco, Alessandro ; Cummins, Mark ; Netzer, Yuval ; Neven, Hartmut
fYear
2013
fDate
1-8 Dec. 2013
Firstpage
785
Lastpage
792
Abstract
We describe Photo OCR, a system for text extraction from images. Our particular focus is reliable text extraction from smartphone imagery, with the goal of text recognition as a user input modality similar to speech recognition. Commercially available OCR performs poorly on this task. Recent progress in machine learning has substantially improved isolated character classification, we build on this progress by demonstrating a complete OCR system using these techniques. We also incorporate modern data center-scale distributed language modelling. Our approach is capable of recognizing text in a variety of challenging imaging conditions where traditional OCR systems fail, notably in the presence of substantial blur, low resolution, low contrast, high image noise and other distortions. It also operates with low latency, mean processing time is 600 ms per image. We evaluate our system on public benchmark datasets for text extraction and outperform all previously reported results, more than halving the error rate on multiple benchmarks. The system is currently in use in many applications at Google, and is available as a user input modality in Google Translate for Android.
Keywords
image classification; learning (artificial intelligence); optical character recognition; text detection; Android; Google Translate; OCR system; PhotoOCR; datacenter-scale distributed language modelling; imaging conditions; isolated character classification; machine learning; public benchmark datasets; smartphone imagery; text extraction; text reading; text recognition; uncontrolled conditions; Computational modeling; Google; Image segmentation; Mathematical model; Optical character recognition software; Text recognition; Training; OCR; deep learning; scene text; text recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision (ICCV), 2013 IEEE International Conference on
Conference_Location
Sydney, NSW
ISSN
1550-5499
Type
conf
DOI
10.1109/ICCV.2013.102
Filename
6751207
Link To Document