Author :
Baird, Henry S. ; Thompson, Ken
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
fDate :
6/1/1990 12:00:00 AM
Abstract :
By applying semantic analysis to images of extended passages of text, several volumes of a chess encyclopedia have been read with high accuracy. Although carefully proofread, the books were poorly printed and posed a severe challenge to conventional page-layout analysis and character-recognition methods. An experimental page-reader system performed strictly top-down layout analysis for identification of columns, lines, words, and characters. This proceeded rapidly and reliably thanks to a recently developed skew-estimation technique. Resegmentation of broken, touching, and dirty characters was handled in an efficient and integrated manner by a heuristic search operating on isolated words. By analyzing the syntax of game descriptions and applying the rules of chess, the error rate was reduced by a factor of 30 from what was achievable through shape analysis alone. Several computer vision systems integration issues suggested by this experience are discussed
Keywords :
character recognition; computer vision; character-recognition; chess reading; computer vision; heuristic search; page-reader system; semantic analysis; skew-estimation; Application software; Books; Computer errors; Computer vision; Encyclopedias; Error analysis; Game theory; Image analysis; Law; Legal factors; Performance analysis; Shape;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on