DocumentCode :
2144303
Title :
Document Recognition without Strong Models
Author :
Baird, Henry S.
Author_Institution :
Comput. Sci. & Eng. Dept., Lehigh Univ., Bethlehem, PA, USA
fYear :
2011
fDate :
18-21 Sept. 2011
Firstpage :
414
Lastpage :
423
Abstract :
Can a high-performance document image recognition system be built without detailed knowledge of the application? Having benefited from the statistical machine learning revolution of the last twenty years, our architectures rely less on hand-crafted special-case rules and more on models trained on labeled-sample data sets. But urgent questions remain. When we can\´t collect (and label) enough real training data, does it help to complement them with data synthesized using generative models? Is it ever completely safe to rely on synthetic data? If we can\´t manage to train (or craft) a single complete, near-perfect, application-specific "strong" model to drive recognition, can we make progress by combining several imperfect or incomplete "weak" models? Can recognition that is carried out jointly over weak models perform optimally while still running fast? Can a recognizer automatically pick a strong model of its input? Must we always pre-train models for every kind ("style") of input expected, or can a recognizer adapt to unknown styles? Can weak models adapt autonomously, growing stronger and so driving accuracy higher, without any human intervention? Can one model "criticize" - and then proceed to correct - other models, even while it is being criticized and corrected in turn by them? After twenty-five years of research on these questions we have partial answers, many in the affirmative: in addition to promising laboratory demonstrations, we can take pride in successful applications. I\´ll illustrate the evolution of the state of the art with concrete examples, and point out open problems.
Keywords :
document image processing; image recognition; learning (artificial intelligence); set theory; data set; drive recognition; hand crafted special case rule; high performance document image recognition system; human intervention; pretrain model; real training data synthesis; statistical machine learning revolution; Accuracy; Adaptation models; Data models; Games; Humans; Image recognition; Semantics; Rule-based recognition; adaptive recognition; anytime algorithms; joint recognition; learning models; model-driven recognition; strong versus weak models; style-conscious recognition; synthetically generated data; whole-book recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
ISSN :
1520-5363
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2011.91
Filename :
6065346
Link To Document :
بازگشت