DocumentCode :
2631841
Title :
Initial learning of document structure
Author :
Dengel, Andreas
Author_Institution :
German Res. Center for Artificial Intelligence, Kaiserslautern, Germany
fYear :
1993
fDate :
20-22 Oct 1993
Firstpage :
86
Lastpage :
90
Abstract :
Proposes an approach for automatically generating a decision tree which is applied as a model for the logical labeling of business letters. Instead of top-down determination of the discriminating attributes, the system inspects a finite set of document instances that are presented to a learner in a bottom-up position. The learner itself figures out local similarities, rates them with respect to the overall structure, and determines the best structural match of two instances (neighborhood). The entire decision tree is grown step by step deducing subtrees by forming generalizations from a neighborhood. Consequently, heuristics are learned for structurally discriminating documents during subsequent classification
Keywords :
business forms; decision theory; document handling; generalisation (artificial intelligence); heuristic programming; learning (artificial intelligence); pattern classification; trees (mathematics); best structural match; bottom-up inspection; business letters; classification; decision tree; discriminating attributes; document instances; document structure learning; generalizations; heuristics; local similarities; logical labeling; neighborhood; structural discrimination; subtrees; Artificial intelligence; Biomedical imaging; Classification tree analysis; Decision trees; Humans; Image classification; Labeling; Learning systems; Pattern matching; Weather forecasting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
Type :
conf
DOI :
10.1109/ICDAR.1993.395776
Filename :
395776
Link To Document :
بازگشت