DocumentCode :
2076793
Title :
Learning contextual rules for document understanding
Author :
Semeraro, Giovanni ; Esposito, Floriana ; Malerba, Donato
Author_Institution :
Dipartimento di Inf., Universita degli Studi, Bari, Italy
fYear :
1994
fDate :
1-4 Mar 1994
Firstpage :
108
Lastpage :
115
Abstract :
We propose a supervised inductive learning approach for the problem of document understanding, that is, recognizing logical components of a document. For this purpose, FOCL and NDUBI/H, two systems that learn Horn clauses, have been employed. Several experimental results are reported and a critical view of the underlying independence assumption, made by almost all systems that learn from examples, is presented. This led us to redefine the problem of document understanding in terms of a new strategy of supervised inductive learning, called contextual learning. Experiments, in which a dependency hierarchy between concepts is defined, show that contextual rules increase predictive accuracy and decrease learning time for labelling problems, like document understanding. Encouraging results have been obtained when we tried to discover a linear dependency order by means of statistical methods
Keywords :
Horn clauses; classification; document handling; knowledge based systems; learning by example; FOCL; Horn clauses; INDUBI/H; contextual learning; contextual rules; document understanding; independence assumption; learning from examples; linear dependency order; logical components; statistical methods; supervised inductive learning approach; Accuracy; Artificial intelligence; Automatic testing; Data mining; Knowledge acquisition; Labeling; Machine learning; Optical character recognition software; Statistical analysis; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Artificial Intelligence for Applications, 1994., Proceedings of the Tenth Conference on
Conference_Location :
San Antonia, TX
Print_ISBN :
0-8186-5550-X
Type :
conf
DOI :
10.1109/CAIA.1994.323685
Filename :
323685
Link To Document :
بازگشت