DocumentCode :
2198144
Title :
Logical structure recognition of scientific bibliographic references
Author :
Parmentier, E. ; Belaïd, A.
Author_Institution :
CNRS, CRIN, Vandoeuvre-les-Nancy, France
Volume :
2
fYear :
1997
fDate :
18-20 Aug 1997
Firstpage :
1072
Abstract :
Presents an approach for the logical structure recognition of bibliographic references. The objective is to produce, for each reference (given in a display format such as PostScript), structured data containing the hierarchy of recognized fields. Because bibliographic references vary (in the order and typographic format of fields, or in the author's writing style, for example), a robust and tolerant system architecture is needed. Recognition is therefore performed by a concept-oriented system that uses a model built automatically from a reference database. This model represents the reference fields and includes statistics on the occurrence of their terms. Recognition is achieved by a step-by-step activation of the most pertinent concepts. Each activated concept triggers the execution of an appropriate searching agent. This architecture is robust and non-deterministic, allowing a solution to be found even in difficult cases.
Keywords :
bibliographic systems; document image processing; image recognition; software agents; software fault tolerance; statistics; activated concepts; concept-oriented system; display format; field hierarchy; field order; logical structure recognition; nondeterministic architecture; reference database; reference presentation variations; robust fault-tolerant system architecture; scientific bibliographic references; searching agent; step-by-step activation; term occurrence statistics; typographic format; writing style; Constitution; Data mining; Databases; Displays; Page description languages; Robustness; SGML; Software libraries; Statistics; Writing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Proceedings of the Fourth International Conference on Document Analysis and Recognition, 1997
Conference_Location :
Ulm
Print_ISBN :
0-8186-7898-4
Type :
conf
DOI :
10.1109/ICDAR.1997.620673
Filename :
620673