Title :
AIDAS: incremental logical structure discovery in PDF documents
Author :
Anjewierden, Anjo
Author_Institution :
Dept. of Social Sci. Inf., Amsterdam Univ., Netherlands
fDate :
6/23/1905 12:00:00 AM
Abstract :
AIDAS is part of a research project in which the aim is to turn technical manuals into a database of indexed training material. We describe the approach AIDAS uses to extract the logical document structure from PDF documents. The approach is based on the idea that the layout structure contains cues about the logical structure and that the logical structure can be discovered incrementally
Keywords :
grammars; indexing; page description languages; user manuals; AIDAS; PDF documents; incremental logical structure discovery; indexed training material database; layout structure; logical document structure extraction; research project; technical manuals; Graphics; Image converters; Image databases; Indexing; Industrial training; Informatics; Layout; Manuals; Ontologies; Rendering (computer graphics);
Conference_Titel :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
DOI :
10.1109/ICDAR.2001.953816