Title :
An architecture for document management
Author :
Cavalcanti, G. ; Filho, Edson Costa de Barros Carvalho
Author_Institution :
Centro de Inf., Univ. Fed. de Pernambuco, Recife, Brazil
Abstract :
The main goal of this work is to investigate a computational architecture for a document management environment. Its purpose is to digitalize and extract information from documents of any type, transforming them into structured electronic documents. The environment is divided into specification and extraction modules. In the first module, the user performs the document specification, capturing physical, semantic and process information from the document. The extraction module uses this information, in order to recognize documents of the same class as the specified one. This environment offers the possibility of visualizing the classification results and to correct eventual mistakes it has made. Also, it allows document reconstruction from physical and semantic information captured in the specification module.
Keywords :
document image processing; feature extraction; image classification; natural languages; text analysis; document classification; document image processing; document management; document recognition; document reconstruction; document specification module; extraction module; feature extraction; information systems; natural language processing; pattern recognition; physical information; process information; semantic information; structured electronic documents; Computer architecture; Data mining; Image analysis; Image databases; Image processing; Image recognition; Image reconstruction; Image storage; Text analysis; Visual databases;
Conference_Titel :
Image Processing. 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7622-6
DOI :
10.1109/ICIP.2002.1039137