Title :
Managing document images in a digital library: an ontology guided approach
Author :
Harit, Gaurav ; Chaudhury, Santanu ; Ghosh, Hiranmay
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol., New Delhi, India
Abstract :
We present Heritage+, an integrated platform for interactive access of different types of media elements through an unified interface. A unique aspect of Heritage+ is that it deals with document images as distinct media type and implements tools and techniques for browsing and querying document images along with other media elements like video sequences and images. Further, Heritage+ proposes a new scheme for encoding and use of ontology for accessing multimedia collection. In the context of document images, the ontology specifies the document class-specific semantics of the logical components that help in an automated semantically meaningful linking of documents and their components with heterogeneous media-type resources. Further, Heritage+ supports conceptual query of document images along with other media elements. This multifunctional access interface to the document images is provided in Heritage+ using a novel model guided document image segmentation scheme and word-image based indexing scheme.
Keywords :
content management; digital libraries; document image processing; encoding; image segmentation; indexing; query processing; user interfaces; Heritage+; content analysis; document class-specific semantic; document image analysis; document image querying; encoding; image segmentation; image sequence; media element; multifunctional access interface; ontology guided approach; page segmentation; video sequence; word-image based indexing; Encoding; Image analysis; Image segmentation; Image sequence analysis; Indexing; Ontologies; Software libraries; Text analysis; Video sequences; XML;
Conference_Titel :
Document Image Analysis for Libraries, 2004. Proceedings. First International Workshop on
Print_ISBN :
0-7695-2088-X
DOI :
10.1109/DIAL.2004.1263238