Title :
Digital mountain: from granite archive to global access
Author :
Barrett, William ; Hutchison, Luke ; Quass, Dallan ; Nielson, Heath ; Kennard, Douglas
Author_Institution :
Dept. of Comput. Sci., Brigham Young Univ., Provo, UT, USA
Abstract :
Large-scale, multiterabyte digital libraries are becoming feasible due to decreasing costs of storage, CPU, and bandwidth. However, costs associated with preparing content for input into the library remain high due to the amount of human labor required. We describe the digital microfilm pipeline -sequence of image processing operations used to populate a large-scale digital library from a "mountain" of microfilm and reduce the human labor involved. Essential parts of the pipeline include algorithms for document zoning and labeling, consensus-based template creation, reversal of geometric transformations and just-in-time browsing, an interactive technique for progressive access of image content over a low-bandwidth medium. We also suggest more automated approaches to cropping, enhancement and data extraction.
Keywords :
digital libraries; document image processing; microforms; consensus-based template creation; data extraction; digital library; digital microfilm pipeline; digital mountain; document zoning; geometric transformation; image processing; just-in-time browsing; labeling; Bandwidth; Computer science; Costs; Hardware; History; Humans; Image processing; Large-scale systems; Pipelines; Software libraries;
Conference_Titel :
Document Image Analysis for Libraries, 2004. Proceedings. First International Workshop on
Print_ISBN :
0-7695-2088-X
DOI :
10.1109/DIAL.2004.1263243