DocumentCode :
3019398
Title :
Financial document image coding with regions of interest using JPEG2000
Author :
Yin, Xu-Cheng ; Liu, Chang-ping ; Han, Zhi
fYear :
2005
fDate :
29 Aug.-1 Sept. 2005
Firstpage :
96
Abstract :
Document image coding is an important issue in document analysis and recognition systems that handle large numbers of samples. An image compression algorithm with regions of interest (ROIs) using JPEG2000 is proposed for financial document images, which have various categories, complex layouts, and irregular noise. Three types of ROIs (filled-information ROIs, seal ROIs, and handwriting ROIs) are detected and extracted through document knowledge analysis and handwriting identification. The first type is detected by document classification, the second is extracted by connected component analysis based on color and shape information, and the third is located by handwriting identification using an incremental Fisher linear discriminant classifier. An ROI mask of arbitrary shape is constructed by thresholding and merging these ROIs. Finally, the financial document image is encoded using JPEG2000 Part I with this ROI mask. Compared with JPEG and DjVu, the method improves visual quality while reducing storage space.
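The mask-construction step described in the abstract (thresholding and merging the three ROI types) can be illustrated with a minimal sketch. The abstract does not give the paper's actual procedure, so everything here is an assumption made for illustration: the per-pixel score maps, the single cutoff of 0.5, the merge by logical OR, and the function names `threshold` and `merge_masks`.

```python
from typing import List

ScoreMap = List[List[float]]
Mask = List[List[bool]]


def threshold(scores: ScoreMap, cutoff: float) -> Mask:
    """Mark a pixel as ROI when its score reaches the (hypothetical) cutoff."""
    return [[value >= cutoff for value in row] for row in scores]


def merge_masks(masks: List[Mask]) -> Mask:
    """Merge same-sized binary masks: a pixel is ROI if any input marks it."""
    if not masks:
        raise ValueError("at least one mask is required")
    height, width = len(masks[0]), len(masks[0][0])
    for mask in masks:
        if len(mask) != height or any(len(row) != width for row in mask):
            raise ValueError("all masks must have the same shape")
    return [
        [any(mask[y][x] for mask in masks) for x in range(width)]
        for y in range(height)
    ]


# Toy 2x3 score maps, one per ROI type (illustrative values only).
filled_scores = [[0.9, 0.1, 0.0], [0.8, 0.2, 0.0]]
seal_scores = [[0.0, 0.0, 0.7], [0.0, 0.0, 0.6]]
handwriting_scores = [[0.0, 0.4, 0.0], [0.0, 0.6, 0.0]]

roi_mask = merge_masks(
    [threshold(s, 0.5) for s in (filled_scores, seal_scores, handwriting_scores)]
)
```

A merged mask of this kind has no fixed outline, which matches the arbitrarily shaped ROI mask the abstract mentions. A JPEG2000 encoder with ROI support would then take such a mask as input.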
Keywords :
data compression; document image processing; financial data processing; handwriting recognition; image classification; image coding; image colour analysis; JPEG2000; connected component analysis; document analysis systems; document classification; document knowledge analysis; document recognition systems; financial document image coding; handwriting identification; image color; image compression algorithm; image shape; incremental Fisher linear discriminant classifier; regions of interest; visual quality; Data mining; Image analysis; Image coding; Image recognition; Information analysis; Noise shaping; Seals; Shape; Text analysis; Transform coding;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
ISSN :
1520-5263
Print_ISBN :
0-7695-2420-6
Type :
conf
DOI :
10.1109/ICDAR.2005.113
Filename :
1575517