DocumentCode
2013697
Title
Appearance Based Models in Document Script Identification
Author
Vikram, T.N. ; Guru, D.S.
Author_Institution
Univ. of Mysore, Mysore
Volume
2
fYear
2007
fDate
23-26 Sept. 2007
Firstpage
709
Lastpage
713
Abstract
In this paper we employ appearance based models for document script identification. They are employed to identify scripts at both paragraph and word level. Elaborate experimentation has been conducted which has revealed that they are robust enough to handle highly confusing scripts and their performance does not degrade drastically even in the presence of noise. A generic script identification has been attempted, to identify both Asian and European scripts by considering a dataset of twenty different languages.
Keywords
document image processing; natural language processing; Asian scripts; European scripts; appearance based models; confusing scripts; document script identification; generic script identification; Automation; Character recognition; Computer science; Covariance matrix; Degradation; Europe; Information management; Noise robustness; Principal component analysis; Sorting;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location
Parana
ISSN
1520-5363
Print_ISBN
978-0-7695-2822-9
Type
conf
DOI
10.1109/ICDAR.2007.4377007
Filename
4377007
Link To Document