DocumentCode
696647
Title
Font clustering and classification in document images
Author
Ozturk, Serdar ; Sankur, Billent ; Abak, A.Toygar
Author_Institution
Boğaziçi University, Department of Electrical-Electronic Engineering, Bebek, Istanbul, Turkey
fYear
2000
fDate
4-8 Sept. 2000
Firstpage
1
Lastpage
4
Abstract
Clustering and identification of fonts in document images impacts on the performance of optical character recognition (OCR). Therefore font features and their clustering tendency are investigated. Font clustering is implemented both from shape similarity and from OCR performance points of view. A font recognition algorithm is developed to identify the font group with which a given text was created.
Keywords
Character recognition; Clustering algorithms; Discrete cosine transforms; Feature extraction; Optical character recognition software; Text recognition; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2000 10th European
Conference_Location
Tampere, Finland
Print_ISBN
978-952-1504-43-3
Type
conf
Filename
7075268
Link To Document