Title :
Binary tree-based precision-keeping clustering for very fast Japanese character recognition
Author :
Sobu, Yohei ; Goto, Hideaki ; Aso, Hirotomo
Author_Institution :
Grad. Sch. of Inf. Sci., Tohoku Univ., Sendai, Japan
Abstract :
Real-time character recognition in video frames has attracted considerable attention from developers since scene text recognition emerged as a new field of Optical Character Recognition (OCR) applications. Some oriental languages such as Japanese and Chinese have thousands of characters, so character recognition generally takes much longer than it does for European languages. Speeding up character recognition is crucial for developing software for mobile devices such as smartphones. This paper proposes a binary tree-based clustering technique that keeps precision as high as possible. The experimental results show that character recognition using the proposed clustering technique is 8.3 times faster than full linear matching, at a precision drop of only 0.22%. When the proposed method is combined with the Sequential Similarity Detection Algorithm (SSDA) and PCA-based dimensionality reduction, character matching becomes 36.2 times faster at a 0.29% precision drop.
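The abstract names SSDA but does not describe how the authors implement it. As a rough illustration only, the general early-termination idea behind SSDA can be sketched as below. The function name `ssda_nearest`, the use of squared Euclidean distance, and the plain-list feature vectors are assumptions made for this sketch and are not taken from the paper.

```python
def ssda_nearest(query, templates):
    """Find the template closest to `query` in squared Euclidean distance.

    Hypothetical helper for illustration: the distance to each template is
    accumulated element by element, and a candidate is abandoned as soon as
    its partial sum can no longer beat the best distance found so far.
    """
    best_idx, best_dist = -1, float("inf")
    for idx, tmpl in enumerate(templates):
        partial = 0.0
        for q, t in zip(query, tmpl):
            partial += (q - t) ** 2
            if partial >= best_dist:
                break  # early termination: this candidate cannot win
        else:
            # Loop finished without a break, so this is the new best match.
            best_idx, best_dist = idx, partial
    return best_idx, best_dist


if __name__ == "__main__":
    dictionary = [[0, 0, 0], [1, 1, 1], [5, 5, 5]]
    print(ssda_nearest([1, 1, 2], dictionary))
```

Early termination returns the same nearest template as a full linear scan, so by itself it costs no precision. How it is combined with the binary tree-based clustering and the PCA projection in the paper is not specified in this record.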
Keywords :
image matching; natural language processing; optical character recognition; pattern clustering; principal component analysis; smart phones; trees (mathematics); video signal processing; Chinese language; European language; Japanese language; PCA-based dimensionality reduction; binary tree-based precision-keeping clustering; full linear matching; mobile device software; realtime character recognition; scene text recognition; sequential similarity detection algorithm; very fast Japanese character recognition; video frame; Character recognition; Clustering algorithms; Dictionaries; Mobile handsets; Optical character recognition software; Real time systems; Vectors; Japanese character recognition; character clustering; dimensionality reduction; fast matching algorithm; real-time character recognition
Conference_Title :
Image and Vision Computing New Zealand (IVCNZ), 2010 25th International Conference of
Conference_Location :
Queenstown
Print_ISBN :
978-1-4244-9629-7
DOI :
10.1109/IVCNZ.2010.6148843