Title :
A Bayesian Approach to Script Independent Multilingual Keyword Spotting
Author :
Kumar, Girish ; Govindaraju, Vengatesan
Author_Institution :
Dept. of Comput. Sci. & Eng., Univ. at Buffalo, Amherst, NY, USA
Abstract :
We propose a script independent Bayesian framework for keyword spotting in multilingual handwritten documents. The approach relies on local character level score and global word level hypothesis scores and learns a Bayesian logistic regression classifier to distinguish between keywords and non-keywords. In a Bayesian formulation of logistic regression, the integral over weights becomes intractable. Variational approximation is used for inference. In order to learn a robust classifier with minimal number of samples, we apply Bayesian active learning framework to request labels for those word images which provide maximum information gain in improving the classifier. We evaluate our system on multilingual datasets, publicly available IAM dataset for English, AMA for Arabic and LAW dataset for Devanagiri. The system is also evaluated on a synthetic multilingual dataset prepared by combining samples from IAM, AMA and LAW datasets. The results are comparable with the state of art multilingual keyword spotting framework.
Keywords :
Bayes methods; document image processing; handwriting recognition; natural language processing; AMA; Arabic; Devanagiri; English; IAM; LAW; bayesian active learning; bayesian formulation; bayesian logistic regression classifier; global word level hypothesis scores; local character level score; multilingual handwritten documents; script independent multilingual keyword spotting; synthetic multilingual dataset; unconstrained handwriting recognition; variational approximation; Approximation methods; Bayes methods; Feature extraction; Hidden Markov models; Image recognition; Image segmentation; Logistics; Bayesian Active Learning; Handwritten Multilingual Documents; Script Independent; Spotting;
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
Conference_Location :
Heraklion
Print_ISBN :
978-1-4799-4335-7
DOI :
10.1109/ICFHR.2014.66