DocumentCode
178115
Title
Bayesian Active Learning for Keyword Spotting in Handwritten Documents
Author
Kumar, G. ; Govindaraju, V.
Author_Institution
Dept. of Comput. Sci. & Eng., Univ. at Buffalo, Amherst, NY, USA
fYear
2014
fDate
24-28 Aug. 2014
Firstpage
2041
Lastpage
2046
Abstract
We propose the Bayesian Active Learning by Disagreement (BALD) model for keyword spotting in handwritten documents. In the context of keyword spotting in handwritten documents, the background text is all regions in the document that do not contain the keywords. The model tries to learn certain characteristics of the keyword and background text in an active learning framework. It takes into account the local character level scores and global word level scores to distinguish keywords from non-keywords. We propose to apply the bayesian active learning strategy to identify the regions of sample space from which more meaningful labeled samples of keywords and non-keywords can be extracted. This work is an extension to our previous work which used a variational dynamic background model to model the large variations of background text. The approach has been tested on IAM dataset for English. The results show that a decent background model can be learned in a more quicker and efficient manner using the BALD framework. The approach outperforms our prior work and other state of the art approaches.
Keywords
Bayes methods; document image processing; handwriting recognition; learning (artificial intelligence); text analysis; BALD model; Bayesian active learning by disagreement model; IAM dataset; background text; global word level scores; handwritten documents; keyword spotting; local character level scores; variational dynamic background model; Bayes methods; Entropy; Feature extraction; Hidden Markov models; Image segmentation; Logistics; Mathematical model; Bayesian Active Learning; Handwriting Recognition; Spotting;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location
Stockholm
ISSN
1051-4651
Type
conf
DOI
10.1109/ICPR.2014.356
Filename
6977068
Link To Document