Title :
Unsupervised HMM Adaptation Using Page Style Clustering
Author :
Cao, Huaigu ; Prasad, Rohit ; Saleem, Shirin ; Natarajan, Premkumar
Author_Institution :
BBN Technol., Cambridge, MA, USA
Abstract :
In this paper we present an innovative two-stage adaptation approach for handwriting recognition that is based on clustering of similar pages in the training data. In our approach, we first perform page clustering on training data using features such as contour slope, pen pressure, writing velocity, and stroke sparseness. Next, we adapt the writer-independent hidden Markov models (HMMs) to each cluster in the training data. While decoding a test page, we first determine the cluster the test page belongs to and then decode the page with the model associated with that cluster. Experimental results with the two-stage adaptation show significant gains on a held-out validation set.
Keywords :
decoding; handwriting recognition; hidden Markov models; image coding; pattern clustering; unsupervised learning; decoding; handwriting recognition; hidden Markov model; page style clustering; training data; unsupervised HMM adaptation; Error analysis; Handwriting recognition; Hidden Markov models; Loudspeakers; Maximum likelihood decoding; Maximum likelihood linear regression; Speech recognition; Testing; Training data; Writing;
Conference_Titel :
Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
Conference_Location :
Barcelona
Print_ISBN :
978-1-4244-4500-4
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2009.77