Title :
Statistical Hypothesis Testing for Handwritten Word Segmentation Algorithms
Author :
Haji, Mohsin ; Sahoo, K.A. ; Bui, Tien D. ; Suen, Ching ; Ponson, Dominique
Author_Institution :
CENPARMI, Concordia Univ., Montréal, QC, Canada
Abstract :
We present a statistical hypothesis testing method for handwritten word segmentation algorithms. The proposed method can be used with any word segmentation algorithm to detect over-segmentation and under-segmentation errors, or to adapt the algorithm to new data in an unsupervised manner. The main idea is to learn the geometrical distribution of words within a sentence using a Markov chain or a Hidden Markov Model (HMM). In the former, we assume all the necessary information is observable, whereas in the latter, we assume the minimum observable variables are the bounding boxes of the words and the hidden variables are the part-of-speech information. Our experimental results on a benchmark database show that the proposed hypothesis testing achieves not only lower over-segmentation and under-segmentation error rates but also a higher correct segmentation rate.
Keywords :
handwriting recognition; hidden Markov models; speech processing; statistical testing; text analysis; unsupervised learning; word processing; HMM; Markov chain; benchmark database; correct segmentation rate; geometrical distribution; handwritten word segmentation algorithms; hidden Markov model; observable variables; over-segmentation error rate; over-segmented error; speech information; statistical hypothesis testing; under-segmentation error rate; under-segmented error; unsupervised manner; Adaptation models; Computational modeling; Databases; Hidden Markov models; Markov processes; Speech; Testing; Handwritten Word Segmentation; Hidden Markov Model; Hypothesis Testing; Markov Chain
Conference_Title :
Frontiers in Handwriting Recognition (ICFHR), 2012 International Conference on
Conference_Location :
Bari
Print_ISBN :
978-1-4673-2262-1
DOI :
10.1109/ICFHR.2012.272