DocumentCode :
183323
Title :
An Intelligent Sample Selection Approach to Language Model Adaptation for Hand-Written Text Recognition
Author :
Tanha, Jafar ; de Does, Jesse ; Depuydt, Katrien
Author_Institution :
Inst. for Dutch Lexicology (INL), Leiden, Netherlands
fYear :
2014
fDate :
1-4 Sept. 2014
Firstpage :
349
Lastpage :
354
Abstract :
We present an intelligent sample selection approach to language model adaptation for handwritten text recognition, which exploits a combination of in-domain and out-of-domain data for construction of language models. In comparison to approaches proposed in the literature, our approach is characterized by a careful consideration of the criteria used for ranking samples and an innovative approach to sample selection which iteratively extends the training set for two language models. We propose two methods, in which agreement or disagreement of two ranking criteria (one for each language model) guides the selection of samples to add to the training sets of the models. Both approaches are shown to clearly outperform a strong baseline consisting of a carefully tuned interpolation of in-domain and out-of-domain language models.
Keywords :
document image processing; feature selection; handwritten character recognition; natural language processing; text analysis; text detection; document image; handwritten text recognition; intelligent sample selection; language model adaptation; Adaptation models; Data models; Dictionaries; Interpolation; Measurement; Text recognition; Training; Domain adaptation; Handwritten text recognition; Language modeling; Sample selection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
Conference_Location :
Heraklion
ISSN :
2167-6445
Print_ISBN :
978-1-4799-4335-7
Type :
conf
DOI :
10.1109/ICFHR.2014.65
Filename :
6981044
Link To Document :
بازگشت