DocumentCode :
3488792
Title :
Using the Web to Create Dynamic Dictionaries in Handwritten Out-of-Vocabulary Word Recognition
Author :
Oprean, Cristina ; Likforman-Sulem, Laurence ; Popescu, Adrian ; Mokbel, Chafic
Author_Institution :
Inst. Mines-Telecom, Telecom ParisTech, Paris, France
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
989
Lastpage :
993
Abstract :
Handwriting recognition systems rely on predefined dictionaries obtained from training data. Small and static dictionaries are usually exploited to obtain high in-vocabulary (IV) accuracy at the expense of coverage. Thus the recognition of out-of-vocabulary (OOV) words cannot be handled efficiently. To improve OOV recognition while keeping IV dictionaries small, we introduce a multi-step approach that exploits Web resources. After an initial IV-OOV sequence classification, external resources are used to create OOV sequence-adapted dynamic dictionaries. A final Viterbi-based decoding is performed over the dynamic dictionary to determine the most probable word for the OOV sequence. We validate our approach with experiments conducted on RIMES, a publicly available database. Results show that improvements are obtained compared to standard handwriting recognition, performed with a static dictionary. Both domain adapted and generic dynamic dictionaries are studied and we show that domain adaptation is beneficial.
Keywords :
Internet; Viterbi decoding; dictionaries; handwritten character recognition; image classification; image coding; image sequences; visual databases; IV dictionaries; IV-OOV sequence classification; OOV recognition; OOV sequence-adapted dynamic dictionaries; RIMES; Viterbi-based decoding; Web resources; external resources; handwritten out-of-vocabulary word recognition system; in-vocabulary accuracy; multistep approach; predefined dictionaries; publicly available database; training data; Dictionaries; Electronic publishing; Encyclopedias; Handwriting recognition; Hidden Markov models; Internet; Web resources; dynamic dictionaries; handwriting word recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.199
Filename :
6628764
Link To Document :
بازگشت