Title :
Joint Optimization of Hidden Conditional Random Fields and Non Linear Feature Extraction
Author :
Vinel, Antoine ; Do, Trinh Minh Tri ; Artières, Thierry
Author_Institution :
LIP6, Univ. Pierre et Marie Curie, Paris, France
Abstract :
We describe an hybrid model that combines deep neural networks (DNN) for nonlinear feature extraction and hidden conditional random fields (HCRF), i.e. conditional random fields with hidden states. The model is globally trained though joint optimization of HCRF and DNN parameters. To deal with this highly non convex optimization criterion, we propose a multi-step training which aims at providing a good initialization before the final joint optimization of all parameters. We investigate then the discriminative power of these models with respect to the architecture of the DNN, and compare our models to HMM and HCRF based algorithms on the IAM database.
Keywords :
convex programming; feature extraction; hidden Markov models; learning (artificial intelligence); neural nets; training; HMM; IAM database; deep neural networks; hidden conditional random fields; highly nonconvex optimization criterion; joint optimization; multistep training; non linear feature extraction; nonlinear feature extraction; Accuracy; Data models; Feature extraction; Handwriting recognition; Hidden Markov models; Optimization; Training; Conditional Random Fields; Deep Neural Networks; Handwriting recognition;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2011.109