Title :
Backpropagation training for multilayer conditional random field based phone recognition
Author :
Prabhavalkar, Rohit ; Fosler-Lussier, Eric
Author_Institution :
Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA
Abstract :
Conditional random fields (CRFs) have recently found increased popularity in automatic speech recognition (ASR) applications. CRFs have previously been shown to be effective combiners of posterior estimates from multilayer perceptrons (MLPs) in phone and word recognition tasks. In this paper, we describe a novel hybrid Multilayer-CRF structure (ML-CRF), where a MLP-like hidden layer serves as input to the CRF; moreover, we propose a technique for directly training the ML-CRF to optimize a conditional log-likelihood based criterion, based on error backpropagation. The proposed technique thus allows for the implicit learning of suitable feature functions for the CRF. We present results for initial phone recognition experiments on the TIMIT database that indicate that our proposed method is a promising approach for training CRFs.
Keywords :
backpropagation; multilayer perceptrons; random processes; speech recognition; MLP-like hidden layer; TIMIT database; automatic speech recognition; backpropagation training; conditional log-likelihood based criterion; error backpropagation; multilayer conditional random field; multilayer perceptrons; multilayer-CRF structure; phone recognition; word recognition tasks; Application software; Automatic speech recognition; Backpropagation; Computer science; Hidden Markov models; Multilayer perceptrons; Nonhomogeneous media; Probability distribution; Spatial databases; Speech recognition; Backpropagation; Multilayer Perceptrons; Random Fields; Speech Recognition;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495222