Title :
Joint discriminative learning of acoustic and language models on decoding graphs
Author :
Abdelhamid, Abdelaziz A. ; Abdulla, Waleed H.
Author_Institution :
Dept. of Electr. & Comput. Eng., Auckland Univ., Auckland, New Zealand
fDate :
Oct. 29 2013-Nov. 1 2013
Abstract :
In traditional models of speech recognition, acoustic and language models are treated in independence and usually estimated separately, which may yield a suboptimal recognition performance. In this paper, we propose a joint optimization framework for learning the parameters of acoustic and language models using minimum classification error criterion. The joint optimization is performed in terms of a decoding graph constructed using weighted finite-state transducers based on context-dependent hidden Markov models and trigram language models. To emphasize the effectiveness of the proposed framework, two speech corpora, TIMIT and Resource Management (RM1), are incorporated in the conducted experiments. The preliminary experiments show that the proposed approach can achieve significant reduction in phone, word and sentence error rates on both TIMIT and RM1 when compared with conventional parameter estimation approaches.
Keywords :
decoding; graph theory; hidden Markov models; learning (artificial intelligence); natural languages; parameter estimation; pattern classification; speech recognition; RM1; TIMIT; acoustic models; context-dependent hidden Markov models; decoding graphs; joint discriminative learning; joint optimization framework; language models; minimum classification error criterion; parameter estimation approach; resource management; speech corpora; speech recognition; suboptimal recognition performance; trigram language models; weighted finite-state transducers; Acoustics; Decoding; Hidden Markov models; Joints; Optimization; Speech recognition; Training;
Conference_Titel :
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2013 Asia-Pacific
Conference_Location :
Kaohsiung
DOI :
10.1109/APSIPA.2013.6694237