Title :
Performance of hybrid MMI-connectionist/HMM systems on the WSJ speech database
Author :
Rottland, J. ; Neukirchen, Ch ; Willett, D.
Author_Institution :
Dept. of Comput. Sci., Gerhard-Mercator-Univ. Duisburg, Germany
Abstract :
A hybrid MMI-connectionist/hidden Markov model (HMM) speech recognition system for the Wall Street Journal (WSJ) database is presented. The HMM part of this system uses discrete probability density functions (PDF). The neural network (NN) is used to replace a classical vector quantizer (VQ) like a k-means or LBG algorithm, which are typically used in discrete HMM systems. The NN is trained on an algorithm, that tries to achieve maximum mutual information (MMI) between the generated output labels and the underlying phonetic description. The system has been trained and tested with the five thousand word speaker independent WSJ task. The error rates of the MMI-connectionist approach are 21% lower than the error rates of a k-means system. The system achieves error rates which have been achieved before only by the best continuous/semi-continuous HMM speech recognizers, with the advantage of a faster recognition algorithm
Keywords :
acoustic signal processing; hidden Markov models; neural nets; probability; speech processing; speech recognition; training; PDF; Resource Management database; WSJ speech database; discrete probability density functions; error rates; hidden Markov model; hybrid MMI-connectionist/HMM systems; maximum mutual information; neural network; output labels; phonetic description; recognition algorithm; speech recognition system; system performance; training; Cepstral analysis; Computer science; Error analysis; Hidden Markov models; Mutual information; Neural networks; Probability density function; Spatial databases; Speech recognition; System testing;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.598862