Title :
Likelihood normalization using an ergodic HMM for continuous speech recognition
Author_Institution :
Univ. of Electro-Commun., Chofu, Japan
Abstract :
In recent speech recognition technology, the score of a hypothesis is often defined on the basis of HMM likelihood. As is well known, however, direct use of the likelihood as a scoring function causes difficult problems, especially when the length of a speech segment varies depending on the hypothesis, as in word-spotting, and some kind of normalization is indispensable. In this paper, a new method of likelihood normalization using an ergodic HMM is presented, and its performance is compared with those of conventional ones. The comparison is made from three points of view: recognition rate, word-end detection power and mean hypothesis length. It is concluded that the proposed method gives the best overall performance
Keywords :
hidden Markov models; maximum likelihood estimation; speech recognition; continuous speech recognition; ergodic hidden Markov model; hypothesis score; likelihood normalization; mean hypothesis length; performance; recognition rate; scoring function; speech segment length; word-end detection power; word-spotting; Decision theory; Hidden Markov models; Mutual information; Probability; Speech recognition; Viterbi algorithm; Vocabulary;
Conference_Titel :
Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on
Conference_Location :
Philadelphia, PA
Print_ISBN :
0-7803-3555-4
DOI :
10.1109/ICSLP.1996.607267