Title :
Telephone speech data corpus and performances of speaker independent recognition system using the corpus
Author :
Isobe, T. ; Murakami, K.
Author_Institution :
Lab. for Inf. Technol., NTT Data, Kawasaki, Japan
Abstract :
The authors first describe the speech data corpus they collected from 400 male and 400 female subjects over the phone. They then compare the performances of two types of triphone model based speaker independent recognition systems, in which they used the corpus for training models and testing. One system uses a normal continuous mixture density HMM, and the other uses a CDHMM with a tree structure of 2,064 Gaussian distributions, which needs only one thirtieth of the Gaussian computation of a normal one. As a result, the system with the tree-structure CDHMM performed as well as 3% less than the system using the normal CDHMM. This shows that tree-structure CDHMM are useful for telephone speech recognition
Keywords :
Gaussian distribution; hidden Markov models; speech recognition; telephony; tree data structures; CDHMM; Gaussian distributions; continuous mixture density HMM; female subjects; male subjects; performances; speaker independent recognition system; telephone speech data corpus; tree structure; triphone model; Banking; Cities and towns; Gaussian distribution; Hidden Markov models; Information management; Speech recognition; System testing; Technology management; Telephony; Tree data structures;
Conference_Titel :
Interactive Voice Technology for Telecommunications Applications, 1994., Second IEEE Workshop on
Conference_Location :
Kyoto
Print_ISBN :
0-7803-2074-3
DOI :
10.1109/IVTTA.1994.341535