Title : 
Representing dynamic features of phonetic segment in an orthogonalized codebook of HMM based speech recognition system
         
        
            Author : 
Nitta, Tsuneo ; Iwasaki, Jun´ichi ; Masai, Yasuyuki ; Matsu´ura, Hiroshi
         
        
            Author_Institution : 
Toshiba Corp., Kawasaki, Japan
         
        
        
        
        
        
            Abstract : 
The authors propose a matrix quantization (MQ) algorithm named statistical MQ (SMQ) which uses an orthogonalized phonetic segment codebook. The SMQ effectively incorporates pattern variations of each phonetic segment into the orthogonalized phonetic segment codebook, and transforms an input speech to a sequence of phonetic symbols which include about 700 types of phonetic segments. The authors also propose a simple SMQ-HMM training algorithm called an equally counted K-based learning in which each phonetic event observed within the best K is equally counted in a model and output probabilities are smoothed without fuzzy rule. The proposed algorithm has been tested on a 546-word vocabulary data set uttered by 10 unknown speakers, using a real time recognition system, and has achieved the high performance of 96.5%
         
        
            Keywords : 
hidden Markov models; speech coding; speech recognition; SMQ; dynamic features; equally counted K-based learning; input speech; matrix quantization algorithm; orthogonalized codebook; output probabilities; phonetic segment; real time recognition system; speech recognition system; statistical MQ; training algorithm; vocabulary data set; Hidden Markov models; Information systems; Karhunen-Loeve transforms; Laboratories; Quantization; Real time systems; Speech recognition; System testing; Systems engineering and theory; Vocabulary;
         
        
        
        
            Conference_Titel : 
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
         
        
            Conference_Location : 
San Francisco, CA
         
        
        
            Print_ISBN : 
0-7803-0532-9
         
        
        
            DOI : 
10.1109/ICASSP.1992.225891