Title :
High speed speech recognition using tree-structured probability density function
Author :
Watanabe, Takao ; Shinoda, Koichi ; Takagi, Keizaburo ; Iso, Kenichi
Author_Institution :
Inf. Technol. Res. Labs., NEC Corp., Kawasaki, Japan
Abstract :
This paper proposes a new speech recognition method using a tree-structured probability density function (PDF) to realize high speed HMM based speech recognition. In order to reduce the likelihood calculation for a PDF set composed of the Gaussian PDFs for all mixture components, all states and all recognition units, it is coarsely done for the element PDF whose likelihood is not likely to be large. The PDF set is expressed as a tree-structured form. In the recognition process, the likelihood set is calculated by searching the tree; by calculating the likelihood from the cluster PDF at the node and traversing the nodes with the largest likelihood from the root. Experimental results showed that the computation load was drastically reduced with little reduction in the recognition accuracy, in both speaker-independent and speaker-adaptive cases. The algorithm was applied to a personal computer speech recognition software without using special hardware
Keywords :
Gaussian distribution; hidden Markov models; maximum likelihood estimation; microcomputer applications; speech processing; speech recognition; telecommunication computing; tree searching; Gaussian PDF; HMM; PDF set; algorithm; cluster PDF; computation load reduction; element PDF; experimental results; high speed speech recognition; likelihood calculation; likelihood set; mixture components; personal computer; recognition accuracy; recognition units; speaker-adaptive recognition; speaker-independent recognition; speech recognition method; speech recognition software; tree searching; tree-structured probability density function; Clustering algorithms; Costs; Hidden Markov models; Information technology; Laboratories; Microcomputers; National electric code; Probability density function; Software algorithms; Speech recognition; Workstations;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479658