DocumentCode :
294634
Title :
High speed speech recognition using tree-structured probability density function
Author :
Watanabe, Takao ; Shinoda, Koichi ; Takagi, Keizaburo ; Iso, Kenichi
Author_Institution :
Inf. Technol. Res. Labs., NEC Corp., Kawasaki, Japan
Volume :
1
fYear :
1995
fDate :
9-12 May 1995
Firstpage :
556
Abstract :
This paper proposes a new speech recognition method using a tree-structured probability density function (PDF) to realize high speed HMM based speech recognition. In order to reduce the likelihood calculation for a PDF set composed of the Gaussian PDFs for all mixture components, all states and all recognition units, it is coarsely done for the element PDF whose likelihood is not likely to be large. The PDF set is expressed as a tree-structured form. In the recognition process, the likelihood set is calculated by searching the tree; by calculating the likelihood from the cluster PDF at the node and traversing the nodes with the largest likelihood from the root. Experimental results showed that the computation load was drastically reduced with little reduction in the recognition accuracy, in both speaker-independent and speaker-adaptive cases. The algorithm was applied to a personal computer speech recognition software without using special hardware
Keywords :
Gaussian distribution; hidden Markov models; maximum likelihood estimation; microcomputer applications; speech processing; speech recognition; telecommunication computing; tree searching; Gaussian PDF; HMM; PDF set; algorithm; cluster PDF; computation load reduction; element PDF; experimental results; high speed speech recognition; likelihood calculation; likelihood set; mixture components; personal computer; recognition accuracy; recognition units; speaker-adaptive recognition; speaker-independent recognition; speech recognition method; speech recognition software; tree searching; tree-structured probability density function; Clustering algorithms; Costs; Hidden Markov models; Information technology; Laboratories; Microcomputers; National electric code; Probability density function; Software algorithms; Speech recognition; Workstations;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
ISSN :
1520-6149
Print_ISBN :
0-7803-2431-5
Type :
conf
DOI :
10.1109/ICASSP.1995.479658
Filename :
479658
Link To Document :
بازگشت