مرکز منطقه ای اطلاع رساني علوم و فناوري - High speed speech recognition using tree-structured probability density function

DocumentCode :

294634

Title :

High speed speech recognition using tree-structured probability density function

Author :

Watanabe, Takao ; Shinoda, Koichi ; Takagi, Keizaburo ; Iso, Kenichi

Author_Institution :

Inf. Technol. Res. Labs., NEC Corp., Kawasaki, Japan

Volume :

fYear :

1995

fDate :

9-12 May 1995

Firstpage :

556

Abstract :

This paper proposes a new speech recognition method using a tree-structured probability density function (PDF) to realize high speed HMM based speech recognition. In order to reduce the likelihood calculation for a PDF set composed of the Gaussian PDFs for all mixture components, all states and all recognition units, it is coarsely done for the element PDF whose likelihood is not likely to be large. The PDF set is expressed as a tree-structured form. In the recognition process, the likelihood set is calculated by searching the tree; by calculating the likelihood from the cluster PDF at the node and traversing the nodes with the largest likelihood from the root. Experimental results showed that the computation load was drastically reduced with little reduction in the recognition accuracy, in both speaker-independent and speaker-adaptive cases. The algorithm was applied to a personal computer speech recognition software without using special hardware

Keywords :

Gaussian distribution; hidden Markov models; maximum likelihood estimation; microcomputer applications; speech processing; speech recognition; telecommunication computing; tree searching; Gaussian PDF; HMM; PDF set; algorithm; cluster PDF; computation load reduction; element PDF; experimental results; high speed speech recognition; likelihood calculation; likelihood set; mixture components; personal computer; recognition accuracy; recognition units; speaker-adaptive recognition; speaker-independent recognition; speech recognition method; speech recognition software; tree searching; tree-structured probability density function; Clustering algorithms; Costs; Hidden Markov models; Information technology; Laboratories; Microcomputers; National electric code; Probability density function; Software algorithms; Speech recognition; Workstations;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on

Conference_Location :

Detroit, MI

ISSN :

1520-6149

Print_ISBN :

0-7803-2431-5

Type :

conf

DOI :

10.1109/ICASSP.1995.479658

Filename :

479658

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=294634