Title :
Genetic algorithm for optimizing the nonlinear time alignment of automatic speech recognition systems
Author :
Kwong, S. ; Chau, C.W. ; Halang, Wolfgang A.
Author_Institution :
City Univ. of Hong Kong, Hong Kong
fDate :
10/1/1996 12:00:00 AM
Abstract :
Dynamic time warping (DTW) is a nonlinear time-alignment technique for automatic speech recognition (ASR) systems. It had been widely used in many commercial and industrial products, ranging from electronic dailies/dictionaries to wireless voice digit dialers. DTW has the advantages of fast training and searching times, which makes it more popular than other available ASR techniques. However, there exist some limitations to DTW, such as the stringent rule on slope weighting, the nontrivial computation of the K-best paths, and the significant increase in computational time when the endpoint constraint is relaxed or the variations of the length of pattern increased. In this paper, a stochastic method called the genetic algorithm (GA), which is used to solve the nonlinear time alignment problem, is presented. Experimental results show that the GA has a better performance than the DTW. In addition, two derivatives of GA: the hybrid GA and the parallel GA are also presented
Keywords :
computational complexity; genetic algorithms; speech processing; speech recognition; speech recognition equipment; K-best paths; automatic speech recognition systems; commercial products; computational time; dynamic time warping; electronic products; endpoint constraint; genetic algorithm; industrial products; nonlinear time alignment optimisation; performance; slope weighting; stochastic method; Automatic speech recognition; Data mining; Feature extraction; Genetic algorithms; Pattern recognition; Performance evaluation; Spatial databases; Speech recognition; Stochastic processes; Testing;
Journal_Title :
Industrial Electronics, IEEE Transactions on