Title :
Dynamic segmental vector quantization in isolated-word speech recognition
Author :
Nhat, Vo Dinh Minh ; Lee, Sungyoung
Author_Institution :
Dept. of Comput. Eng., Kyung Hee Univ., Yongin-Si, South Korea
Abstract :
The standard vector quantization (VQ) approach that uses a single vector quantizer for each entire duration of the utterance of each class suffers from the following two limitations: 1) high computational cost for large codebook sizes and 2) lack of explicit characterization of the sequential behavior. Both of two these disadvantages can be remedied by treating each utterance class as a concatenation of several information subsources, each of which is represented by a VQ codebook. With this approach, segmentation schemes obviously need to be investigated. And we call this VQ approach dynamic segmental vector quantization (DSVQ). This paper shows how to design DSVQ with some effective segmentation schemes. Better performances could be seen when applying this approach itself or mixed with hidden Markov model (HMM) in isolated-word speech recognition.
Keywords :
hidden Markov models; speech coding; speech recognition; vector quantisation; DSVQ; HMM; VQ codebook; concatenation; dynamic segmental vector quantization; hidden Markov model; isolated-word speech recognition; segmentation scheme; several information subsource; Application software; Clustering algorithms; Code standards; Computational efficiency; Data compression; Hidden Markov models; Speech analysis; Speech recognition; Vector quantization; Vocabulary;
Conference_Titel :
Signal Processing and Information Technology, 2004. Proceedings of the Fourth IEEE International Symposium on
Print_ISBN :
0-7803-8689-2
DOI :
10.1109/ISSPIT.2004.1433722