DocumentCode
3240094
Title
Dynamic segmental vector quantization in isolated-word speech recognition
Author
Nhat, Vo Dinh Minh ; Lee, Sungyoung
Author_Institution
Dept. of Comput. Eng., Kyung Hee Univ., Yongin-Si, South Korea
fYear
2004
fDate
18-21 Dec. 2004
Firstpage
204
Lastpage
208
Abstract
The standard vector quantization (VQ) approach that uses a single vector quantizer for each entire duration of the utterance of each class suffers from the following two limitations: 1) high computational cost for large codebook sizes and 2) lack of explicit characterization of the sequential behavior. Both of two these disadvantages can be remedied by treating each utterance class as a concatenation of several information subsources, each of which is represented by a VQ codebook. With this approach, segmentation schemes obviously need to be investigated. And we call this VQ approach dynamic segmental vector quantization (DSVQ). This paper shows how to design DSVQ with some effective segmentation schemes. Better performances could be seen when applying this approach itself or mixed with hidden Markov model (HMM) in isolated-word speech recognition.
Keywords
hidden Markov models; speech coding; speech recognition; vector quantisation; DSVQ; HMM; VQ codebook; concatenation; dynamic segmental vector quantization; hidden Markov model; isolated-word speech recognition; segmentation scheme; several information subsource; Application software; Clustering algorithms; Code standards; Computational efficiency; Data compression; Hidden Markov models; Speech analysis; Speech recognition; Vector quantization; Vocabulary;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing and Information Technology, 2004. Proceedings of the Fourth IEEE International Symposium on
Print_ISBN
0-7803-8689-2
Type
conf
DOI
10.1109/ISSPIT.2004.1433722
Filename
1433722
Link To Document