Dynamic segmental vector quantization in isolated-word speech recognition

Author

Nhat, Vo Dinh Minh ; Lee, Sungyoung

Author_Institution

Dept. of Comput. Eng., Kyung Hee Univ., Yongin-Si, South Korea

fYear

2004

fDate

18-21 Dec. 2004

Firstpage

204

Lastpage

208

Abstract

The standard vector quantization (VQ) approach that uses a single vector quantizer for each entire duration of the utterance of each class suffers from the following two limitations: 1) high computational cost for large codebook sizes and 2) lack of explicit characterization of the sequential behavior. Both of two these disadvantages can be remedied by treating each utterance class as a concatenation of several information subsources, each of which is represented by a VQ codebook. With this approach, segmentation schemes obviously need to be investigated. And we call this VQ approach dynamic segmental vector quantization (DSVQ). This paper shows how to design DSVQ with some effective segmentation schemes. Better performances could be seen when applying this approach itself or mixed with hidden Markov model (HMM) in isolated-word speech recognition.

Keywords

hidden Markov models; speech coding; speech recognition; vector quantisation; DSVQ; HMM; VQ codebook; concatenation; dynamic segmental vector quantization; hidden Markov model; isolated-word speech recognition; segmentation scheme; several information subsource; Application software; Clustering algorithms; Code standards; Computational efficiency; Data compression; Hidden Markov models; Speech analysis; Speech recognition; Vector quantization; Vocabulary;

fLanguage

English

Publisher

ieee

Conference_Titel

Signal Processing and Information Technology, 2004. Proceedings of the Fourth IEEE International Symposium on

Print_ISBN

0-7803-8689-2

Type

conf

DOI

10.1109/ISSPIT.2004.1433722

Filename

1433722