DocumentCode
2132384
Title
A vector-quantizer based method of speaker normalization
Author
Shin, Ok Keun
Author_Institution
Sch. of Inf. Technol., Korea Maritime Univ., Busan, South Korea
fYear
2005
fDate
2005
Firstpage
402
Lastpage
407
Abstract
As an effort to reduce the performance decline of speaker independent speech recognizers due to inter-speaker variations of vocal tract length among population, a method of speaker normalization based on vector quantization is proposed. In this paper, presented is an iterative method of constructing the ´normalized´ codebook that can be used as a text independent warp factor estimator for LVCSR system. Given the normalized codebook, the warp factor is estimated by searching the best fitting warped version of feature vectors of a given utterance. Throughout the whole process of normalized codebook construction and warp factor estimation, neither acoustic, nor phonetic knowledge is made use of The effectiveness of the proposed method is investigated by performing recognition experiments. The results showed more than 4% improvements in word level accuracy.
Keywords
audio coding; feature extraction; iterative methods; speech recognition; vector quantisation; best fitting warped feature vectors; interspeaker variations; iterative method; normalized codebook; performance declination; phonetic knowledge; speaker independent speech recognizer; speaker normalization; speech recognition; text independent warp factor estimator; utterance; vector quantization; vocal tract length; warp factor estimation; Feature extraction; Frequency; Hidden Markov models; Information technology; Iterative methods; Loudspeakers; Maximum likelihood estimation; Signal processing; Speech recognition; Vector quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer and Information Science, 2005. Fourth Annual ACIS International Conference on
Print_ISBN
0-7695-2296-3
Type
conf
DOI
10.1109/ICIS.2005.21
Filename
1515437
Link To Document