A vector-quantizer based method of speaker normalization

Author

Shin, Ok Keun

Author_Institution

Sch. of Inf. Technol., Korea Maritime Univ., Busan, South Korea

fYear

2005

fDate

2005

Firstpage

402

Lastpage

407

Abstract

As an effort to reduce the performance decline of speaker independent speech recognizers due to inter-speaker variations of vocal tract length among population, a method of speaker normalization based on vector quantization is proposed. In this paper, presented is an iterative method of constructing the ´normalized´ codebook that can be used as a text independent warp factor estimator for LVCSR system. Given the normalized codebook, the warp factor is estimated by searching the best fitting warped version of feature vectors of a given utterance. Throughout the whole process of normalized codebook construction and warp factor estimation, neither acoustic, nor phonetic knowledge is made use of The effectiveness of the proposed method is investigated by performing recognition experiments. The results showed more than 4% improvements in word level accuracy.

Keywords

audio coding; feature extraction; iterative methods; speech recognition; vector quantisation; best fitting warped feature vectors; interspeaker variations; iterative method; normalized codebook; performance declination; phonetic knowledge; speaker independent speech recognizer; speaker normalization; speech recognition; text independent warp factor estimator; utterance; vector quantization; vocal tract length; warp factor estimation; Feature extraction; Frequency; Hidden Markov models; Information technology; Iterative methods; Loudspeakers; Maximum likelihood estimation; Signal processing; Speech recognition; Vector quantization;

fLanguage

English

Publisher

ieee

Conference_Titel

Computer and Information Science, 2005. Fourth Annual ACIS International Conference on

Print_ISBN

0-7695-2296-3

Type

conf

DOI

10.1109/ICIS.2005.21

Filename

1515437