• DocumentCode
    2132384
  • Title

    A vector-quantizer based method of speaker normalization

  • Author

    Shin, Ok Keun

  • Author_Institution
    Sch. of Inf. Technol., Korea Maritime Univ., Busan, South Korea
  • fYear
    2005
  • fDate
    2005
  • Firstpage
    402
  • Lastpage
    407
  • Abstract
    As an effort to reduce the performance decline of speaker independent speech recognizers due to inter-speaker variations of vocal tract length among population, a method of speaker normalization based on vector quantization is proposed. In this paper, presented is an iterative method of constructing the ´normalized´ codebook that can be used as a text independent warp factor estimator for LVCSR system. Given the normalized codebook, the warp factor is estimated by searching the best fitting warped version of feature vectors of a given utterance. Throughout the whole process of normalized codebook construction and warp factor estimation, neither acoustic, nor phonetic knowledge is made use of The effectiveness of the proposed method is investigated by performing recognition experiments. The results showed more than 4% improvements in word level accuracy.
  • Keywords
    audio coding; feature extraction; iterative methods; speech recognition; vector quantisation; best fitting warped feature vectors; interspeaker variations; iterative method; normalized codebook; performance declination; phonetic knowledge; speaker independent speech recognizer; speaker normalization; speech recognition; text independent warp factor estimator; utterance; vector quantization; vocal tract length; warp factor estimation; Feature extraction; Frequency; Hidden Markov models; Information technology; Iterative methods; Loudspeakers; Maximum likelihood estimation; Signal processing; Speech recognition; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Information Science, 2005. Fourth Annual ACIS International Conference on
  • Print_ISBN
    0-7695-2296-3
  • Type

    conf

  • DOI
    10.1109/ICIS.2005.21
  • Filename
    1515437