• DocumentCode
    3406909
  • Title

    A frequency warping approach for vocal tract length normalization

  • Author

    Qi, Ding ; Wang, Xu ; Bingxi, Wang

  • Author_Institution
    Inf. Eng. Univ., Henan, China
  • Volume
    1
  • fYear
    2004
  • fDate
    31 Aug.-4 Sept. 2004
  • Firstpage
    691
  • Abstract
    A method of vocal tract length normalization (VTLN) is proposed. It uses a bilinear transform (BLT) to modify the filterbank used in computing the Mel-frequency cepstrum, based on the average third formant F3. The effectiveness of the method is examined on vowel recognition and isolated-digit recognition. The baseline vowel recognition models are trained on male speakers' data, and the baseline isolated-digit models are trained on adult men's data. When the Mel-frequency cepstral coefficients (MFCCs) of the test data are transformed by the BLT, the recognition accuracy for females' vowels improves by 11.67%, and the recognition accuracies for adult women's and children's isolated digits improve by 19.5% and 13%, respectively.
  • Keywords
    channel bank filters; speaker recognition; speech synthesis; transforms; Mel-frequency cepstrum; baseline vowel recognition; bilinear transform; frequency warping; speech recognition; vocal tract length normalization; Acoustics; Filter bank; Frequency estimation; Loudspeakers; Low pass filters; Mel frequency cepstral coefficient; Piecewise linear techniques; Speech processing; Speech recognition; Testing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing, 2004. Proceedings. ICSP '04. 2004 7th International Conference on
  • Print_ISBN
    0-7803-8406-7
  • Type

    conf

  • DOI
    10.1109/ICOSP.2004.1452757
  • Filename
    1452757
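
The abstract describes warping the frequency axis with a bilinear transform before the Mel filterbank is applied. As a rough illustration only, the sketch below uses the textbook first-order all-pass warping function, w' = w + 2*atan(a*sin(w) / (1 - a*cos(w))). The record does not give the paper's exact formulation or how the warping factor is derived from the average F3, so the function names, the parameter `alpha`, and the sample rate in the usage note are assumptions made for this example.

```python
import math


def bilinear_warp(omega, alpha):
    """Warp a normalized angular frequency omega (radians, 0..pi).

    Uses the standard first-order all-pass (bilinear) warping curve.
    alpha is a hypothetical warping factor with |alpha| < 1; how the
    paper maps the average F3 to such a factor is not stated in this
    record, so alpha is simply passed in by the caller.
    """
    if not -1.0 < alpha < 1.0:
        raise ValueError("alpha must satisfy |alpha| < 1")
    return omega + 2.0 * math.atan2(
        alpha * math.sin(omega), 1.0 - alpha * math.cos(omega)
    )


def warp_hz(freq_hz, alpha, sample_rate_hz):
    """Apply the same warp to a frequency given in Hz.

    The Nyquist frequency (sample_rate_hz / 2) corresponds to omega = pi.
    """
    omega = 2.0 * math.pi * freq_hz / sample_rate_hz
    return bilinear_warp(omega, alpha) * sample_rate_hz / (2.0 * math.pi)
```

Under these assumptions, warped filterbank edges could be obtained by calling `warp_hz(f, alpha, 16000)` on each center or edge frequency `f` before building the triangular Mel filters. A positive `alpha` moves frequencies upward, a negative one moves them downward, and 0 Hz and the Nyquist frequency stay fixed.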