• DocumentCode
    394310
  • Title

    Speaker adaptation by hierarchical EigenVoice

  • Author

    Onishi, Yoshijiumi ; Iso, Ken-ichi

  • Author_Institution
    Multimedia Res. Labs., NEC Corp., Kawasaki, Japan
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    We propose a novel speaker adaptation method, hierarchical EigenVoice (HEV). This method extends the eigenvoice through clustering the Gaussian components of HMMs into a hierarchical tree structure. It enables one to autonomously control a number of adaptation parameters (model complexity) depending on the amount of adaptation utterances from a new speaker. The experimental results of Japanese large vocabulary continuous speech recognition confirmed the significant performance increase in all range of the adaptation utterance amounts compared with the conventional speaker adaptation methods.
  • Keywords
    Gaussian processes; eigenvalues and eigenfunctions; hidden Markov models; pattern clustering; speech recognition; Gaussian components clustering; Japan; adaptation parameters; adaptation utterance; continuous density mixture Gaussian HMM; hidden Markov models; hierarchical EigenVoice; hierarchical tree structure; large vocabulary continuous speech recognition; model complexity; speaker adaptation; speaker adaptation method; Adaptation model; Databases; Hidden Markov models; Hybrid electric vehicles; Laboratories; Loudspeakers; Maximum likelihood linear regression; Principal component analysis; Speech recognition; Tree data structures;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198846
  • Filename
    1198846