• DocumentCode
    352328
  • Title

    On-line incremental speaker adaptation with automatic speaker change detection

  • Author

    Zhang, Zhi-Peng ; Furui, Sadaoki ; Ohtsuki, Katsutoshi

  • Author_Institution
    Dept. of Comput. Sci., Tokyo Inst. of Technol., Japan
  • Volume
    2
  • fYear
    2000
  • fDate
    2000
  • Abstract
    In order to improve the performance of speech recognition systems when speakers change frequently and each of them utters a series of several sentences, a new unsupervised, online and incremental speaker adaptation technique combined with automatic detection of speaker changes is proposed. The speaker change is detected by comparing likelihoods using speaker-independent and speaker-adaptive Gaussian mixture models (GMMs). Both the phone HMM and GMM are adapted by MLLR transformation. In a broadcast news transcription task, this method reduces the word error rate by 10.0%. In comparison with the conventional method that uses HMMs for the speaker change detection, the GMM-based method requires a significantly less number of computations at the cost of only a slightly lower word recognition rate
  • Keywords
    Gaussian processes; maximum likelihood estimation; speech recognition; GMM; HMM; MLLR transformation; automatic speaker change detection; broadcast news transcription; incremental speaker adaptation; likelihoods; maximum likelihood linear regression; on-line incremental speaker adaptation; online speaker adaptation; performance; speaker-adaptive Gaussian mixture models; speaker-independent and speaker-adaptive Gaussian mixture models; speech recognition systems; unsupervised speaker adaptation; word error rate; word recognition rate; Cepstral analysis; Computer science; Context modeling; Hidden Markov models; Loudspeakers; Natural languages; Space technology; Speech recognition; TV broadcasting; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2000. ICASSP '00. Proceedings. 2000 IEEE International Conference on
  • Conference_Location
    Istanbul
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-6293-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2000.859121
  • Filename
    859121