• DocumentCode
    3484699
  • Title

    Speaker adaptation based on speaker-dependent eigenphone estimation

  • Author

    Zhang, Wen-Lin ; Zhang, Wei-Qiang ; Li, Bi-Cheng

  • Author_Institution
    Dept. of Inf. Sci., Zhengzhou Inf. Sci. & Technol. Inst., Zhengzhou, China
  • fYear
    2011
  • fDate
    11-15 Dec. 2011
  • Firstpage
    48
  • Lastpage
    52
  • Abstract
    Based on speaker dependent eigenphone estimation, a novel speaker adaptation technique is proposed in this paper. Different from conventional speaker adaptation approaches, the proposed method explicitly models the phone variations for each speaker through subspace modeling in the phone space. The phone coordinate, which is shared by all speakers, contains correlation information between different phones. During speaker adaptation, two schemes for estimation of the new speaker specific phone variation bases (namely eigenphones) are derived under maximum likelihood (ML) criterion and maximum a posteriori (MAP) criterion respectively. Supervised speaker adaptation experiments on a Mandarin Chinese continuous speech recognition task show that the new method outperforms both eigenvoice and maximum likelihood linear regression (MLLR) methods when sufficient adaptation data is available.
  • Keywords
    eigenvalues and eigenfunctions; maximum likelihood estimation; natural languages; regression analysis; speech recognition; Mandarin Chinese continuous speech recognition; eigenvoice; maximum a posteriori criterion; maximum likelihood criterion; maximum likelihood linear regression; phone coordinate; phone space; phone variations; speaker-dependent eigenphone estimation; subspace modeling; supervised speaker adaptation; Adaptation models; Correlation; Hidden Markov models; Maximum likelihood estimation; Training; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
  • Conference_Location
    Waikoloa, HI
  • Print_ISBN
    978-1-4673-0365-1
  • Electronic_ISBN
    978-1-4673-0366-8
  • Type

    conf

  • DOI
    10.1109/ASRU.2011.6163904
  • Filename
    6163904