• DocumentCode
    2665014
  • Title

    Multi-layer structure MLLR adaptation algorithm based on subspace regression classes

  • Author

    Mu, Xiangyu ; Zhang, Shuwu ; Xu, Bo

  • Author_Institution
    Nat. Lab. of Pattern Recognition, Chinese Acad. of Sci., Beijing, China
  • fYear
    2003
  • fDate
    26-29 Oct. 2003
  • Firstpage
    345
  • Lastpage
    350
  • Abstract
    In many adaptation algorithms were proposed in the last decade, most notable MAP estimation and MLLR transformation. When the amount of adaptation data is limited, adaptation can be done by grouping similar Gaussians together to form regression classes and then transforming the Gaussians in groups. We propose a rapid MLLR adaptation algorithm with multiply layer structure, which is called SRCMLR. The method groups the Gaussians at a finer acoustic subspace level, which is constructed on the target driven. It generates the regression class dynamically for each subspace, basing on the outcome of the former MLLR transformation. Because of the new algorithm´s special transformation structure and cluster space, there are fewer parameters to estimate for the subsequent MLLR transformation matrix, so computation load in performing transformation is much reduced. Experiments show that the use of SRCMLLR is more effective than other methods when the adaptation data is scare.
  • Keywords
    Gaussian distribution; maximum likelihood estimation; regression analysis; sparse matrices; speaker recognition; Gaussian groups; MAP estimation; MLLR transformation matrix; maximum a posterior estimation; maximum likelihood linear regression; regression classes; speaker recognition; Automation; Gaussian distribution; Gaussian processes; Hidden Markov models; Laboratories; Loudspeakers; Maximum likelihood linear regression; Parameter estimation; Pattern recognition; Technological innovation;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
  • Conference_Location
    Beijing, China
  • Print_ISBN
    0-7803-7902-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2003.1275929
  • Filename
    1275929