• DocumentCode
    989904
  • Title

    Robust Speaker Adaptation by Weighted Model Averaging Based on the Minimum Description Length Criterion

  • Author

    Cui, Xiaodong ; Alwan, Abeer

  • Author_Institution
    Dept. of Electr. Eng., California Univ., Los Angeles, CA
  • Volume
    15
  • Issue
    2
  • fYear
    2007
  • Firstpage
    652
  • Lastpage
    660
  • Abstract
    The maximum likelihood linear regression (MLLR) technique is widely used in speaker adaptation due to its effectiveness and computational advantages. When the adaptation data are sparse, MLLR performance degrades because of unreliable parameter estimation. In this paper, a robust MLLR speaker adaptation approach via weighted model averaging is investigated. A variety of transformation structures is first chosen and a general form of maximum likelihood (ML) estimation of the structures is given. The minimum description length (MDL) principle is applied to account for the compromise between transformation granularity and descriptive ability regarding the tying patterns of structured transformations with a regression tree. Weighted model averaging across the candidate structures is then performed based on the normalized MDL scores. Experimental results show that this kind of model averaging in combination with regression tree tying gives robust and consistent performance across various amounts of adaptation data
  • Keywords
    maximum likelihood estimation; regression analysis; speech recognition; trees (mathematics); maximum likelihood linear regression; minimum description length criterion; regression tree; robust speaker adaptation; weighted model averaging; Adaptation model; Degradation; Linear regression; Loudspeakers; Maximum likelihood estimation; Maximum likelihood linear regression; Parameter estimation; Regression tree analysis; Robustness; Speech recognition; Maximum likelihood linear regression (MLLR); minimum description length (MDL); model averaging; speaker adaptation;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2006.876773
  • Filename
    4067027