• DocumentCode
    3423443
  • Title

    A unified interpretation of adaptation approaches based on a macroscopic time evolution system and indirect/direct adaptation approaches

  • Author

    Watanabe, Shinji ; Nakamura, Atsushi

  • Author_Institution
    NTT Commun. Sci. Labs., NTT Corp., Tokyo
  • fYear
    2008
  • fDate
    March 31 2008-April 4 2008
  • Firstpage
    4285
  • Lastpage
    4288
  • Abstract
    Incremental adaptation techniques for speech recognition are aimed at adjusting acoustic models quickly and stably to time-variant acoustic characteristics due to temporal changes of speaker, speaking style, noise source, etc. We proposed a novel incremental adaptation framework based on a macroscopic time evolution system, which models the time-variant characteristics by successively updating posterior distributions of acoustic model parameters. In this paper, we provide a unified interpretation of the proposal and the two major conventional approaches of indirect adaptation via transformation parameters (e.g. maximum likelihood linear regression (MLLR)) and direct adaptation of classifier parameters (e.g. maximum a posteriori (MAP)). We reveal analytically and experimentally that the proposed incremental adaptation involves both the conventional and their combinatorial approaches, and simultaneously possesses their quick and stable adaptation characteristics.
  • Keywords
    acoustic signal processing; maximum likelihood estimation; regression analysis; signal classification; speech recognition; classifier parameters; incremental adaptation techniques; macroscopic time evolution system; maximum a posteriori; maximum likelihood linear regression; posterior distributions; speech recognition; time-variant acoustic characteristics; transformation parameters; Acoustic noise; Acoustic propagation; Adaptation model; Estimation error; Laboratories; Loudspeakers; Maximum likelihood linear regression; Parameter estimation; Speech recognition; Stability; acoustic model; incremental adaptation; indirect/direct adaptation; macroscopic time evolution; speech recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
  • Conference_Location
    Las Vegas, NV
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-1483-3
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2008.4518602
  • Filename
    4518602