• DocumentCode
    1749669
  • Title

    Hypothesis-driven adaptation (Hydra): a flexible eigenvoice architecture

  • Author

    Peters, S. Douglas

  • Author_Institution
    Nuance Commun., Montreal, Que., Canada
  • Volume
    1
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    349
  • Abstract
    In this article, a new architecture for speech recognition is introduced. As with many existing speech systems, this new approach involves multi-pass processing. In the present case, however, second-pass models are constructed on-line for each active hypothesis. Models for each hypothesized segment of the current utterance are constructed from linear combinations of "data cluster models" that have been trained on low-variability clusters of the training corpus. The data cluster weights are determined using an "eigenvoice" mechanism that is operative on low-complexity, low definition models. Once determined, the same weights are used to construct high-complexity, high-definition second-pass models generated over the same data clusters. Results from a simple recognition task are reported to demonstrate the interesting properties of the new architecture. The limitations, trade-offs and some possible extensions of the proposed approach are discussed
  • Keywords
    eigenvalues and eigenfunctions; pattern clustering; speech recognition; Hydra; active hypothesis; data cluster models; data cluster weights; flexible eigenvoice architecture; high-definition models; hypothesis-driven adaptation; hypothesized segment; low definition model; low-variability clusters; multi-pass processing; second-pass models; speech recognition; Adaptation model; Automatic speech recognition; Gaussian processes; Humans; Maximum likelihood estimation; Maximum likelihood linear regression; Probability density function; Speech processing; Speech recognition; Training data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
  • Conference_Location
    Salt Lake City, UT
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7041-4
  • Type

    conf

  • DOI
    10.1109/ICASSP.2001.940839
  • Filename
    940839