• DocumentCode
    2800966
  • Title

    Aspect-model-based reference speaker weighting

  • Author

    Hahm, Seongjun ; Ohkawa, Yuichi ; Ito, Masashi ; Suzuki, Motoyuki ; Ito, Akinori ; Makino, Shozo

  • Author_Institution
    Grad. Sch. of Eng., Tohoku Univ., Sendai, Japan
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4302
  • Lastpage
    4305
  • Abstract
    We propose an aspect-model-based reference speaker weighting. The main idea of the approach is that the adapted model is a linear combination of a set of reference speakers like reference speaker weighting (RSW) and eigenvoices. The aspect model is the mixture model of speaker-dependent (SD) models. In this paper, aspect model weighting (AMW) is proposed for finding an optimal weighting of a set of reference speakers unlike RSW and the aspect model which is a kind of cluster models is trained based on likelihood maximization with respect to the training data. The number of adaptation parameters can also be reduced using aspect model approach. For evaluation, we carried out an isolated word recognition experiment on Korean database (KLE452). The results were compared to those of conventional MAP, MLLR, RSW, and eigenvoice. Even though we use only 0.5s of adaptation data, 27.24% relative error rate reduction in comparison with speaker-independent (SI) baseline performance was achieved.
  • Keywords
    eigenvalues and eigenfunctions; maximum likelihood estimation; pattern clustering; speaker recognition; text analysis; Korean database; aspect-model-based reference speaker weighting; cluster models; eigenvoice; likelihood maximization; reference speaker set; speaker-dependent model; speech recognition; word recognition; Bayesian methods; Databases; Educational technology; Error analysis; Hidden Markov models; Informatics; Maximum likelihood linear regression; Speech recognition; Training data; Vectors; Aspect Model Weighting; Reference Speaker Weighting; Speaker Adaptation; Speech Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495672
  • Filename
    5495672