Aspect-model-based reference speaker weighting

Author

Hahm, Seongjun ; Ohkawa, Yuichi ; Ito, Masashi ; Suzuki, Motoyuki ; Ito, Akinori ; Makino, Shozo

Author_Institution

Grad. Sch. of Eng., Tohoku Univ., Sendai, Japan

fYear

2010

fDate

14-19 March 2010

Firstpage

4302

Lastpage

4305

Abstract

We propose an aspect-model-based reference speaker weighting. The main idea of the approach is that the adapted model is a linear combination of a set of reference speakers like reference speaker weighting (RSW) and eigenvoices. The aspect model is the mixture model of speaker-dependent (SD) models. In this paper, aspect model weighting (AMW) is proposed for finding an optimal weighting of a set of reference speakers unlike RSW and the aspect model which is a kind of cluster models is trained based on likelihood maximization with respect to the training data. The number of adaptation parameters can also be reduced using aspect model approach. For evaluation, we carried out an isolated word recognition experiment on Korean database (KLE452). The results were compared to those of conventional MAP, MLLR, RSW, and eigenvoice. Even though we use only 0.5s of adaptation data, 27.24% relative error rate reduction in comparison with speaker-independent (SI) baseline performance was achieved.

Keywords

eigenvalues and eigenfunctions; maximum likelihood estimation; pattern clustering; speaker recognition; text analysis; Korean database; aspect-model-based reference speaker weighting; cluster models; eigenvoice; likelihood maximization; reference speaker set; speaker-dependent model; speech recognition; word recognition; Bayesian methods; Databases; Educational technology; Error analysis; Hidden Markov models; Informatics; Maximum likelihood linear regression; Speech recognition; Training data; Vectors; Aspect Model Weighting; Reference Speaker Weighting; Speaker Adaptation; Speech Recognition;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location

Dallas, TX

ISSN

1520-6149

Print_ISBN

978-1-4244-4295-9

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2010.5495672

Filename

5495672