Selective EM training of acoustic models based on sufficient statistics of single utterances

Author

Cincarek, Tobias ; Toda, Tomoki ; Saruwatari, Hiroshi ; Shikano, Kiyohiro

Author_Institution

Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol.

fYear

2005

fDate

27-27 Nov. 2005

Firstpage

168

Lastpage

173

Abstract

In this paper, a new algorithm for selective training of acoustic models is proposed. The algorithm is formulated for an HMM-based model with Gaussian mixture densities, but works in principle for any statistical model, which has sufficient statistics. Since there are too many possibilities for selecting a data subset from a larger database, a heuristic has to be employed. The algorithm is based on deleting single utterances from a data pool temporarily or alternating between successive deletion or addition of utterances. The optimization criterion is the likelihood of the new model parameters given some development data, which can be calculated in a short amount of time based on sufficient statistics. The method is applied to automatically obtain task-dependent acoustic models for infant and elderly speech by selecting utterances from a data pool which are acoustically close to the development data. The proposed method is computationally practical and also addresses the issue of reducing the high costs evolving from the development of applications which make use of speech recognition technology

Keywords

Gaussian processes; acoustic signal processing; expectation-maximisation algorithm; hidden Markov models; speech recognition; speech synthesis; EM training; Gaussian mixture densities; HMM; elderly speech; infant speech; single utterances; speech recognition; task-dependent acoustic models; Acoustic applications; Costs; Databases; Hidden Markov models; Information science; Senior citizens; Speech processing; Speech recognition; Statistics; Training data;

fLanguage

English

Publisher

ieee

Conference_Titel

Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on

Conference_Location

San Juan

Print_ISBN

0-7803-9478-X

Electronic_ISBN

0-7803-9479-8

Type

conf

DOI

10.1109/ASRU.2005.1566486

Filename

1566486