DocumentCode
2875203
Title
Selective EM training of acoustic models based on sufficient statistics of single utterances
Author
Cincarek, Tobias ; Toda, Tomoki ; Saruwatari, Hiroshi ; Shikano, Kiyohiro
Author_Institution
Graduate Sch. of Inf. Sci., Nara Inst. of Sci. & Technol.
fYear
2005
fDate
27-27 Nov. 2005
Firstpage
168
Lastpage
173
Abstract
In this paper, a new algorithm for selective training of acoustic models is proposed. The algorithm is formulated for an HMM-based model with Gaussian mixture densities, but works in principle for any statistical model, which has sufficient statistics. Since there are too many possibilities for selecting a data subset from a larger database, a heuristic has to be employed. The algorithm is based on deleting single utterances from a data pool temporarily or alternating between successive deletion or addition of utterances. The optimization criterion is the likelihood of the new model parameters given some development data, which can be calculated in a short amount of time based on sufficient statistics. The method is applied to automatically obtain task-dependent acoustic models for infant and elderly speech by selecting utterances from a data pool which are acoustically close to the development data. The proposed method is computationally practical and also addresses the issue of reducing the high costs evolving from the development of applications which make use of speech recognition technology
Keywords
Gaussian processes; acoustic signal processing; expectation-maximisation algorithm; hidden Markov models; speech recognition; speech synthesis; EM training; Gaussian mixture densities; HMM; elderly speech; infant speech; single utterances; speech recognition; task-dependent acoustic models; Acoustic applications; Costs; Databases; Hidden Markov models; Information science; Senior citizens; Speech processing; Speech recognition; Statistics; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
Conference_Location
San Juan
Print_ISBN
0-7803-9478-X
Electronic_ISBN
0-7803-9479-8
Type
conf
DOI
10.1109/ASRU.2005.1566486
Filename
1566486
Link To Document