DocumentCode
104071
Title
Improved Bases Selection in Acoustic Model Interpolation for Fast On-Line Adaptation
Author
Shahnawazuddin, S. ; Sinha, Roopak
Author_Institution
Dept. of Electron. & Electr. Eng., Indian Inst. of Technol., Guwahati, Guwahati, India
Volume
21
Issue
4
fYear
2014
fDate
Apr-14
Firstpage
493
Lastpage
497
Abstract
This work presents a novel bases selection approach for acoustic model interpolation based fast on-line adaptation. The proposed approach employs a correlation based similarity measure in the supervector domain (derived by concatenating the Gaussian mean parameters of the adapted models) for the selection of bases. This approach is found to greatly reduce the computational complexity in comparison to the Viterbi-alignment based bases search. Moreover, the proposed approach employs joint representation along with orthogonalization for the dynamic selection of bases. Consequently, the selected bases result in a much balanced coverage of phonetic contexts in the synthesized adapted model. The proposed technique is found to result in improved performance for all three modes of adaptation viz. the utterance-specific, the incremental and the batch modes. For utterance-specific mode, it achieves a relative improvement of 10.2% over baseline with only 3 to 5 seconds of adaptation data.
Keywords
interpolation; speaker recognition; speech processing; Gaussian mean parameters; acoustic model interpolation; fast on-line adaptation; improved bases selection; supervector domain; utterance-specific mode; Acoustics; Adaptation models; Computational modeling; Context; Data models; Dictionaries; Interpolation; Acoustic model interpolation; fast adaptation; joint representation; on-line adaptation;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2014.2306451
Filename
6740837
Link To Document