DocumentCode :
454738
Title :
Multi-Parameter Frequency Warping for Vtln by Gradient Search
Author :
Panchapagesan, Sankaran ; Alwan, Abeer
Author_Institution :
Dept. of Electr. Eng., California Univ., Los Angeles, CA
Volume :
1
fYear :
2006
fDate :
14-19 May 2006
Abstract :
The current method for estimating frequency warping (FW) functions for vocal tract length normalization (VTLN) is by maximizing the ASR likelihood score by an exhaustive search over a grid of FW parameters. Exhaustive search is inefficient when estimating multi-parameter FWs, which have been shown to give improvements in recognition accuracy over single parameter FWs (J.W. McDonough, 2000). Here we develop a gradient search algorithm to obtain the optimal FW parameters for MFCC features, since previous work focussed on PLP cepstral features (J.W. McDonough, 2000). The novel calculation involved was that of the gradient of the Mel filterbank with respect to the FW parameters. Even for a single parameter, the gradient search method was more efficient than grid search by a factor of around 1.6 on the average for male children speakers tested on models trained from adult males. When used to estimate multi-parameter sine-log allpass transform (SLAPT, (J.W. McDonough, 2000)) FWs for VTLN, more than 50% reduction in word error rate was obtained with five parameter SLAPT compared to single-parameter piecewise linear FW
Keywords :
channel bank filters; gradient methods; search problems; speech processing; transforms; MFCC features; Mel filterbank; gradient search algorithm; multiparameter frequency warping; sine-log allpass transform; vocal tract length normalization; Automatic speech recognition; Cepstral analysis; Collision mitigation; Filter bank; Frequency estimation; Mel frequency cepstral coefficient; Parameter estimation; Piecewise linear techniques; Search methods; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on
Conference_Location :
Toulouse
ISSN :
1520-6149
Print_ISBN :
1-4244-0469-X
Type :
conf
DOI :
10.1109/ICASSP.2006.1660237
Filename :
1660237
Link To Document :
بازگشت