DocumentCode :
290041
Title :
Adaptation to new microphones using tied-mixture normalization
Author :
Anastasakos, Anastasios ; Kubala, Francis ; Makhoul, John ; Schwartz, Richard
Author_Institution :
Northeastern Univ., Boston, MA, USA
Volume :
i
fYear :
1994
fDate :
19-22 Apr 1994
Abstract :
In this paper, we present several approaches designed to increase the robustness of BYBLOS, the BBN continuous speech, hidden Markov model (HMM) recognition system. We address the problem of increased degradation in performance when there is mismatch in the characteristics of the training and the test microphones. First we compare RASTA processing and cepstrum mean subtraction as preprocessing methods, to compensate for unknown channel transfer function effects, when we have no information about the new microphone. Then we introduce a new algorithm that computes a probabilistic transformation from the training microphone codebook to that of a new microphone, given some information about the new microphone. We test this algorithm in supervised mode and, combined with a microphone selection method, in unsupervised mode. We present experimental results which show that the proposed algorithm combined with cepstrum mean subtraction, improves the recognition accuracy when the system is tested on a microphone with different characteristics than the one on which it was trained
Keywords :
cepstral analysis; hidden Markov models; microphones; probability; speech recognition; transfer functions; BBN continuous speech; BYBLOS; RASTA processing; algorithm; cepstrum mean subtraction; channel transfer function effects; degradation; hidden Markov model; microphone selection method; performance; preprocessing methods; probabilistic transformation; recognition accuracy; robustness; speech recognition system; supervised mode; test microphones; tied-mixture normalization; training; training microphone codebook; unsupervised mode; Additive noise; Automatic speech recognition; Cepstrum; Degradation; Hidden Markov models; Microphones; Robustness; Speech recognition; System testing; Transfer functions;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on
Conference_Location :
Adelaide, SA
ISSN :
1520-6149
Print_ISBN :
0-7803-1775-0
Type :
conf
DOI :
10.1109/ICASSP.1994.389263
Filename :
389263
Link To Document :
بازگشت