Title :
Speaker adaptation based on spectral normalization and dynamic HMM parameter adaptation
Author_Institution :
GTE Labs. Inc., Waltham, MA, USA
Abstract :
Speaker adaptation has received a considerable amount of attention in recent years. Most of the previous work focused on techniques which require a certain amount of speech to be collected from the target speaker. This paper presents two speaker adaptation methods, including a feature normalization and a HMM parameter adaptation, developed to improve a speaker-independent HMM-based speech recognition system. The proposed adaptation algorithms are text-independent and do not require target speech collection. By applying the feature normalization, the target speech is normalized to reduce the acoustic inter-speaker and environmental variability. By applying the HMM parameter adaptation, the recognition system parameters are dynamically modified to model the target speech. We carried out recognition experiments to assess the performance, using two different speaker-independent recognizers as the baseline systems: a continuous digit recognizer and a keyword recognition system. The results show that when both adaptation techniques are combined, the word error of the digit recognizer using the TI Connected Digit corpus is reduced by about 30% and the detection error of a keyword recognition system using the Road Rally corpora is reduced by about 40%
Keywords :
acoustic signal processing; adaptive signal processing; hidden Markov models; parameter estimation; spectral analysis; speech processing; speech recognition; HMM-based speech recognition; Road Rally corpora; TI Connected Digit corpus; acoustic inter-speaker variability reduction; continuous digit recognizer; detection error; dynamic HMM parameter adaptation; environmental variability reduction; feature normalization; keyword recognition system; performance; recognition experiments; recognition system parameters; speaker adaptation; speaker-independent speech recognition system; spectral normalization; target speech; text-independent algorithms; word error; Character recognition; Clustering algorithms; Decoding; Degradation; Error analysis; Hidden Markov models; Laboratories; Loudspeakers; Pattern recognition; Speech recognition; Target recognition; Vectors;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1995. ICASSP-95., 1995 International Conference on
Conference_Location :
Detroit, MI
Print_ISBN :
0-7803-2431-5
DOI :
10.1109/ICASSP.1995.479791