Title :
Investigations on inter-speaker variability in the feature space
Author_Institution :
Philips Res. Lab., Aachen, Germany
Abstract :
We apply Fisher variate analysis to measure the effectiveness of speaker normalization techniques. A trace criterion, which measures the ratio of the variations due to different phonemes compared to variations due to different speakers, serves as a first assessment of a feature set without the need for recognition experiments. By using this measure and by recognition experiments we demonstrate that cepstral mean normalization also has a speaker normalization effect, in addition to the well-known channel normalization effect. Similarly vocal tract normalization (VTN) is shown to remove inter-speaker variability. For VTN we show that normalization on a per sentence basis performs better than normalization on a per speaker basis. Recognition results are given on Wall Street Journal and Hub-4 databases
Keywords :
cepstral analysis; feature extraction; speech processing; speech recognition; Fisher variate analysis; Hub-4 database; Wall Street Journal database; cepstral mean normalization; channel normalization; cognition results; feature set; feature space; inter-speaker variability; phonemes; ratio; recognition experiments; sentence; speaker normalization; trace criterion; vocal tract normalization; Acoustic testing; Cepstral analysis; Covariance matrix; Databases; Extraterrestrial measurements; Laboratories; Loudspeakers; Robustness; Speech enhancement; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758146