Title :
Generative modeling of pseudo-target domain adaptation samples for whispered speech recognition
Author :
Ghaffarzadegan, Shabnam ; Boril, Hynek ; Hansen, John H. L.
Author_Institution :
Center for Robust Speech Syst. (CRSS), Univ. of Texas at Dallas, Richardson, TX, USA
Abstract :
The lack of available large corpora of transcribed whispered speech is one of the major roadblocks for development of successful whisper recognition engines. Our recent study has introduced a Vector Taylor Series (VTS) approach to pseudo-whisper sample generation which requires availability of only a small number of real whispered utterances to produce large amounts of whisper-like samples from easily accessible transcribed neutral recordings. The pseudo-whisper samples were found particularly effective in adapting a neutral-trained recognizer to whisper. Our current study explores the use of denoising autoencoders (DAE) for pseudo-whisper sample generation. Two types of generative models are investigated - one which produces pseudo-whispered cepstral vectors on a frame basis and another which generates pseudo-whisper statistics of whole phone segments. It is shown that the DAE approach considerably reduces word error rates of the baseline system as well as the system adapted on real whisper samples. The DAE approach provides competitive results to the VTS-based method while cutting its computational overhead nearly in half.
Keywords :
error statistics; signal denoising; speech recognition; DAE approach; VTS approach; computational overhead; denoising autoencoder; neutral-trained recognizer; phone segments; pseudo-target domain adaptation sample generative model; pseudo-whisper sample generation; pseudo-whisper statistics; vector Taylor series approach; whispered speech recognition engine; whispered utterance; word error rate reduction; Indexes; Mel frequency cepstral coefficient; Speech; Vector Taylor Series; denoising autoencoders; generative models; whispered speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178927