Title :
Modified splice and its extension to non-stereo data for noise robust speech recognition
Author :
Kumar, D. S. Pavan ; Prasad, N. Vishnu ; Joshi, Vinayak ; Umesh, S.
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol. Madras, Chennai, India
Abstract :
In this paper, a modification to the training process of the popular SPLICE algorithm has been proposed for noise robust speech recognition. The modification is based on feature correlations, and enables this stereo-based algorithm to improve the performance in all noise conditions, especially in unseen cases. Further, the modified framework is extended to work for non-stereo datasets where clean and noisy training utterances, but not stereo counterparts, are required. Finally, an MLLR-based computationally efficient run-time noise adaptation method in SPLICE framework has been proposed. The modified SPLICE shows 8.6% absolute improvement over SPLICE in Test C of Aurora-2 database, and 2.93% overall. Non-stereo method shows 10.37% and 6.93% absolute improvements over Aurora-2 and Aurora-4 baseline models respectively. Run-time adaptation shows 9.89% absolute improvement in modified framework as compared to SPLICE for Test C, and 4.96% overall w.r.t. standard MLLR adaptation on HMMs.
Keywords :
signal denoising; speech recognition; Aurora-2 database; MLLR-based run-time noise adaptation method; SPLICE algorithm; Test C; absolute improvement; clean training utterances; feature correlations; noise conditions; noise robust speech recognition; noisy training utterances; nonstereo data; performance improvement; run-time adaptation; stereo-based algorithm; stereo-based piece-wise linear compensation for environments; training process; Correlation; Hidden Markov models; Noise measurement; Signal to noise ratio; Training; Vectors; MFCC; Robust speech recognition; SPLICE; feature normalisation; stereo data;
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2013 IEEE Workshop on
Conference_Location :
Olomouc
DOI :
10.1109/ASRU.2013.6707725