Title :
Robust language identification using Power Normalized Cepstral Coefficients
Author :
Arup Kumar Dutta;K. Sreenivasa Rao
Author_Institution :
School of Information Technology, Indian Institute of Technology Kharagpur, India - 721 302
Abstract :
The present work investigates the robustness of Power Normalized Cepstral Coefficients (PNCC) for Language identification (LID) from noisy speech. Though the state of the art vocal tract features like mel frequency cepstral coefficients (MFCC) give good recognition accuracy in clean environments, the performance degrades drastically when the signal to noise ratio decreases. In this work, experiments have been carried out on IITKGP-MLILSC speech database. Gaussian mixture model (GMM) is used to building the language models. We have used NOISEX-92 database to add synthetic noise at different SNR levels. We have also compared the recognition accuracy of two systems, one developed using MFCCs and and the other using PNCCs. Finally, we have shown that PNCC features are more robust to noise.
Keywords :
"Signal to noise ratio","Robustness"
Conference_Titel :
Contemporary Computing (IC3), 2015 Eighth International Conference on
Print_ISBN :
978-1-4673-7947-2
DOI :
10.1109/IC3.2015.7346688