Title :
Combined PNCC feature extractor for robust speech recognition
Author :
Xiaoyu Liu ; Zahorian, Stephen A.
Author_Institution :
Dept. of Electr. & Comput. Eng., Binghamton Univ., Binghamton, NY, USA
Abstract :
Recently, two major types of Power-Normalized Cepstral Coefficients (PNCCs) were proposed as noise robust Automatic Speech Recognition (ASR) front-end. All the literatures for these two PNCCs assume clean training data and clean or noisy test data. However, we find that one PNCC method has good performance for the clean training/noisy test scenario, but degrades when test data is cleaner than the training data. The other PNCC method performs relatively better for noisy training/clean test conditions, but is not very robust for the clean training/noisy test conditions. We propose Combined PNCC (C-PNCC) algorithm, which is superior to both previous PNCCs for clean training/noisy test cases, and which also has reasonably good performance for noisy training/clean test conditions.
Keywords :
feature extraction; filtering theory; speech recognition; ASR; C-PNCC algorithm; PNCC method; clean training; combined PNCC feature extractor; noisy test conditions; power-normalized cepstral coefficients; pre-emphasis filter; robust automatic speech recognition; Filter banks; Mel frequency cepstral coefficient; Noise; Noise measurement; Speech; Testing; Training; C-PNCC; G-PNCC; L-PNCC; front-end; noise reduction;
Conference_Titel :
Signal and Information Processing (ChinaSIP), 2014 IEEE China Summit & International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-1-4799-5401-8
DOI :
10.1109/ChinaSIP.2014.6889206