DocumentCode :
74573
Title :
Robust speech recognition using harmonic features
Author :
Yeh Huann Goh ; Raveendran, Paramesaran ; Jamuar, Sudhanshu S.
Author_Institution :
Dept. of Electr. Eng., Univ. of Malaya, Kuala Lumpur, Malaysia
Volume :
8
Issue :
2
fYear :
2014
fDate :
Apr-14
Firstpage :
167
Lastpage :
175
Abstract :
In this study, the authors propose a speech recognition system using harmonic structure related information to detect harmonic features in noisy environment. The proposed algorithm first extracts the harmonic components contained inside the speech signals using sine function convolution. By setting the frequency of the sine function as equal to the fundamental frequency of speech signals, harmonic components can be extracted out. The reconstructed signal obtained by summing up the extracted harmonic components is found to have a high degree of correlation with the original signal. The extracted frame energy measure of the harmonic components has been further processed to become dynamic harmonic features and then used together with the European Telecommunications Standards Institute (ETSI) front-end processed mel-frequency cepstral coefficients (MFCC) feature or the perceptual linear prediction (PLP) feature in the speech recognition system. The proposed enhanced speech recognition system shows a better recognition rate over the ETSI front-end processed MFCC (or PLP)-based speech recognition system.
Keywords :
signal reconstruction; speech recognition; ETSI front-end processed MFCC; European Telecommunications Standards Institute; PLP-based speech recognition system; dynamic harmonic features; extracted frame energy; extracted harmonic components; front-end processed mel-frequency cepstral coefflcients; harmonic components; harmonic features; harmonic structure related information; high degree of correlation; noisy environment; perceptual linear prediction; reconstructed signal; robust speech recognition; sine function convolution; speech recognition system; speech signal frequency; speech signals;
fLanguage :
English
Journal_Title :
Signal Processing, IET
Publisher :
iet
ISSN :
1751-9675
Type :
jour
DOI :
10.1049/iet-spr.2013.0094
Filename :
6786928
Link To Document :
بازگشت