DocumentCode :
1146772
Title :
Optimizing feature extraction for speech recognition
Author :
Lee, Chulhee ; Hyun, Donghoon ; Choi, Euisun ; Go, Jinwook ; Lee, Chungyong
Author_Institution :
Dept. of Electr. & Electron. Eng., Yonsei Univ., Seoul, South Korea
Volume :
11
Issue :
1
fYear :
2003
fDate :
1/1/2003 12:00:00 AM
Firstpage :
80
Lastpage :
87
Abstract :
We propose a method to minimize the loss of information during the feature extraction stage in speech recognition by optimizing the parameters of the mel-cepstrum transformation, a transform which is widely used in speech recognition. Typically, the mel-cepstrum is obtained by critical band filters whose characteristics play an important role in converting a speech signal into a sequence of vectors. First, we analyze the performance of the mel-cepstrum by changing the parameters of the filters such as shape, center frequency, and bandwidth. Then we propose an algorithm to optimize the parameters of the filters using the simplex method. Experiments with Korean digit words show that the recognition rate improved by about 4-7%.
Keywords :
cepstral analysis; feature extraction; filtering theory; optimisation; speech recognition; Korean digit words; bandwidth; center frequency; critical band filters; feature extraction optimization; filter shape; mel-cepstrum transformation; parameters optimization; recognition rate; simplex method; speech recognition; speech signal; vectors sequence; Bandwidth; Data mining; Feature extraction; Filters; Frequency; Hidden Markov models; Humans; Optimization methods; Performance analysis; Speech recognition;
fLanguage :
English
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1063-6676
Type :
jour
DOI :
10.1109/TSA.2002.805644
Filename :
1179382
Link To Document :
بازگشت