DocumentCode :
302069
Title :
Feature parameter curve method for high performance NN-based speech recognition
Author :
Zhu, Shun ; Chen, Dao Wen ; Huang, Tai Yi
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Univ., Singapore
Volume :
1
fYear :
1996
fDate :
7-10 May 1996
Firstpage :
1
Abstract :
This paper proposes a novel high-performance NN-based isolated word speech recognition method named feature parameter curve method (FPCM), which contains three sub-methods, i.e., the feature frame parameter method, the broken line fitting method and the representing-point sequence (RPS) method. Using feature parameter curve to represent the contour of LPC-based cepstral coefficient sequence, our new methods solve the time alignment problem, greatly lighten the burden on neural network and thus reducing the complexity of the neural network. Compared with other NN-based methods, our methods have many strong points such as much faster training and recognition, simpler structure of neural nets, and is more robust. A recognition system of all Mandarin syllables has been established based on this method and the comparative experiments confirmed the good characteristics stated above. The idea of reconstructing original speech signals is also first shown in speech recognition
Keywords :
cepstral analysis; feature extraction; learning (artificial intelligence); linear predictive coding; natural languages; neural nets; parameter estimation; speech coding; speech recognition; LPC-based cepstral coefficient sequence; Mandarin syllables recognition; broken line fitting method; experiments; feature parameter curve method; isolated word speech recognition; neural network; representing-point sequence method; speech signal reconstruction; time alignment problem solution; training; Automation; Cepstral analysis; Character recognition; Curve fitting; Databases; Isolation technology; Neural networks; Pattern recognition; Robustness; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location :
Atlanta, GA
ISSN :
1520-6149
Print_ISBN :
0-7803-3192-3
Type :
conf
DOI :
10.1109/ICASSP.1996.540275
Filename :
540275
Link To Document :
بازگشت