Title :
Peripheral features for HMM-based speech recognition
Author :
Fukuda, Takashi ; Takigawa, Masashi ; Nitta, Tsuneo
Author_Institution :
Graduate School of Eng., Toyohashi Univ. of Technol., Japan
Abstract :
This paper describes an attempt to extract peripheral features of a point c(ti,qj) on a time-quefrency (TQ) pattern by observing n×n neighborhoods of the point, and then to incorporate these peripheral features into the MFCC-based feature extractor of a speech recognition system as a replacement to dynamic features. In the design of the feature extractor, firstly, the orthogonal bases extracted directly from speech data by using the Karhunen-Loeve transform (KLT) of 7×3 blocks on a TQ pattern are adopted as the peripheral features, then, the upper two primal bases are selected and simplified in the form of ▵t-operator and ▵q-operator. The proposed feature-set of MFCC and peripheral features shows significant improvements in comparison with the standard feature-set of MFCC and dynamic features in experiments with an HMM-based automatic speech recognition (ASR) system. The reason for the increased performance is discussed in terms of minimal-pair tests
Keywords :
Karhunen-Loeve transforms; feature extraction; hidden Markov models; speech recognition; HMM-based speech recognition; Karhunen-Loeve transform; MFCC-based feature extractor; automatic speech recognition system; dynamic features; minimal-pair tests; orthogonal bases extraction; peripheral features extraction; speech data; time-quefrency pattern; Acoustic testing; Automatic speech recognition; Cepstrum; Data mining; Feature extraction; Hidden Markov models; Karhunen-Loeve transforms; Mel frequency cepstral coefficient; Spatial databases; Speech recognition;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940784