DocumentCode
337439
Title
Discriminative spectral-temporal multiresolution features for speech recognition
Author
McMahon, P. ; Harte, N. ; Vaseghi, S. ; McCourt, P.
Author_Institution
Sch. of Electr. Eng., Queen´´s Univ., Belfast, UK
Volume
2
fYear
1999
fDate
15-19 Mar 1999
Firstpage
581
Abstract
Multi-resolution features, which are based on the premise that there may be more cues for phonetic discrimination in a given sub-band than in another, have been shown to outperform the standard MFCC feature set for both classification and recognition tasks on the TIMIT database. This paper presents an investigation into possible strategies to extend these ideas from the spectral domain into both the spectral and temporal domains. Experimental work on the integration of segmental models, which are better at capturing the longer term phonetic correlation of a phonetic unit, into the discriminative multi-resolution framework is presented. Results are presented which show that including this supplementary temporal information offers an improvement performance for the phoneme classification task over the standard multi-resolution MFCC feature set with time derivatives appended. Possible strategies for the extension of theses techniques into the area of continuous speech recognition are discussed
Keywords
correlation methods; feature extraction; hidden Markov models; signal classification; signal resolution; spectral analysis; speech recognition; MFCC feature set; TIMIT database; continuous speech recognition; discriminative spectral-temporal multiresolution features; experiment; performance; phoneme classification; phonetic correlation; phonetic discrimination; phonetic unit; segmental HMM; spectral domain; speech classification; sub-band; temporal domain; temporal information; time derivatives; Auditory system; Cepstral analysis; Electronic mail; Hidden Markov models; Humans; Information filtering; Mel frequency cepstral coefficient; Spatial databases; Speech recognition; Vectors;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location
Phoenix, AZ
ISSN
1520-6149
Print_ISBN
0-7803-5041-3
Type
conf
DOI
10.1109/ICASSP.1999.759733
Filename
759733
Link To Document