DocumentCode :
323751
Title :
A novel robust feature of speech signal based on the Mellin transform for speaker-independent speech recognition
Author :
Chen, Jingdong ; Xu, Bo ; Huang, Taiyi
Author_Institution :
Inst. of Autom., Acad. Sinica, Beijing, China
Volume :
2
fYear :
1998
fDate :
12-15 May 1998
Firstpage :
629
Abstract :
This paper presents a novel speech feature: the modified Mellin transform of the log-spectrum of the speech signal (MMTLS for short). Because the modified Mellin transform is scale invariant, the new feature is insensitive to variation in vocal tract length among individual speakers, and is thus more appropriate for speaker-independent speech recognition than the widely used cepstrum. Preliminary experiments show that the MMTLS-based method performs much better than the LPC- and MFC-based methods. Moreover, the error rate of this method is very consistent across different outlier speakers.
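The scale-invariance property mentioned in the abstract can be illustrated with a minimal sketch. It uses the standard Mellin transform evaluated along s = jω, not the authors' modified version, whose definition the abstract does not give. The function names, the two-peak `envelope`, and the warp factor `alpha` are hypothetical choices made for illustration only. Substituting x = exp(u) turns the Mellin transform into a Fourier transform over u, so scaling the argument by `alpha` becomes a shift in u and leaves the magnitude unchanged.

```python
import numpy as np

def mellin_magnitude(f, u):
    """Magnitude of the Mellin transform of f along s = j*omega.

    With x = exp(u), the Mellin integral becomes a Fourier integral over u,
    which is approximated here by an FFT on a uniform grid in u.
    """
    samples = f(np.exp(u))      # sample f on an exponentially spaced x grid
    du = u[1] - u[0]
    return np.abs(np.fft.rfft(samples)) * du

def envelope(x):
    # Hypothetical two-peak envelope standing in for a spectral shape.
    lx = np.log(x)
    return np.exp(-(lx - 0.5) ** 2 / 0.1) + 0.6 * np.exp(-(lx + 0.8) ** 2 / 0.2)

# Uniform grid in u = ln(x), wide enough that the envelope decays to ~0 at both ends.
u = np.linspace(-8.0, 8.0, 2048, endpoint=False)

alpha = 1.2  # assumed scale factor, loosely analogous to a vocal-tract-length change
reference = mellin_magnitude(envelope, u)
warped = mellin_magnitude(lambda x: envelope(alpha * x), u)

# Largest difference between the two magnitude profiles.
max_diff = float(np.max(np.abs(reference - warped)))
print(max_diff)
```

The difference stays near floating-point precision because the envelope is negligible at the edges of the grid. A function that does not decay within the sampled range would only be approximately invariant under this discretization.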
Keywords :
acoustic signal processing; feature extraction; spectral analysis; speech recognition; transforms; LPC-based method; MFC-based method; MMTLS-based method; acoustic feature; experiments; log-spectrum; modified Mellin transform; outlier speakers; performance; robust feature; speaker-independent speech recognition; speech signal; vocal tract length; Acoustic noise; Fourier transforms; Frequency estimation; Laboratories; Loudspeakers; Pattern recognition; Robustness; Shape; Speech analysis; Speech recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
ISSN :
1520-6149
Print_ISBN :
0-7803-4428-6
Type :
conf
DOI :
10.1109/ICASSP.1998.675343
Filename :
675343