Title :
Detection of spectral transition for speech perception based on time-frequency analysis
Author :
Zhao, Qun ; Gao, Qianli ; Chi, Huisheng
Author_Institution :
Centre for Inf. Sci., Beijing Univ., China
Abstract :
Current speech or speaker recognition systems rely largely on voiced parts of utterance, though a great amount of information for speech perception is contained in the nonstationary consonants and transition. How to model and characterize the dynamic spectral features describing the transition still remains a question. This paper investigates the modeling and detection of the spectral transition based on time-frequency analysis. Linear and nonlinear modeling of the transitions are proposed using linear and quadratic frequency modulation signals. Then two strategies of detection of the spectral transition are presented, i.e., the Radon-Wigner transform (RWT) and Radon-ambiguity transform (RAT). Both simulated and real speech data from the TIMIT database are used to test the detection procedure
Keywords :
Radon transforms; Wigner distribution; frequency modulation; signal representation; speaker recognition; spectral analysis; speech recognition; time-frequency analysis; Radon-Wigner transform; Radon-ambiguity transform; TIMIT database; dynamic spectral features; linear frequency modulation signals; linear modeling; nonlinear modeling; nonstationary consonants; quadratic frequency modulation signals; speaker recognition; spectral transition detection; speech perception; speech recognition; time-frequency analysis; Hidden Markov models; Information science; Linear predictive coding; Nonlinear dynamical systems; Signal analysis; Signal processing; Speaker recognition; Spectral analysis; Speech analysis; Time frequency analysis;
Conference_Titel :
Information, Communications and Signal Processing, 1997. ICICS., Proceedings of 1997 International Conference on
Print_ISBN :
0-7803-3676-3
DOI :
10.1109/ICICS.1997.647153