Title :
Better visualization for formant analysis by means of time-frequency distributions
Author :
Ma, Ning ; Ching, P.C.
Author_Institution :
Dept. of Electron. Eng., Chinese Univ. of Hong Kong, Shatin, Hong Kong
Abstract :
This work explores the possibility of using time-frequency distributions (TFDs) to extract time-varying formant information. The technique uses TFDs of Cohen's class (see Prentice-Hall, Englewood Cliffs, NJ, 1995) and provides a continuous profile of formant variation in the time-frequency plane, which can be employed to improve formant tracking and formant bandwidth estimation. The performance of this method is compared with that of other existing methods, each of which has its own pitfalls, using a modulated synthetic signal as input. It is shown that the proposed method gives better formant estimation and also provides a better visual representation. The method is also used to analyze real human speech, and the results can be helpful for speech understanding and speech synthesis.
Keywords :
speech processing; Cohen's class; formant analysis; formant bandwidth estimation; formant tracking; formant variation; modulated synthetic signal; performance; real human speech; smoothed Wigner distribution; speech synthesis; speech understanding; time varying formant information; time-frequency distributions; time-frequency plane; visualization representation; Bandwidth; Humans; Kernel; Linear predictive coding; Signal analysis; Spectrogram; Speech analysis; Speech synthesis; Time frequency analysis; Visualization;
Conference_Title :
TENCON '97. IEEE Region 10 Annual Conference. Speech and Image Technologies for Computing and Telecommunications., Proceedings of IEEE
Conference_Location :
Brisbane, Qld., Australia
Print_ISBN :
0-7803-4365-4
DOI :
10.1109/TENCON.1997.647253