Title :
The statistical structures of male and female speech signals
Author :
Lee, Te-Won ; Jang, Gil-Jin
Author_Institution :
Computational Neurobiol. Lab., Salk Inst., La Jolla, CA, USA
Abstract :
The goal of this paper is to learn or adapt statistical features of gender specific speech signals. The adaptation is performed by finding basis functions that encode the speech signal such that the resulting coefficients are statistically independent and the information redundancy is minimized. We use a flexible independent component analysis (ICA) algorithm to adapt the basis functions as well as the source coefficients for male and female speakers respectively. The learned features show significant differences in frequency and time span. Our results suggest that the male speech features can be described by Gabor-like wavelet filters whereas the female speech signal has a much longer time span. We present a detailed time-frequency analysis strongly suggesting that those features can be used to qualify and quantify gender-specific speech signal differences
Keywords :
digital filters; speech coding; speech processing; statistical analysis; time-frequency analysis; wavelet transforms; Gabor-like wavelet filters; ICA algorithm; basis functions; female speakers; female speech signals; frequency; gender specific speech signals; independent component analysis; information redundancy; male speakers; male speech signals; source coefficients; statistical features; statistical structures; time span; time-frequency analysis; Cepstral analysis; Encoding; Gabor filters; Independent component analysis; Laboratories; Speech analysis; Speech recognition; Statistics; Time frequency analysis; Vectors;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 2001 IEEE International Conference on
Conference_Location :
Salt Lake City, UT
Print_ISBN :
0-7803-7041-4
DOI :
10.1109/ICASSP.2001.940778