Title :
Automatic sex identification from short segments of speech
Author :
Fussell, Jesse W.
Author_Institution :
Dept. of Defense, Fort Meade, MD, USA
Abstract :
The performance of a sex identification system working on 16 ms segments of speech is discussed. Consideration is given to: the effectiveness of individual cepstral coefficients and frame-to-frame differences of those coefficients for sex identification; performance of the Gaussian classifier as a function of the amount of training data; the results of simplifications to the Gaussian classifier; performance when training and testing are done by phoneme, by phoneme-class, and on all speech; effects of training on one phoneme and testing on speech from different phonemes; and distribution of sex identification errors by speaker. It is concluded that, if the speech signal is of high quality, it should not be difficult to build a practical system which uses successive outputs from the single-frame Gaussian classifier to accurately classify a speaker as being male or female. Additional testing needs to be done on data which are not of such high quality
Keywords :
speech recognition; 16 ms; female; frame-to-frame differences; individual cepstral coefficients; male; phoneme-class; sex identification errors; short speech segments; single-frame Gaussian classifier; speech recognition; training data; Acoustic testing; Cepstral analysis; Documentation; Liquids; Loudspeakers; NIST; Neural networks; Signal processing; Speech processing; Training data;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7803-0003-3
DOI :
10.1109/ICASSP.1991.150363