DocumentCode
1916402
Title
Automatic sex identification from short segments of speech
Author
Fussell, Jesse W.
Author_Institution
Dept. of Defense, Fort Meade, MD, USA
fYear
1991
fDate
14-17 Apr 1991
Firstpage
409
Abstract
The performance of a sex identification system working on 16 ms segments of speech is discussed. Consideration is given to: the effectiveness of individual cepstral coefficients and frame-to-frame differences of those coefficients for sex identification; performance of the Gaussian classifier as a function of the amount of training data; the results of simplifications to the Gaussian classifier; performance when training and testing are done by phoneme, by phoneme-class, and on all speech; effects of training on one phoneme and testing on speech from different phonemes; and distribution of sex identification errors by speaker. It is concluded that, if the speech signal is of high quality, it should not be difficult to build a practical system which uses successive outputs from the single-frame Gaussian classifier to accurately classify a speaker as being male or female. Additional testing needs to be done on data which are not of such high quality
Keywords
speech recognition; 16 ms; female; frame-to-frame differences; individual cepstral coefficients; male; phoneme-class; sex identification errors; short speech segments; single-frame Gaussian classifier; speech recognition; training data; Acoustic testing; Cepstral analysis; Documentation; Liquids; Loudspeakers; NIST; Neural networks; Signal processing; Speech processing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
Conference_Location
Toronto, Ont.
ISSN
1520-6149
Print_ISBN
0-7803-0003-3
Type
conf
DOI
10.1109/ICASSP.1991.150363
Filename
150363
Link To Document