• DocumentCode
    1916402
  • Title
    Automatic sex identification from short segments of speech
  • Author
    Fussell, Jesse W.
  • Author_Institution
    Dept. of Defense, Fort Meade, MD, USA
  • fYear
    1991
  • fDate
    14-17 Apr 1991
  • Firstpage
    409
  • Abstract
    The performance of a sex identification system working on 16 ms segments of speech is discussed. Consideration is given to: the effectiveness of individual cepstral coefficients and frame-to-frame differences of those coefficients for sex identification; performance of the Gaussian classifier as a function of the amount of training data; the results of simplifications to the Gaussian classifier; performance when training and testing are done by phoneme, by phoneme-class, and on all speech; effects of training on one phoneme and testing on speech from different phonemes; and distribution of sex identification errors by speaker. It is concluded that, if the speech signal is of high quality, it should not be difficult to build a practical system which uses successive outputs from the single-frame Gaussian classifier to accurately classify a speaker as being male or female. Additional testing needs to be done on data which are not of such high quality.
  • Keywords
    speech recognition; 16 ms; female; frame-to-frame differences; individual cepstral coefficients; male; phoneme-class; sex identification errors; short speech segments; single-frame Gaussian classifier; training data; Acoustic testing; Cepstral analysis; Documentation; Liquids; Loudspeakers; NIST; Neural networks; Signal processing; Speech processing; Training data
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on
  • Conference_Location
    Toronto, Ont.
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0003-3
  • Type
    conf
  • DOI
    10.1109/ICASSP.1991.150363
  • Filename
    150363