• DocumentCode
    3233769
  • Title

    Continuous probabilistic acoustic map for speaker identification

  • Author

    Tseng, Belle L. ; Soong, Frank K. ; Rosenberg, Aaron E.

  • Author_Institution
    MIT, Cambridge, MA, USA
  • Volume
    2
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    161
  • Abstract
    A continuous probabilistic acoustic map (CPAM) approach to speaker recognition is investigated. In the CPAM formulation, the speech input of a speaker is parameterized as a mixture of tied, universal probability density functions (PDFs) with either a CPAM model alone for text-independent operation or a CPAM-based hidden Markov model (HMM) for text-dependent operation. A continuously spoken digit database of 20 speakers (10 M, 10 F) is used to evaluate the CPAM approach in both identification and verification performance. The CPAM approach is shown to perform better than a vector quantization based approach in text-independent speaker recognition, and as well as the text-dependent, conventional, continuous mixture HMM approach with significant representation efficiency. In particular, the CPAM-based HMM achieves an identification error rate of 1.7% and a verification equal-error rate of 4.0% with a CPAM of 128 PDFs while a conventional, continuous mixtures HMM needs 400 PDFs to achieve corresponding error rates of 1.9% and 4.0% using the same combined cepstral features and three-digit test utterances
  • Keywords
    hidden Markov models; speech recognition; HMM; cepstral features; continuous probabilistic acoustic map; continuously spoken digit database; hidden Markov model; identification error rate; speaker identification; speaker recognition; text-dependent operation; text-independent operation; three-digit test utterances; universal probability density functions; verification equal-error rate; Cepstral analysis; Databases; Error analysis; Hidden Markov models; Loudspeakers; Probability density function; Speaker recognition; Speech; Testing; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.226095
  • Filename
    226095