• DocumentCode
    3003303
  • Title

    On the use of instantaneous and transitional spectral information in speaker recognition

  • Author

    Soong, F.K. ; Rosenberg, A.E.

  • Author_Institution
    AT&T Bell Laboratories, Murray Hill, NJ, USA
  • Volume
    11
  • fYear
    1986
  • fDate
    31503
  • Firstpage
    877
  • Lastpage
    880
  • Abstract
    The use of instantaneous and transitional spectral representations of spoken utterances for speaker recognition is investigated. LPC derived-cepstral coefficients are used to represent instantaneous spectral information and best linear fits of each cepstral coefficient over a specified time window are used to represent transitional information. An evaluation has been carried out using a data base of isolated digit utterances over dialed-up telephone lines by 10 talkers. Two vector quantization (VQ) codebooks, instantaneous and transitional, are constructed from training utterances for each speaker. The experimental results show that the instantaneous and transitional representations are relatively uncorrelated thus providing complementary information for speaker recognition. A rectangular window of approximately 100-150 ms duration provides an effective estimate of spectral transitions for speaker recognition. Also, simple transmission channel variations are shown to affect the instantaneous spectral representations and the corresponding recognition performance significantly, while the transitional representations and performance are relatively resistant.
  • Keywords
    Cepstral analysis; Data mining; Filter bank; Linear predictive coding; Predictive coding; Speaker recognition; Spectral analysis; Speech analysis; Telephony; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '86.
  • Type

    conf

  • DOI
    10.1109/ICASSP.1986.1168882
  • Filename
    1168882