• DocumentCode
    284778
  • Title

    Augmented phonetic map for voice verification

  • Author

    Chang, Harry M.

  • Author_Institution
    NYNEX Science & Technology Inc., White Plains, NY, USA
  • Volume
    2
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    169
  • Abstract
    A perceptually based model for speaker identity verification (SIV) using derivative of phase spectrum (DPS) as the primary identity-bearing feature to model individual speakers´ vocal tract dynamics is presented. The basic technique used to model a speaker is to create a two-dimensional trajectory of changing vocal tract based on formant movement and pitch information. The map is further augmented with both instantaneous and dynamic feature parameters of DPS as well as with conventional energy-based acoustic features. A series of verification experiments was conducted, using a three-layer artificial neural network as a classifier, with an isolated digit database recorded over 11 different telephone handsets. The preliminary testing results suggest that this system performs significantly better than a baseline system using a standard cepstrum front-end
  • Keywords
    neural nets; speech recognition; derivative of phase spectrum; energy-based acoustic features; formant movement; isolated digit database; pitch information; speaker identity verification; telephone handsets; three-layer artificial neural network; vocal tract dynamics; voice verification; Cepstral analysis; Cepstrum; Filtering algorithms; Linear predictive coding; Loudspeakers; Robustness; Spatial databases; Speech analysis; Telephony; Time domain analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.226093
  • Filename
    226093