• DocumentCode
    2697063
  • Title

    Addressing Channel Mismatch through Speaker Discriminative Transforms

  • Author

    Pelecanos, Jason ; Navratil, J. ; Ramaswamy, Ganesh N.

  • Author_Institution
    Human Language Technol. Dept., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY
  • fYear
    2006
  • fDate
    28-30 June 2006
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    This paper presents a discriminative criterion applied to Gaussian mixture models (GMMs) to reduce handset mismatch. The criterion is related to the log-likelihood-ratio (LLR) scoring approach commonly used in GMMs for speaker recognition. The algorithm attempts to perform a direct mapping of features from one channel type to an assumed undistorted target channel but with the goal of maximizing speaker discrimination using the transform. The transform attempts to maximize the posterior probability of a group of speaker models given their corresponding speech observations recorded on a different channel
  • Keywords
    Gaussian distribution; probability; speaker recognition; transforms; wireless channels; GMM; Gaussian mixture model; LLR scoring approach; channel mismatch addressing; direct mapping; log-likelihood-ratio; posterior probability; speaker discriminative transform; speaker recognition; Biometrics; Degradation; Humans; Natural languages; Performance evaluation; Speaker recognition; Speech recognition; Speech synthesis; Telephone sets; Telephony;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
  • Conference_Location
    San Juan
  • Print_ISBN
    1-424400471-1
  • Electronic_ISBN
    1-4244-0472-X
  • Type

    conf

  • DOI
    10.1109/ODYSSEY.2006.248111
  • Filename
    4013528