DocumentCode
2697063
Title
Addressing Channel Mismatch through Speaker Discriminative Transforms
Author
Pelecanos, Jason ; Navratil, J. ; Ramaswamy, Ganesh N.
Author_Institution
Human Language Technol. Dept., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY
fYear
2006
fDate
28-30 June 2006
Firstpage
1
Lastpage
6
Abstract
This paper presents a discriminative criterion applied to Gaussian mixture models (GMMs) to reduce handset mismatch. The criterion is related to the log-likelihood-ratio (LLR) scoring approach commonly used in GMMs for speaker recognition. The algorithm attempts to perform a direct mapping of features from one channel type to an assumed undistorted target channel but with the goal of maximizing speaker discrimination using the transform. The transform attempts to maximize the posterior probability of a group of speaker models given their corresponding speech observations recorded on a different channel
Keywords
Gaussian distribution; probability; speaker recognition; transforms; wireless channels; GMM; Gaussian mixture model; LLR scoring approach; channel mismatch addressing; direct mapping; log-likelihood-ratio; posterior probability; speaker discriminative transform; speaker recognition; Biometrics; Degradation; Humans; Natural languages; Performance evaluation; Speaker recognition; Speech recognition; Speech synthesis; Telephone sets; Telephony;
fLanguage
English
Publisher
ieee
Conference_Titel
Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The
Conference_Location
San Juan
Print_ISBN
1-424400471-1
Electronic_ISBN
1-4244-0472-X
Type
conf
DOI
10.1109/ODYSSEY.2006.248111
Filename
4013528
Link To Document