مرکز منطقه ای اطلاع رساني علوم و فناوري - Addressing Channel Mismatch through Speaker Discriminative Transforms

DocumentCode :

2697063

Title :

Addressing Channel Mismatch through Speaker Discriminative Transforms

Author :

Pelecanos, Jason ; Navratil, J. ; Ramaswamy, Ganesh N.

Author_Institution :

Human Language Technol. Dept., IBM Thomas J. Watson Res. Center, Yorktown Heights, NY

fYear :

2006

fDate :

28-30 June 2006

Firstpage :

Lastpage :

Abstract :

This paper presents a discriminative criterion applied to Gaussian mixture models (GMMs) to reduce handset mismatch. The criterion is related to the log-likelihood-ratio (LLR) scoring approach commonly used in GMMs for speaker recognition. The algorithm attempts to perform a direct mapping of features from one channel type to an assumed undistorted target channel but with the goal of maximizing speaker discrimination using the transform. The transform attempts to maximize the posterior probability of a group of speaker models given their corresponding speech observations recorded on a different channel

Keywords :

Gaussian distribution; probability; speaker recognition; transforms; wireless channels; GMM; Gaussian mixture model; LLR scoring approach; channel mismatch addressing; direct mapping; log-likelihood-ratio; posterior probability; speaker discriminative transform; speaker recognition; Biometrics; Degradation; Humans; Natural languages; Performance evaluation; Speaker recognition; Speech recognition; Speech synthesis; Telephone sets; Telephony;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Speaker and Language Recognition Workshop, 2006. IEEE Odyssey 2006: The

Conference_Location :

San Juan

Print_ISBN :

1-424400471-1

Electronic_ISBN :

1-4244-0472-X

Type :

conf

DOI :

10.1109/ODYSSEY.2006.248111

Filename :

4013528

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2697063