• DocumentCode
    446757
  • Title

    An affine transform for speaker recognition enhancement under mismatched coding conditions

  • Author

    Salam, A. Abdul ; Fakhr, Waleed ; Hamdy, Nadder

  • Author_Institution
    Dept. of Commun. & Electron., Arab Acad. for Sci. & Technol., Cairo
  • Volume
    2
  • fYear
    2003
  • fDate
    30-30 Dec. 2003
  • Firstpage
    621
  • Abstract
    Text-independent speaker recognition performance suffers significantly under mismatched coding conditions between training and testing speech data. In this paper, a baseline HMM-based speaker recognition system is tested under various mismatched conditions with a large number of different HMM topologies. Training and testing the models using only the voiced segments of the samples is then considered. A technique based on a diagonal affine transform in the cepstrum domain is proposed, which maps the mismatched test cepstrum data onto the baseline cepstrum domain. Results for 2 different state-of-the-art codecs and a large number of different model topologies show encouraging improvement in performance compared to the mismatched cases
  • Keywords
    affine transforms; hidden Markov models; speaker recognition; HMM topology; baseline cepstrum domain; codecs; diagonal affine transform; hidden Markov model; mismatched coding conditions; mismatched test cepstrum data; speaker recognition enhancement; testing speech data; training data set; Cepstral analysis; Cepstrum; Hidden Markov models; Internet telephony; Phase change materials; Speaker recognition; Speech codecs; Speech coding; System testing; Topology; Affine transform; HMM; Mismatched Conditions; Speaker recognition; voiced;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Circuits and Systems, 2003 IEEE 46th Midwest Symposium on
  • Conference_Location
    Cairo
  • ISSN
    1548-3746
  • Print_ISBN
    0-7803-8294-3
  • Type

    conf

  • DOI
    10.1109/MWSCAS.2003.1562363
  • Filename
    1562363