• DocumentCode
    2799970
  • Title

    An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition

  • Author

    Tsao, Yu ; Sun, Hanwu ; Li, Haizhou ; Lee, Chin-Hui

  • Author_Institution
    Nat. Inst. of Inf. & Commun. Technol., Kyoto, Japan
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    4422
  • Lastpage
    4425
  • Abstract
    We propose an acoustic segment model (ASM) approach to incorporating temporal information into speaker modeling in text-independent speaker recognition. In training, the proposed framework first estimates a collection of ASM-based universal background models (UBMs). Multiple sets of speaker-specific ASMs are then obtained by adapting the ASM-based UBMs with speaker-specific enrollment data. A novel usage of language models of the ASM units is also proposed to characterize transitions among ASMs. In the testing phase the ASM sets for the claimed speaker and UBMs, along with a bigram ASM language model, are used to calculate detection scores for each given test utterance. We report on speaker recognition experiments using the NIST 2001 SRE database. The results clearly indicate that the proposed ASM-based method achieves a notable improvement over the GMM-based speaker modeling in which no temporal modeling is considered. Moreover, a further error reduction is obtained by integrating the language model, another inclusion of temporal properties made possibly by ASM based speaker modeling.
  • Keywords
    acoustic signal processing; speaker recognition; ASM; UBM; acoustic segment model approach; speaker modeling; temporal information; text independent speaker recognition; universal background models; Clustering algorithms; Hidden Markov models; Iterative algorithms; Loudspeakers; Natural languages; Partitioning algorithms; Pattern classification; Quantization; Speaker recognition; Testing; Speaker recognition; acoustic segment model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495617
  • Filename
    5495617