• DocumentCode
    3201683
  • Title

    Text independent Speaker Identification using Gaussian mixture model

  • Author

    Ting, Chee-Ming ; Salleh, Sh-Hussain ; Tan, Tian-Swee ; Ariff, A.K.

  • Author_Institution
    Fac. of Electr. Eng., Univ. Teknol. Malaysia, Skudai
  • fYear
    2007
  • fDate
    25-28 Nov. 2007
  • Firstpage
    194
  • Lastpage
    198
  • Abstract
    This paper describes text-independent (TI) Speaker Identification (ID) using Gaussian mixture models (GMM). The use of GMM approach is motivated by that the individual Gaussian components of a GMM are shown to represent some general speaker-dependent spectral shapes that are effective for speaker identity modeling. For speaker model training, a fast re-estimation algorithm based on highest likelihood mixture clustering is introduced. In this work, the GMM is evaluated on TI Speaker ID task via series of experiments (model convergence, effect of feature set, number of Gaussian components, and training utterance length on identification rate). The database consisted of Malay clean sentence speech database uttered by 10 speakers (3 female and 7 male). Each speaker provides the same 40 sentences utterances (average length- 3.5s) with different text. The sentences for testing were different from those for training. The GMM achieved 98.4% identification rate using 5 training sentences. The model training based on highest likelihood clustering is shown to perform comparably to conventional expectation-maximization training but consumes much shorter computational time.
  • Keywords
    Gaussian processes; expectation-maximisation algorithm; speaker recognition; text analysis; Gaussian mixture model; expectation-maximization training; likelihood mixture clustering; re-estimation algorithm; speaker-dependent spectral shapes; text independent speaker identification; Biometrics; Clustering algorithms; Hidden Markov models; Intelligent systems; Loudspeakers; Spatial databases; Speaker recognition; Spectral shape; Speech; Telephony; Gaussian Mixture Model; Speaker Indentification;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent and Advanced Systems, 2007. ICIAS 2007. International Conference on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-1-4244-1355-3
  • Electronic_ISBN
    978-1-4244-1356-0
  • Type

    conf

  • DOI
    10.1109/ICIAS.2007.4658373
  • Filename
    4658373