• DocumentCode
    337491
  • Title

    On the use of orthogonal GMM in speaker recognition

  • Author

    Liu, Li ; He, Jialong

  • Author_Institution
    Dept. of Speech & Hearing Sci., Arizona State Univ., Tempe, AZ, USA
  • Volume
    2
  • fYear
    1999
  • fDate
    15-19 Mar 1999
  • Firstpage
    845
  • Abstract
    The Gaussian mixture modeling (GMM) techniques are increasingly being used for both speaker identification and verification. Most of these models assume diagonal covariance matrices. Although empirically any distribution can be approximated with a diagonal GMM, a large number of mixture components are usually needed to obtain a good approximation. A consequence of using a large GMM is that its training is time consuming and its response speed is very slow. This paper proposes a modification to the standard diagonal GMM approach. The proposed scheme includes an orthogonal transformation: feature vectors are first transformed to the space spanned by the eigenvectors of the covariance matrix before applying to the diagonal GMM. Only a small computational load is introduced by this transformation, but results from both speaker identification and verification experiments indicated that the orthogonal transformation considerably improves the recognition performance. For a specific performance level, the GMM with orthogonal transform needs only one-fourth the number of Gaussian functions required by the standard GMM
  • Keywords
    Gaussian processes; covariance matrices; speaker recognition; Gaussian functions; Gaussian mixture modeling; diagonal GMM; diagonal covariance matrices; eigenvectors; experiments; feature vectors; orthogonal GMM; orthogonal transformation; recognition performance; response speed; speaker identification; speaker recognition; speaker verification; training; Auditory system; Covariance matrix; Feature extraction; Helium; Iterative algorithms; Iterative methods; Speaker recognition; Speech; Training data; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
  • Conference_Location
    Phoenix, AZ
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-5041-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1999.759803
  • Filename
    759803