• DocumentCode
    699885
  • Title

    From GMM to HMM for embedded password-based speaker recognition

  • Author

    Larcher, Anthony ; Bonastre, Jean-Francois ; Mason, John S. D.

  • Author_Institution
    Lab. d´Inf. d´Avignon (LIA), UAPV, France
  • fYear
    2008
  • fDate
    25-29 Aug. 2008
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Embedded speaker recognition in mobile devices involves a limited amount of computing resource but also is linked with several ergonomic constraints. For example both the enrolment and the test have to be done using short audio sequences. Even if they proved their efficiency in more classical situations, GMM/UBM based systems show their limits in this context. This paper deals with this problem and proposes to take into account the linguistic nature of the speech material inside the GMM/UBM framework. The proposed solution mixes the text-independent aspects of the GMM/UBM with a semi-continuous like approach in order to deal with the text-dependent information. This system respects both the resource and the ergonomic constraints of the considered application field. The preliminary experiments are done on the publicly available database ValidDB and show the potential of the proposed approach. Particularly, when compared to the GMM/UBM, our approach decreases drastically both the computational cost and the equal error rates when impostors don´t know the user passwords. For other situations the performance remains comparable between both approaches.
  • Keywords
    Gaussian processes; ergonomics; hidden Markov models; message authentication; mixture models; mobile computing; speaker recognition; GMM-UBM framework; Gaussian mixture model; HMM; PASSWORD-BASED SPEAKER RECOGNITION; ValidDB database; computational cost; equal error rates; ergonomic constraints; hidden Markov model; mobile devices; semi continuous like approach; text-dependent information; text-independent aspects; universal background model; Acoustics; Adaptation models; Computational modeling; Hidden Markov models; Noise measurement; Speaker recognition; Speech;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2008 16th European
  • Conference_Location
    Lausanne
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7080417