• DocumentCode
    178049
  • Title

    A novel scheme for speaker recognition using a phonetically-aware deep neural network

  • Author

    Yun Lei ; Scheffer, Nicolas ; Ferrer, Luciana ; McLaren, Moray

  • Author_Institution
    Speech Technol. & Res. Lab., SRI Int., Menlo Park, CA, USA
  • fYear
    2014
  • fDate
    4-9 May 2014
  • Firstpage
    1695
  • Lastpage
    1699
  • Abstract
    We propose a novel framework for speaker recognition in which extraction of sufficient statistics for the state-of-the-art i-vector model is driven by a deep neural network (DNN) trained for automatic speech recognition (ASR). Specifically, the DNN replaces the standard Gaussian mixture model (GMM) to produce frame alignments. The use of an ASR-DNN system in the speaker recognition pipeline is attractive as it integrates the information from speech content directly into the statistics, allowing the standard backends to remain unchanged. Improvement from the proposed framework compared to a state-of-the-art system are of 30% relative at the equal error rate when evaluated on the telephone conditions from the 2012 NIST speaker recognition evaluation (SRE). The proposed framework is a successful way to efficiently leverage transcribed data for speaker recognition, thus opening up a wide spectrum of research directions.
  • Keywords
    Gaussian processes; neural nets; speaker recognition; 2012 NIST SRE; 2012 NIST speaker recognition evaluation; ASR; ASR-DNN system; automatic speech recognition; frame alignment; i-vector model; phonetically-aware deep neural network; speaker recognition pipeline; speech content; standard GMM; standard Gaussian mixture model; statistic extraction; Hidden Markov models; Mathematical model; NIST; Speaker recognition; Speech; Speech recognition; deep neural network; speaker recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
  • Conference_Location
    Florence
  • Type

    conf

  • DOI
    10.1109/ICASSP.2014.6853887
  • Filename
    6853887