• DocumentCode
    1889570
  • Title

    Text-independent speaker identification based on feature transformation to phoneme-independent subspace

  • Author

    Lu, Haoze ; Okamoto, Haruka ; Nishida, Masafumi ; Horiuchi, Yasuo ; Kuroiwa, Shingo

  • Author_Institution
    Grad. Sch. of Adv. Integration Sci., Chiba Univ., Chiba
  • fYear
    2008
  • fDate
    10-12 Nov. 2008
  • Firstpage
    692
  • Lastpage
    695
  • Abstract
    In text-independent (TI) speaker identification, variation in phonetic information strongly affects identification performance. If the phonetic information in a speaker's speech data can be suppressed, a robust TI speaker identification system can be realized by using speech features that carry less phonetic information. In this paper, we propose a TI speaker identification method that suppresses phonetic information with a subspace method, under the assumption that a subspace with large variance in the speech feature space is a "phoneme-dependent subspace" and its complementary subspace is a "phoneme-independent subspace". Principal Component Analysis (PCA) is used to construct these subspaces. We carried out GMM-based speaker identification experiments using both a new feature vector from the proposed method and the conventional MFCC. The proposed method reduced the identification error rate by 21% compared with the conventional MFCC.
  • Keywords
    feature extraction; natural language processing; principal component analysis; speech recognition; Gaussian mixture model; feature transformation; phoneme-dependent subspace; phoneme-independent subspace; speech feature space; text-independent speaker identification; Data mining; Discrete cosine transforms; Eigenvalues and eigenfunctions; Error analysis; Mel frequency cepstral coefficient; Robustness; Speaker recognition; Speech processing; GMM; MFB; MFCC; phonetic information; subspace projection
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Communication Technology, 2008. ICCT 2008. 11th IEEE International Conference on
  • Conference_Location
    Hangzhou
  • Print_ISBN
    978-1-4244-2250-0
  • Electronic_ISBN
    978-1-4244-2251-7
  • Type

    conf

  • DOI
    10.1109/ICCT.2008.4716204
  • Filename
    4716204
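
The subspace construction described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the abstract does not state the input feature type, its dimensionality, or how many principal components are treated as phoneme-dependent, so the function names `fit_phoneme_independent_basis` and `project`, the parameter `n_dependent`, and the synthetic data are all assumptions made for the example. The sketch runs PCA on pooled feature frames, discards the `n_dependent` highest-variance directions, and projects each frame onto the remaining low-variance directions.

```python
import numpy as np


def fit_phoneme_independent_basis(frames, n_dependent):
    """Estimate a low-variance (assumed phoneme-independent) subspace via PCA.

    frames      : array of shape (n_frames, n_dims), pooled feature vectors.
    n_dependent : number of highest-variance principal directions to discard
                  (a hypothetical tuning parameter, not given in the abstract).
    Returns the feature mean and a basis of shape (n_dims, n_dims - n_dependent).
    """
    mean = frames.mean(axis=0)
    cov = np.cov(frames - mean, rowvar=False)
    # eigh returns eigenvalues in ascending order with orthonormal eigenvectors.
    _, eigvecs = np.linalg.eigh(cov)
    n_keep = frames.shape[1] - n_dependent
    # The first n_keep columns span the complement of the high-variance subspace.
    basis = eigvecs[:, :n_keep]
    return mean, basis


def project(frames, mean, basis):
    """Map feature frames into the low-variance subspace."""
    return (frames - mean) @ basis


# Synthetic stand-in for speech features: two dimensions with large variance.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5)) * np.array([10.0, 8.0, 1.0, 0.5, 0.2])

mean, basis = fit_phoneme_independent_basis(X, n_dependent=2)
Y = project(X, mean, basis)
```

In a setup like the one the abstract describes, the projected frames `Y` would then replace MFCC vectors as input to per-speaker GMM training and scoring.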