• DocumentCode
    3430617
  • Title

    A Fishervoice based feature fusion method for short utterance speaker recognition

  • Author

    Chenhao Zhang ; Zheng, Thomas Fang

  • Author_Institution
    Div. of Tech. Innovation & Dev., Tsinghua Univ., Beijing, China
  • fYear
    2013
  • fDate
    6-10 July 2013
  • Firstpage
    165
  • Lastpage
    169
  • Abstract
    For GMM-UBM based text-independent speaker recognition, the performance decreases significantly when the utterance is getting too short, and that is mostly due to the lack of distinguishable information from a single kind of feature. Fusion of different features followed by a dimensionality reduction process has been proved useful to provide a satisfying solution. However, some fusion methods based on the traditional Linear Discriminant Analysis (LDA) may cause the singular matrix problem. Therefore, a Fishervoice based feature fusion method incorporating with the Principal Component Analysis (PCA) and the LDA is proposed, where several features, such as MFCC, PLAR and LPCC, which are commonly used, are concatenated, and then projected into a lower-dimensional subspace. Compared with the baseline GMM-UBM systems using any single feature and using the LDA based fusion method, the proposed one can effectively reduce the equal error rate and give the best performance for text-independent speaker recognition for utterances as short as about 2 seconds.
  • Keywords
    matrix algebra; principal component analysis; sensor fusion; speaker recognition; Fishervoice based feature fusion method; GMM-UBM; LDA; PCA; linear discriminant analysis; principal component analysis; short utterance speaker recognition; singular matrix problem; text-independent speaker recognition; Feature extraction; Mel frequency cepstral coefficient; Principal component analysis; Speaker recognition; Speech; Speech recognition; Vectors; Feature fusion; Fishervoice; LDA; PCA; Short utterance speaker recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing (ChinaSIP), 2013 IEEE China Summit & International Conference on
  • Conference_Location
    Beijing
  • Type

    conf

  • DOI
    10.1109/ChinaSIP.2013.6625320
  • Filename
    6625320