• DocumentCode
    66235
  • Title

    Shifted-Delta MLP Features for Spoken Language Recognition

  • Author

    Wang, Haipeng ; Leung, Cheung-Chi ; Lee, Tan ; Ma, Bin ; Li, Haizhou

  • Author_Institution
    Dept. of Electr. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
  • Volume
    20
  • Issue
    1
  • fYear
    2013
  • fDate
    Jan. 2013
  • Firstpage
    15
  • Lastpage
    18
  • Abstract
    This letter presents our study of applying phoneme posterior features for spoken language recognition (SLR). In our work, phoneme posterior features are estimated from a multilayer perceptron (MLP) based phoneme recognizer, and are further processed through transformations including taking logarithm, PCA transformation, and appending shifted delta coefficients. The resulting shifted-delta MLP (SDMLP) features show similar distribution as conventional shifted-delta cepstral (SDC) features, and are more robust compared to the SDC features. Experiments on the NIST LRE2005 dataset show that the SDMLP features fit well with the state-of-the-art GMM-based SLR systems, and SDMLP features outperform SDC features significantly.
  • Keywords
    cepstral analysis; multilayer perceptrons; principal component analysis; speech recognition; GMM-based SLR systems; NIST LRE2005 dataset; PCA transformation; SDC features; SDMLP; multilayer perceptron based phoneme recognizer; shifted delta coefficients; shifted-delta MLP feature; shifted-delta cepstral features; spoken language recognition; Cepstral analysis; Feature extraction; Principal component analysis; Robustness; Speech; Speech recognition; Vectors; Feature robustness; shifted-delta MLP features; shifted-delta cepstral features; spoken language recognition;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Letters, IEEE
  • Publisher
    ieee
  • ISSN
    1070-9908
  • Type

    jour

  • DOI
    10.1109/LSP.2012.2227312
  • Filename
    6353152