DocumentCode
66235
Title
Shifted-Delta MLP Features for Spoken Language Recognition
Author
Wang, Haipeng ; Leung, Cheung-Chi ; Lee, Tan ; Ma, Bin ; Li, Haizhou
Author_Institution
Dept. of Electr. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
Volume
20
Issue
1
fYear
2013
fDate
Jan. 2013
Firstpage
15
Lastpage
18
Abstract
This letter presents our study of applying phoneme posterior features for spoken language recognition (SLR). In our work, phoneme posterior features are estimated from a multilayer perceptron (MLP) based phoneme recognizer, and are further processed through transformations including taking logarithm, PCA transformation, and appending shifted delta coefficients. The resulting shifted-delta MLP (SDMLP) features show similar distribution as conventional shifted-delta cepstral (SDC) features, and are more robust compared to the SDC features. Experiments on the NIST LRE2005 dataset show that the SDMLP features fit well with the state-of-the-art GMM-based SLR systems, and SDMLP features outperform SDC features significantly.
Keywords
cepstral analysis; multilayer perceptrons; principal component analysis; speech recognition; GMM-based SLR systems; NIST LRE2005 dataset; PCA transformation; SDC features; SDMLP; multilayer perceptron based phoneme recognizer; shifted delta coefficients; shifted-delta MLP feature; shifted-delta cepstral features; spoken language recognition; Cepstral analysis; Feature extraction; Principal component analysis; Robustness; Speech; Speech recognition; Vectors; Feature robustness; shifted-delta MLP features; shifted-delta cepstral features; spoken language recognition;
fLanguage
English
Journal_Title
Signal Processing Letters, IEEE
Publisher
ieee
ISSN
1070-9908
Type
jour
DOI
10.1109/LSP.2012.2227312
Filename
6353152
Link To Document