DocumentCode :
66235
Title :
Shifted-Delta MLP Features for Spoken Language Recognition
Author :
Wang, Haipeng ; Leung, Cheung-Chi ; Lee, Tan ; Ma, Bin ; Li, Haizhou
Author_Institution :
Dept. of Electr. Eng., Chinese Univ. of Hong Kong, Hong Kong, China
Volume :
20
Issue :
1
fYear :
2013
fDate :
Jan. 2013
Firstpage :
15
Lastpage :
18
Abstract :
This letter presents our study of applying phoneme posterior features for spoken language recognition (SLR). In our work, phoneme posterior features are estimated from a multilayer perceptron (MLP) based phoneme recognizer, and are further processed through transformations including taking logarithm, PCA transformation, and appending shifted delta coefficients. The resulting shifted-delta MLP (SDMLP) features show similar distribution as conventional shifted-delta cepstral (SDC) features, and are more robust compared to the SDC features. Experiments on the NIST LRE2005 dataset show that the SDMLP features fit well with the state-of-the-art GMM-based SLR systems, and SDMLP features outperform SDC features significantly.
Keywords :
cepstral analysis; multilayer perceptrons; principal component analysis; speech recognition; GMM-based SLR systems; NIST LRE2005 dataset; PCA transformation; SDC features; SDMLP; multilayer perceptron based phoneme recognizer; shifted delta coefficients; shifted-delta MLP feature; shifted-delta cepstral features; spoken language recognition; Cepstral analysis; Feature extraction; Principal component analysis; Robustness; Speech; Speech recognition; Vectors; Feature robustness; shifted-delta MLP features; shifted-delta cepstral features; spoken language recognition;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2012.2227312
Filename :
6353152
Link To Document :
بازگشت