DocumentCode :
2608405
Title :
Combining Cepstral and Prosodic Features in Language Identification
Author :
Yin, Bo ; Ambikairajah, Eliathamby ; Chen, Fang
Author_Institution :
Sch. of Electr. Eng. & Telecommun., National ICT Australia Ltd., Eveleigh, NSW
Volume :
4
fYear :
0
fDate :
0-0 0
Firstpage :
254
Lastpage :
257
Abstract :
A novel approach of combining cepstral features and prosodic features in language identification is presented in this paper. This combination approach shows a significant improvement on a GMM-UBM based language identification (LID) system which utilizes modern shifted delta cepstrum (SDC) and feature warping techniques. The proposed system achieves a high accuracy of 87.1% on a 10-language task, and outperforms the baseline system by 12%. The prosodic features are proven to be very effective in both tonal and non-tonal LID, as they deliver new language-discrimination information in addition to those from widely used cepstral features. Additionally, the performance of MFCC and PLP features with different coefficient numbers in language identification tasks are researched and compared. Less number of coefficients is more likely to be sufficient or even better for language identification
Keywords :
cepstral analysis; natural languages; speech recognition; GMM-UBM based language identification; cepstral features; feature warping; prosodic features; shifted delta cepstrum; Australia; Cepstral analysis; Cepstrum; Data mining; Mel frequency cepstral coefficient; Performance analysis; Speech processing; Speech recognition; Telephony; Unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
ISSN :
1051-4651
Print_ISBN :
0-7695-2521-0
Type :
conf
DOI :
10.1109/ICPR.2006.381
Filename :
1699828
Link To Document :
بازگشت