Title :
Combining Cepstral and Prosodic Features in Language Identification
Author :
Yin, Bo ; Ambikairajah, Eliathamby ; Chen, Fang
Author_Institution :
Sch. of Electr. Eng. & Telecommun., National ICT Australia Ltd., Eveleigh, NSW
Abstract :
A novel approach of combining cepstral features and prosodic features in language identification is presented in this paper. This combination approach shows a significant improvement on a GMM-UBM based language identification (LID) system which utilizes modern shifted delta cepstrum (SDC) and feature warping techniques. The proposed system achieves a high accuracy of 87.1% on a 10-language task, and outperforms the baseline system by 12%. The prosodic features are proven to be very effective in both tonal and non-tonal LID, as they deliver new language-discrimination information in addition to those from widely used cepstral features. Additionally, the performance of MFCC and PLP features with different coefficient numbers in language identification tasks are researched and compared. Less number of coefficients is more likely to be sufficient or even better for language identification
Keywords :
cepstral analysis; natural languages; speech recognition; GMM-UBM based language identification; cepstral features; feature warping; prosodic features; shifted delta cepstrum; Australia; Cepstral analysis; Cepstrum; Data mining; Mel frequency cepstral coefficient; Performance analysis; Speech processing; Speech recognition; Telephony; Unsupervised learning;
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2521-0
DOI :
10.1109/ICPR.2006.381