DocumentCode :
3423257
Title :
Language identification using MLKSFM for pre-classification with novel front-end features
Author :
Wang, Liang ; Ambikairajah, Eliathamby ; Choi, Eric H C
Author_Institution :
Sch. of Electr. Eng. & Telecommun., Univ. of New South Wales, Sydney, NSW
fYear :
2008
fDate :
March 31 2008-April 4 2008
Firstpage :
4245
Lastpage :
4248
Abstract :
This paper presents two novel contributions to automatic language identification. The first one is the use of the modified multi-layer Kohonen self-organizing feature map (MLKSFM) as a pre-classification for language identification (LID). Secondly, we discuss the novel application of empirical mode decomposition (EMD) to generate features for the LID pre-classification task. The use of instantaneous frequency (IF) and instantaneous amplitude (IA) of a speech signal as features for the pre-classifier is investigated. The experiment results on a 16-language speech database indicates that, the EMD by itself cannot perform well in the LID task, however it helps to improve the pre-classification rate when concatenated with other cepstral features. The overall LID performance is also increased when pre-classification is applied. We achieve LID rates of 85.2% and 62.3% for 45-sec and 10-sec test utterances, respectively.
Keywords :
audio databases; self-organising feature maps; speech recognition; 16-language speech database; empirical mode decomposition; instantaneous amplitude; instantaneous frequency; language identification; modified multi-layer Kohonen self-organizing feature map; speech signal; Amplitude estimation; Australia; Frequency estimation; Laboratories; Natural languages; Signal analysis; Spatial databases; Speech; System performance; Vector quantization; Language identification; empirical mode decomposition; modified MLKSFM; modified group delay function; pre-classification;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on
Conference_Location :
Las Vegas, NV
ISSN :
1520-6149
Print_ISBN :
978-1-4244-1483-3
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2008.4518592
Filename :
4518592
Link To Document :
بازگشت