DocumentCode :
706296
Title :
Automatic language recognition with tonal and non-tonal language pre-classification
Author :
Liang Wang ; Ambikairajah, Eliathamby ; Choi, Eric H. C.
Author_Institution :
Sch. of EE&Telecomm., Univ. of New South Wales, Sydney, NSW, Australia
fYear :
2007
fDate :
3-7 Sept. 2007
Firstpage :
2375
Lastpage :
2379
Abstract :
Parallel Phoneme Recognition followed by Language Modelling (PPRLM) systems currently provide state of the art language identification performance on conversational telephone speech. In this paper an innovative method for tonal and non-tonal language pre-classification by using prosodie information is reported. Our motivation is to improve recognition accuracy and save the amount of CPU run-time while handling large number of languages. Also, by incorporating different confidence measures into the traditional PPRLM framework, we propose an optimized language recognition system that can be applied in an open-set language recognition task. For a task of 12 target languages and 4 non-target languages, our results show that with the optimized pre-classification, Universal Background Phone Model confidence measuring and Witten-Bell discounting the system can achieve recognition accuracy rates of 77.9% for 30-sec speech segments and 49.2% for 10-sec speech segments.
Keywords :
natural language processing; speech recognition; CPU run-time; PPRLM systems; Witten-Bell discounting system; automatic language recognition; conversational telephone speech; language identification performance; nontarget languages; nontonal language preclassification; open-set language recognition task; optimized language recognition system; optimized preclassification; parallel phoneme recognition followed-by-language modelling; prosodic information; recognition accuracy improvement; recognition accuracy rates; speech segments; target languages; tonal language preclassification; universal background phone model confidence measure; Accuracy; Acoustics; Europe; Feature extraction; Hidden Markov models; Speech; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2007 15th European
Conference_Location :
Poznan
Print_ISBN :
978-839-2134-04-6
Type :
conf
Filename :
7099233
Link To Document :
بازگشت