DocumentCode :
3165992
Title :
Recognition of highly imbalanced code-mixed bilingual speech with frame-level language detection based on blurred posteriorgram
Author :
Yeh, Ching-Feng ; Heidel, Aaron ; Lee, Hong-Yi ; Lee, Lin-Shan
Author_Institution :
Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4873
Lastpage :
4876
Abstract :
In this work, we proposed a new framework for recognition of highly imbalanced code-mixed bilingual speech using an additional frame-level language detector in the conventional recognition system. Blurred posteriorgram features (BPFs) are also proposed to be used in the language detector. The approach was evaluated with real spontaneous lectures offered at National Taiwan University. The highly imbalanced language distribution in code-mixed speech makes the task difficult. Preliminary experimental results showed not only very good performance improvement, but the improvement is complementary to that brought by better acoustic models, whether due to better adaptation approach or increased training data. The code-mixed bilingual speech is frequently used in the daily lives of many people in the globalized world today.
Keywords :
natural language processing; speech processing; acoustic model; blurred posteriorgram feature; code mixed bilingual speech recognition; frame level language detection; frame level language detector; imbalanced language distribution; recognition system; Acoustics; Adaptation models; Detectors; Hidden Markov models; Speech; Speech coding; Speech recognition; ASR; code-mixing; multilingual;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6289011
Filename :
6289011
Link To Document :
بازگشت