DocumentCode
3165992
Title
Recognition of highly imbalanced code-mixed bilingual speech with frame-level language detection based on blurred posteriorgram
Author
Yeh, Ching-Feng ; Heidel, Aaron ; Lee, Hong-Yi ; Lee, Lin-Shan
Author_Institution
Grad. Inst. of Commun. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fYear
2012
fDate
25-30 March 2012
Firstpage
4873
Lastpage
4876
Abstract
In this work, we proposed a new framework for recognition of highly imbalanced code-mixed bilingual speech using an additional frame-level language detector in the conventional recognition system. Blurred posteriorgram features (BPFs) are also proposed to be used in the language detector. The approach was evaluated with real spontaneous lectures offered at National Taiwan University. The highly imbalanced language distribution in code-mixed speech makes the task difficult. Preliminary experimental results showed not only very good performance improvement, but the improvement is complementary to that brought by better acoustic models, whether due to better adaptation approach or increased training data. The code-mixed bilingual speech is frequently used in the daily lives of many people in the globalized world today.
Keywords
natural language processing; speech processing; acoustic model; blurred posteriorgram feature; code mixed bilingual speech recognition; frame level language detection; frame level language detector; imbalanced language distribution; recognition system; Acoustics; Adaptation models; Detectors; Hidden Markov models; Speech; Speech coding; Speech recognition; ASR; code-mixing; multilingual;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location
Kyoto
ISSN
1520-6149
Print_ISBN
978-1-4673-0045-2
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2012.6289011
Filename
6289011
Link To Document