DocumentCode
3863294
Title
Chinese syllable-to-character conversion with recurrent neural network based supervised sequence labelling
Author
Yi Liu;Jing Hua;Xiangang Li;Tong Fu;Xihong Wu
Author_Institution
Peking University, Beijing, China
fYear
2015
Firstpage
350
Lastpage
353
Abstract
Chinese Syllable-to-Character (S2C) conversion is the important component for Input Methods, and the key problem in Chinese S2C conversion is the serious phenomenon in Chinese language. In order to disambiguate homophones to improve Chinese S2C conversion, in this paper, Chinese S2C conversion is treated as a sequence labelling task, and the recurrent neural network (RNN) based on supervise sequence labelling is introduced to achieve the direct conversion from syllable sequences to word sequences. Through the direct conversion with the proposed RNN, the cascade error in multi-pass approaches can be eliminated effectively. Experimental results indicate that, in second pass decoding, the re-ranking with RNN language model has better performance than N-gram language model in both perplexity and S2C conversion accuracy. Moreover, the direct S2C conversion with RNN can improve the accuracy from 93.77% (RNN language model) to 94.17%.
Keywords
"Decoding","Training","Labeling","Context","Recurrent neural networks","Prediction algorithms","Vocabulary"
Publisher
ieee
Conference_Titel
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2015 Asia-Pacific
Type
conf
DOI
10.1109/APSIPA.2015.7415292
Filename
7415292
Link To Document