• DocumentCode
    3863294
  • Title

    Chinese syllable-to-character conversion with recurrent neural network based supervised sequence labelling

  • Author

    Yi Liu;Jing Hua;Xiangang Li;Tong Fu;Xihong Wu

  • Author_Institution
    Peking University, Beijing, China
  • fYear
    2015
  • Firstpage
    350
  • Lastpage
    353
  • Abstract
    Chinese Syllable-to-Character (S2C) conversion is the important component for Input Methods, and the key problem in Chinese S2C conversion is the serious phenomenon in Chinese language. In order to disambiguate homophones to improve Chinese S2C conversion, in this paper, Chinese S2C conversion is treated as a sequence labelling task, and the recurrent neural network (RNN) based on supervise sequence labelling is introduced to achieve the direct conversion from syllable sequences to word sequences. Through the direct conversion with the proposed RNN, the cascade error in multi-pass approaches can be eliminated effectively. Experimental results indicate that, in second pass decoding, the re-ranking with RNN language model has better performance than N-gram language model in both perplexity and S2C conversion accuracy. Moreover, the direct S2C conversion with RNN can improve the accuracy from 93.77% (RNN language model) to 94.17%.
  • Keywords
    "Decoding","Training","Labeling","Context","Recurrent neural networks","Prediction algorithms","Vocabulary"
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2015 Asia-Pacific
  • Type

    conf

  • DOI
    10.1109/APSIPA.2015.7415292
  • Filename
    7415292