DocumentCode
468345
Title
A New Method of the Automatically Marked Chinese Part of Speech Based on Gaussian Prior Smoothing Maximum Entropy Model
Author
Zhao, Wei ; Zhao, Faxing ; Li, Wenhui
Author_Institution
Jilin Univ., Changchun
Volume
3
fYear
2007
fDate
24-27 Aug. 2007
Firstpage
447
Lastpage
453
Abstract
With its many virtues, maximum entropy (ME) model has been favored in natural language processing. Because of the limitation of the training data, the parameters sparse phenomenon is serious in Chinese part of speech. The model is prone to over fit training data, therefore some smoothing method should be applied on maximum entropy model. While several smoothing methods for maximum entropy models have been proposed to address this problem, Gaussian prior smoothing method has an outstanding performance. Based on this smoothing maximum entropy model and characteristics of Chinese, a new Chinese part-of-speech system is presented. Result of experiment shows that it works well.
Keywords
Gaussian processes; maximum entropy methods; natural language processing; smoothing methods; speech processing; Gaussian prior smoothing maximum entropy model; Gaussian prior smoothing method; natural language processing; over fit training data; parameters sparse phenomenon; Computer science; Educational institutions; Entropy; Hidden Markov models; Natural language processing; Natural languages; Probability distribution; Smoothing methods; Speech processing; Training data;
fLanguage
English
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location
Haikou
Print_ISBN
978-0-7695-2874-8
Type
conf
DOI
10.1109/FSKD.2007.86
Filename
4406278
Link To Document