DocumentCode :
468196
Title :
The Performance Appraising of Language Models and the Entropy Estimation of Chinese
Author :
Zhang, Yangsen ; Huang, Gaijuan ; Mai, Miao
Author_Institution :
Beijing Inf. Sci. & Technol. Univ., Beijing
Volume :
2
fYear :
2007
fDate :
24-27 Aug. 2007
Firstpage :
50
Lastpage :
54
Abstract :
We give a quantified reasoning and description of the perplexity for evaluating language models using the concept of entropy in information theory: The smaller the entropy of the language estimated by the language model is, the more precise the language model is; an interpolated model based on two (n-1)-gram models is better than the (n-1)-gram component models, but not a n-gram model. We also explore the methods to estimating the entropy of Chinese using language models.
Keywords :
entropy; information analysis; natural language processing; entropy estimation; information theory; language models; quantified reasoning; Appraisal; Codes; Computational linguistics; Electronic mail; Entropy; Information science; Information theory; Natural languages; Predictive models; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems and Knowledge Discovery, 2007. FSKD 2007. Fourth International Conference on
Conference_Location :
Haikou
Print_ISBN :
978-0-7695-2874-8
Type :
conf
DOI :
10.1109/FSKD.2007.579
Filename :
4406044
Link To Document :
بازگشت