DocumentCode
3730399
Title
A new representation of statistical language model
Author
Zhenjun Yue; Siyuan Gu; Chanzhen Rong; Yuan Wang
Author_Institution
College of Communications Engineering, PLA University of Science and Technology, Nangjing, China
fYear
2015
Firstpage
483
Lastpage
488
Abstract
In this paper, according to fuzzy mathematical theory, the fuzzy evaluation sets were firstly established, then the frequencies of words or sentences in the corpus were represented as fuzzy membership vectors. In the end, the corresponding statistical language model was established through fuzzy mathematics, and the optimization method for determining the priority of the sentences was put forward too. The fuzzy membership vector was not as sharply as the maximum likelihood estimation in the use of frequency information, meanwhile fuzzy arithmetic could also effectively overcome noisy in the maximum likelihood estimation. The simulation experiments using Buffon´s needle data verify the rationality and validity of the given method. The given method in this paper also does not need smoothing, so it indirectly overcomes the various problems caused by smoothing in classical statistical language models.
Keywords
"Maximum likelihood estimation","Probability","Smoothing methods","Mathematical model","Frequency estimation","Computational modeling","Data models"
Publisher
ieee
Conference_Titel
Fuzzy Systems and Knowledge Discovery (FSKD), 2015 12th International Conference on
Type
conf
DOI
10.1109/FSKD.2015.7381990
Filename
7381990
Link To Document