• DocumentCode
    3730399
  • Title

    A new representation of statistical language model

  • Author

    Zhenjun Yue; Siyuan Gu; Chanzhen Rong; Yuan Wang

  • Author_Institution
    College of Communications Engineering, PLA University of Science and Technology, Nangjing, China
  • fYear
    2015
  • Firstpage
    483
  • Lastpage
    488
  • Abstract
    In this paper, according to fuzzy mathematical theory, the fuzzy evaluation sets were firstly established, then the frequencies of words or sentences in the corpus were represented as fuzzy membership vectors. In the end, the corresponding statistical language model was established through fuzzy mathematics, and the optimization method for determining the priority of the sentences was put forward too. The fuzzy membership vector was not as sharply as the maximum likelihood estimation in the use of frequency information, meanwhile fuzzy arithmetic could also effectively overcome noisy in the maximum likelihood estimation. The simulation experiments using Buffon´s needle data verify the rationality and validity of the given method. The given method in this paper also does not need smoothing, so it indirectly overcomes the various problems caused by smoothing in classical statistical language models.
  • Keywords
    "Maximum likelihood estimation","Probability","Smoothing methods","Mathematical model","Frequency estimation","Computational modeling","Data models"
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems and Knowledge Discovery (FSKD), 2015 12th International Conference on
  • Type

    conf

  • DOI
    10.1109/FSKD.2015.7381990
  • Filename
    7381990