• DocumentCode
    1866760
  • Title

    A word prediction methodology for automatic sentence completion

  • Author

    Spiccia, Carmelo ; Augello, Agnese ; Pilato, Giovanni ; Vassallo, Giorgio

  • Author_Institution
    Ist. di Calcolo e Reti ad Alte Prestazioni (ICAR), Palermo, Italy
  • fYear
    2015
  • fDate
    7-9 Feb. 2015
  • Firstpage
    240
  • Lastpage
    243
  • Abstract
    Word prediction generally relies on n-grams occurrence statistics, which may have huge data storage requirements and does not take into account the general meaning of the text. We propose an alternative methodology, based on Latent Semantic Analysis, to address these issues. An asymmetric Word-Word frequency matrix is employed to achieve higher scalability with large training datasets than the classic Word-Document approach. We propose a function for scoring candidate terms for the missing word in a sentence. We show how this function approximates the probability of occurrence of a given candidate word. Experimental results show that the proposed approach outperforms non neural network language models.
  • Keywords
    computational linguistics; matrix algebra; natural language processing; statistical analysis; text analysis; asymmetric word-word frequency matrix; automatic sentence completion; latent semantic analysis; n-grams occurrence statistics; word prediction methodology; Accuracy; Semantics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Semantic Computing (ICSC), 2015 IEEE International Conference on
  • Conference_Location
    Anaheim, CA
  • Type

    conf

  • DOI
    10.1109/ICOSC.2015.7050813
  • Filename
    7050813