• DocumentCode
    2273546
  • Title

    Probabilistic N-gram language model for SMS Lingo

  • Author

    Damdoo, Rina ; Shrawankar, Urmila

  • Author_Institution
    Dept. of Comput. Sci. & Eng., G.H. Raisoni Coll. of Eng., Nagpur, India
  • fYear
    2012
  • fDate
    25-27 April 2012
  • Firstpage
    114
  • Lastpage
    118
  • Abstract
    This paper presents a pioneering step in designing Bi-Gram based decoder for SMS Lingo. In the last few years, a significant increment in both the computational power and storage capacity of computers, and the availability of large volumes of bilingual data, have made possible for Statistical Machine Translation (SMT) to become an actual and practical technology. This paper employs Bi-Gram Language Model (LM) with a SMT decoder through which a sentence written with short forms in an SMS is translated into long form sentence. Here the results over a development and test set are analyzed and commented. The main objective behind this project is to analyze the improvement in efficiency as the size of bilingual corpus increases.
  • Keywords
    language translation; natural language processing; probability; statistical analysis; LM; SMS Lingo; SMT; SMT decoder; bigram based decoder; bigram language model; long form sentence; probabilistic n-gram language model; statistical machine translation; Computational modeling; Decoding; Educational institutions; Smoothing methods; Sun; Testing; Training data; Bi-gram; HashMap; Parallel alligned corpus; Probability distribution table(PDT); Statistical Language Model; Statistical Machine Translation (SMT);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Recent Advances in Computing and Software Systems (RACSS), 2012 International Conference on
  • Conference_Location
    Chennai
  • Print_ISBN
    978-1-4673-0252-4
  • Type

    conf

  • DOI
    10.1109/RACSS.2012.6212708
  • Filename
    6212708