DocumentCode
2273546
Title
Probabilistic N-gram language model for SMS Lingo
Author
Damdoo, Rina ; Shrawankar, Urmila
Author_Institution
Dept. of Comput. Sci. & Eng., G.H. Raisoni Coll. of Eng., Nagpur, India
fYear
2012
fDate
25-27 April 2012
Firstpage
114
Lastpage
118
Abstract
This paper presents a pioneering step in designing Bi-Gram based decoder for SMS Lingo. In the last few years, a significant increment in both the computational power and storage capacity of computers, and the availability of large volumes of bilingual data, have made possible for Statistical Machine Translation (SMT) to become an actual and practical technology. This paper employs Bi-Gram Language Model (LM) with a SMT decoder through which a sentence written with short forms in an SMS is translated into long form sentence. Here the results over a development and test set are analyzed and commented. The main objective behind this project is to analyze the improvement in efficiency as the size of bilingual corpus increases.
Keywords
language translation; natural language processing; probability; statistical analysis; LM; SMS Lingo; SMT; SMT decoder; bigram based decoder; bigram language model; long form sentence; probabilistic n-gram language model; statistical machine translation; Computational modeling; Decoding; Educational institutions; Smoothing methods; Sun; Testing; Training data; Bi-gram; HashMap; Parallel alligned corpus; Probability distribution table(PDT); Statistical Language Model; Statistical Machine Translation (SMT);
fLanguage
English
Publisher
ieee
Conference_Titel
Recent Advances in Computing and Software Systems (RACSS), 2012 International Conference on
Conference_Location
Chennai
Print_ISBN
978-1-4673-0252-4
Type
conf
DOI
10.1109/RACSS.2012.6212708
Filename
6212708
Link To Document