DocumentCode :
2273546
Title :
Probabilistic N-gram language model for SMS Lingo
Author :
Damdoo, Rina ; Shrawankar, Urmila
Author_Institution :
Dept. of Comput. Sci. & Eng., G.H. Raisoni Coll. of Eng., Nagpur, India
fYear :
2012
fDate :
25-27 April 2012
Firstpage :
114
Lastpage :
118
Abstract :
This paper presents a pioneering step in designing Bi-Gram based decoder for SMS Lingo. In the last few years, a significant increment in both the computational power and storage capacity of computers, and the availability of large volumes of bilingual data, have made possible for Statistical Machine Translation (SMT) to become an actual and practical technology. This paper employs Bi-Gram Language Model (LM) with a SMT decoder through which a sentence written with short forms in an SMS is translated into long form sentence. Here the results over a development and test set are analyzed and commented. The main objective behind this project is to analyze the improvement in efficiency as the size of bilingual corpus increases.
Keywords :
language translation; natural language processing; probability; statistical analysis; LM; SMS Lingo; SMT; SMT decoder; bigram based decoder; bigram language model; long form sentence; probabilistic n-gram language model; statistical machine translation; Computational modeling; Decoding; Educational institutions; Smoothing methods; Sun; Testing; Training data; Bi-gram; HashMap; Parallel alligned corpus; Probability distribution table(PDT); Statistical Language Model; Statistical Machine Translation (SMT);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Recent Advances in Computing and Software Systems (RACSS), 2012 International Conference on
Conference_Location :
Chennai
Print_ISBN :
978-1-4673-0252-4
Type :
conf
DOI :
10.1109/RACSS.2012.6212708
Filename :
6212708
Link To Document :
بازگشت