Title :
Probabilistic language model for template messaging based on Bi-gram
Author :
Damdoo, Rina ; Shrawankar, Urmila
Author_Institution :
Dept. of Comput. Sci. & Eng., G.H. Raisoni Coll. of Eng., Nagpur, India
Abstract :
This paper reports the benefits of Probabilistic language modeling in template messaging domain. Through a Statistical Machine Translation (SMT) sentences written with short forms, misspelled words and chatting slang can be corrected. Given a source-language (e.g., Short message) sentence, the problem of machine translation is to automatically produce a target-language (e.g., Long form English) translation, to be used by the young generation for messaging. The main goal behind this project is to analyze the improvement in efficiency as the size of bilingual corpus increases. Machine learning and translation systems, dictionary and textbook preparations, patent and reference searches, and various information retrieval systems are the main applications of the project.
Keywords :
dictionaries; information retrieval systems; language translation; natural languages; patents; text analysis; Bi-gram; SMT sentences; bilingual corpus; chatting slang; dictionary; information retrieval systems; long form English; machine learning; misspelled words; patent; probabilistic language model; reference searches; short message; source-language sentence; statistical machine translation; target-language translation; template messaging; textbook preparations; translation systems; Lead; Manganese; Smoothing methods; Language Model; Machine Translation; N-gram; Probability distribution table (PDT); Statistical Machine Translation (SMT); Text Normalization;
Conference_Titel :
Advances in Engineering, Science and Management (ICAESM), 2012 International Conference on
Conference_Location :
Nagapattinam, Tamil Nadu
Print_ISBN :
978-1-4673-0213-5