DocumentCode :
166361
Title :
Joint layer based deep learning framework for bilingual machine transliteration
Author :
Sanjanaashree, P. ; Anand Kumar, M.
Author_Institution :
Center for Excellence in Comput. Eng. & Networking, Amrita Vishwa Vidyapeetham, Coimbatore, India
fYear :
2014
fDate :
24-27 Sept. 2014
Firstpage :
1737
Lastpage :
1743
Abstract :
Between the growth of Internet or World Wide Web (WWW) and the emersion of the social networking site like Friendster, Myspace etc., information society started facing exhilarating challenges in language technology applications such as Machine Translation (MT) and Information Retrieval (IR). Nevertheless, there were researchers working in Machine Translation that deal with real time information for over 50 years since the first computer has come along. Merely, the need for translating data has become larger than before as the world was getting together through social media. Especially, translating proper nouns and technical terms has become openly challenging task in Machine Translation. The Machine transliteration was emerged as a part of information retrieval and machine translation projects to translate the Named Entities based on phoneme and grapheme, hence, those are not registered in the dictionary. Many researchers have used approaches such as conventional Graphical models and also adopted other machine translation techniques for Machine Transliteration. Machine Transliteration was always looked as a Machine Learning Problem. In this paper, we presented a new area of Machine Learning approach termed as a Deep Learning for improving the bilingual machine transliteration task for Tamil and English languages with limited corpus. This technique precedes Artificial Intelligence. The system is built on Deep Belief Network (DBN), a generative graphical model, which has been proved to work well with other Machine Learning problem. We have obtained 79.46% accuracy for English to Tamil transliteration task and 78.4 % for Tamil to English transliteration.
Keywords :
belief networks; computational linguistics; information retrieval; language translation; natural language processing; DBN; English languages; Friendster; IR; Internet; MT; Myspace; Tamil languages; Tamil-to-English transliteration; World Wide Web; bilingual machine transliteration; computational linguistics; deep belief network; generative graphical model; grapheme; information retrieval; information society; joint layer based deep learning framework; language technology applications; machine learning problem; machine translation; named entities; phoneme; proper nouns; real time information; social media; social networking site; technical terms; Computers; Dictionaries; Joints; Neurons; Support vector machines; Training; Vectors; Artificial Intelligence; Computational Linguistics; Deep Belief Networks; Deep Learning; Machine Transliteration; Natural Language Processing; Restricted Boltzmann Machine;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Computing, Communications and Informatics (ICACCI, 2014 International Conference on
Conference_Location :
New Delhi
Print_ISBN :
978-1-4799-3078-4
Type :
conf
DOI :
10.1109/ICACCI.2014.6968553
Filename :
6968553
Link To Document :
بازگشت