• DocumentCode
    166361
  • Title

    Joint layer based deep learning framework for bilingual machine transliteration

  • Author

    Sanjanaashree, P. ; Anand Kumar, M.

  • Author_Institution
    Center for Excellence in Comput. Eng. & Networking, Amrita Vishwa Vidyapeetham, Coimbatore, India
  • fYear
    2014
  • fDate
    24-27 Sept. 2014
  • Firstpage
    1737
  • Lastpage
    1743
  • Abstract
    Between the growth of Internet or World Wide Web (WWW) and the emersion of the social networking site like Friendster, Myspace etc., information society started facing exhilarating challenges in language technology applications such as Machine Translation (MT) and Information Retrieval (IR). Nevertheless, there were researchers working in Machine Translation that deal with real time information for over 50 years since the first computer has come along. Merely, the need for translating data has become larger than before as the world was getting together through social media. Especially, translating proper nouns and technical terms has become openly challenging task in Machine Translation. The Machine transliteration was emerged as a part of information retrieval and machine translation projects to translate the Named Entities based on phoneme and grapheme, hence, those are not registered in the dictionary. Many researchers have used approaches such as conventional Graphical models and also adopted other machine translation techniques for Machine Transliteration. Machine Transliteration was always looked as a Machine Learning Problem. In this paper, we presented a new area of Machine Learning approach termed as a Deep Learning for improving the bilingual machine transliteration task for Tamil and English languages with limited corpus. This technique precedes Artificial Intelligence. The system is built on Deep Belief Network (DBN), a generative graphical model, which has been proved to work well with other Machine Learning problem. We have obtained 79.46% accuracy for English to Tamil transliteration task and 78.4 % for Tamil to English transliteration.
  • Keywords
    belief networks; computational linguistics; information retrieval; language translation; natural language processing; DBN; English languages; Friendster; IR; Internet; MT; Myspace; Tamil languages; Tamil-to-English transliteration; World Wide Web; bilingual machine transliteration; computational linguistics; deep belief network; generative graphical model; grapheme; information retrieval; information society; joint layer based deep learning framework; language technology applications; machine learning problem; machine translation; named entities; phoneme; proper nouns; real time information; social media; social networking site; technical terms; Computers; Dictionaries; Joints; Neurons; Support vector machines; Training; Vectors; Artificial Intelligence; Computational Linguistics; Deep Belief Networks; Deep Learning; Machine Transliteration; Natural Language Processing; Restricted Boltzmann Machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Advances in Computing, Communications and Informatics (ICACCI, 2014 International Conference on
  • Conference_Location
    New Delhi
  • Print_ISBN
    978-1-4799-3078-4
  • Type

    conf

  • DOI
    10.1109/ICACCI.2014.6968553
  • Filename
    6968553