Title :
An approach to integrate translation memory in MT system for English to Indian language
Author :
Shukla, V.N. ; Jaiswal, Nishtha ; Shukla, Seema
Author_Institution :
Election Comm. of India, New Delhi, India
Abstract :
Although there is demand for English to Indian language translation, translators lack adequate tools because of the complexities involved. Research on Machine Translation (MT) systems is ongoing, but these systems have not yet reached the stage of usable products. The major reasons are the word-level, phrase-level and semantic-level translation complexities that arise from differences in culture and language use, and the non-availability of good-quality Translation Memories (TMs). Translation memories are widely used by the translation industry because of their ease of use and their ability to create glossaries, terminologies and phrasal dictionaries in a domain. They also help maintain a consistent writing style throughout a series of documents, and their human-machine interface is better suited to human translators. MT systems, on the other hand, produce better translations for simpler sentences when tuned properly. Complex and compound sentences are difficult to translate by machine alone, so human post-editing is required. This paper proposes integrating the two methods to obtain the advantages of both, providing better translation support systems that let translators work faster without compromising quality and help them keep a consistent writing style. The concept was validated with a small experiment using a widely used TM tool (Wordfast) and an English to Indian Language Rule Based Machine Translation system provided by IIT Kanpur.
Keywords :
dictionaries; document handling; glossaries; human computer interaction; language translation; natural language processing; English-to-Indian language rule based machine translation system; English-to-Indian language translation; MT system; Wordfast; human machine interface; phrasal dictionaries; phrase level translation complexities; semantic level translation complexities; terminologies; translation industry; translation memory integration; translation support systems; word level translation complexities; Complexity theory; Databases; Grammar; Semantics; Terminology; Web services; RBMT; Translation Memory (TM); Translation Memory eXchange (TMX)
Conference_Title :
Cognitive Computing and Information Processing (CCIP), 2015 International Conference on
Conference_Location :
Noida
DOI :
10.1109/CCIP.2015.7100699