• DocumentCode
    3659819
  • Title

    Building a parallel Corpora: Translation issues and remedial case

  • Author

    Archana G P; Jithesh V S; Remya L B;Elizabeth Sherly

  • Author_Institution
    Virtual Resource Center for Language Computing, Indian Institute of Information Technology and Management- Kerala, India
  • fYear
    2015
  • Firstpage
    2414
  • Lastpage
    2417
  • Abstract
    In a multilingual country like India, language translation plays a prominent role in the field of text processing and applications, such as, machine learning, machine translation, information extraction, information retrieval, and natural language understanding systems. There are many issues and challenges in translation between languages such as lexical divergences, ambiguities, lexical mismatches, reordering, syntactic and semantic issues, structural changes etc. In this paper, we have presented a comparative analysis by highlighting some of the crucial issues, challenges and solutions in translation from Hindi to Malayalam language, while executing Indian Language Corpora Initiative (ILCI) project of DeitY, GoI. In this analysis, we have found that Malayalam and Hindi have very distinct structures in linguistic levels like phonological, lexical and syntactic, which has to be handled carefully in translation.
  • Keywords
    "Syntactics","Context","Buildings","Information technology","Informatics","Semantics","Agriculture"
  • Publisher
    ieee
  • Conference_Titel
    Advances in Computing, Communications and Informatics (ICACCI), 2015 International Conference on
  • Print_ISBN
    978-1-4799-8790-0
  • Type

    conf

  • DOI
    10.1109/ICACCI.2015.7275980
  • Filename
    7275980