• DocumentCode
    3159266
  • Title

    Automatic standardization of spelling variations of Hindi text

  • Author

    Goyal, Vishal ; Lehal, Gurpreet Singh

  • Author_Institution
    Dept. of Comput. Sci., Punjabi Univ., Patiala, India
  • fYear
    2010
  • fDate
    17-19 Sept. 2010
  • Firstpage
    764
  • Lastpage
    767
  • Abstract
    The phonetic nature of Indian languages and multiple dialects, transliteration of proper names, words borrowed from foreign languages has resulted in spelling variations of the same word. Such variations sometimes can be treated as errors in writing. While developing machine translation system, the task of standardizing the spellings for further processing the text is considered to play vital role in improving the accuracy of translation. In this paper, the rule based approach for standardizing spelling variations in the Hindi text while developing Hindi to Punjabi Machine Translation System has been explained. It was analyzed that only 7.45% text was standardized using this approach and thus had increased the accuracy of the machine translation system.
  • Keywords
    language translation; natural language processing; text analysis; Hindi text; Hindi-Punjabi machine translation system; Indian languages; rule based approach; spelling variation standardization; Accuracy; Databases; Dictionaries; Knowledge based systems; Speech; Speech recognition; Standardization; Machine Translation; Natural Language Processing; Preprocessing Module; Standardizing spelling variations; Text Normalization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer and Communication Technology (ICCCT), 2010 International Conference on
  • Conference_Location
    Allahabad, Uttar Pradesh
  • Print_ISBN
    978-1-4244-9033-2
  • Type

    conf

  • DOI
    10.1109/ICCCT.2010.5640441
  • Filename
    5640441