• DocumentCode
    3318443
  • Title
    A Double Metaphone encoding for Bangla and its application in spelling checker
  • Author
    UzZaman, Naushad ; Khan, Mumit
  • Author_Institution
    Center for Research on Bangla Language Processing, BRAC University, Dhaka, Bangladesh
  • fYear
    2005
  • fDate
    30 Oct.-1 Nov. 2005
  • Firstpage
    705
  • Lastpage
    710
  • Abstract
    We present a Double Metaphone encoding for Bangla that can be used by spelling checkers to improve the quality of suggestions for misspelled words. The complex rules of Bangla spelling present a significant challenge in producing suggestions for a misspelled word when employing the traditional edit-distance methods; one must take phonetic similarity into account for the suggested alternatives to be reasonably accurate. We propose a Double Metaphone encoding for Bangla, taking into account the various context-sensitive rules, including those involving the large repertoire of consonant clusters in Bangla, and present a comparison with the traditional edit-distance based methods in producing suggestions for misspelled words.
  • Keywords
    computational linguistics; natural languages; speech coding; word processing; Bangla; Double Metaphone encoding; context-sensitive rules; misspelled word; phonetic similarity; spelling checker; traditional edit-distance method; Clustering algorithms; Encoding; Modems; Natural languages; Bangla; Bengali; Double Metaphone; Phonetic Encoding; Spelling Checker; Spelling suggestions;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
  • Print_ISBN
    0-7803-9361-9
  • Type
    conf
  • DOI
    10.1109/NLPKE.2005.1598827
  • Filename
    1598827