• DocumentCode
    2235391
  • Title

    Development of a rule based learning system for splitting compound words in Malayalam language

  • Author

    Nair, Latha R. ; Peter, S. David

  • Author_Institution
    Div. of Comput. Sci., Cochin Univ. of Sci. & Technol., Cochin, India
  • fYear
    2011
  • fDate
    22-24 Sept. 2011
  • Firstpage
    751
  • Lastpage
    755
  • Abstract
    Morphological analyzers are essential for any kind of natural language processing work. Because Malayalam, like other Dravidian languages, is agglutinative, it needs a compound word splitter as a preprocessor. An algorithm has been developed and used successfully to split compound words. In an initial evaluation on around 4000 compound words, it achieved a 90% success rate. The splitter can be used to develop and implement a full-fledged morphological analyzer.
  • Keywords
    knowledge based systems; learning (artificial intelligence); natural language processing; word processing; Dravidian languages; Malayalam language; agglutinative language; compound word splitter; morphological analyzers; natural language processing works; rule based learning system; Algorithm design and analysis; Arrays; Compounds; Educational institutions; Morphology; Speech processing; Dravidian languages; Malayalam; finite automata; morphology; natural language processing; parts-of-speech tagger; sandhi splitter;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Recent Advances in Intelligent Computational Systems (RAICS), 2011 IEEE
  • Conference_Location
    Trivandrum
  • Print_ISBN
    978-1-4244-9478-1
  • Type

    conf

  • DOI
    10.1109/RAICS.2011.6069410
  • Filename
    6069410