Title :
Development of a rule based learning system for splitting compound words in Malayalam language
Author :
Nair, Latha R. ; Peter, S. David
Author_Institution :
Div. of Comput. Sci., Cochin Univ. of Sci. & Technol., Cochin, India
Abstract :
Morphological analyzers are essential for any type of natural language processing works. As Malayalam like other Dravidian languages is an agglutinative language it needs a compound word splitter as a preprocessor. An algorithm has been developed and successfully used for splitting the compound words. 90% success has been established in initial scrutiny of around 4000 compound words. The splitter can be used for developing and implementing a full fledged morphological analyzer.
Keywords :
knowledge based systems; learning (artificial intelligence); natural language processing; word processing; Dravidian languages; Malayalam language; agglutinative language; compound word splitter; morphological analyzers; natural language processing works; rule based learning system; Algorithm design and analysis; Arrays; Compounds; Educational institutions; Morphology; Speech processing; Dravidian languages; Malayalam; finite automata; morphology; natural language processing; parts-of-speech tagger; sandhi splitter;
Conference_Titel :
Recent Advances in Intelligent Computational Systems (RAICS), 2011 IEEE
Conference_Location :
Trivandrum
Print_ISBN :
978-1-4244-9478-1
DOI :
10.1109/RAICS.2011.6069410