Title :
Statistical Tagger for Bhojpuri (employing Support Vector Machine)
Author :
Srishti Singh;Girish Nath Jha
Author_Institution :
Centre for Linguistics, Jawaharlal Nehru University, New Delhi, India
Abstract :
The authors present the first Support Vector Machines (SVM) based statistical Parts of Speech (POS) Tagger developed for Bhojpuri. Bhojpuri is a less resourced Indo Aryan language of the Asian continent and the POS tagger presented here is a step towards developing language resources for it. SVMs have already been trained on other languages like Malayalam and Bengali with an accuracy of 86-90 %. The present research came up with approximately 87.3 -88.6% accuracy for test datasets.
Keywords :
"Support vector machines","Tagging"
Conference_Titel :
Advances in Computing, Communications and Informatics (ICACCI), 2015 International Conference on
Print_ISBN :
978-1-4799-8790-0
DOI :
10.1109/ICACCI.2015.7275829