Title :
Application of multilayer perceptron network for tagging parts-of-speech
Author :
Ahmed ; Raju, S. Bapi ; Chandrasekhar, Pammi V S ; Prasad, M. Krishna
Author_Institution :
Dept. of Comput. Sci. & Eng., Muffakham Jah Coll. of Eng. & Technol., Hyderabad, India
Abstract :
This paper presents a neural network based part-of-speech tagger that learns to assign correct part-of-speech tags to the words in a sentence. A multilayer perceptron (MLP) network with three-layers is used. The MLP-tagger is trained with error back-propagation learning algorithm. The representation scheme for the input and output of the network is adapted from Ma et al. (1966). The tagger is trained on SUSANNE English tagged-corpus consisting of 156,622 words. The MLP-tagger is trained using 85% of the corpus. Based on the tag mappings learned, the MLP-tagger demonstrated an accuracy of 90.04% on test data that also included words unseen during the training. Results from our experiments suggest that the MLP-tagger combined with the representation scheme adopted here could be a better substitute for traditional tagging approaches. This method shows promise for addressing parts-of-speech tagging problem for Indian language text considering the fact that most of the Indian language corpora, especially tagged ones, are still considerably small in size.
Keywords :
backpropagation; feedforward neural nets; multilayer perceptrons; speech recognition; Indian language corpora; Indian language text; SUSANNE English tagged-corpus; error back-propagation learning algorithm; multilayer perceptron network; neural network based part-of-speech tagger; representation scheme; Application software; Computer errors; Computer networks; Computer science; Educational institutions; Hidden Markov models; Multilayer perceptrons; Neural networks; Tagging; Testing;
Conference_Titel :
Language Engineering Conference, 2002. Proceedings
Print_ISBN :
0-7695-1885-0
DOI :
10.1109/LEC.2002.1182291