Title :
Weakly supervised neural networks for Part-Of-Speech tagging
Author :
Chopra, Sonik ; Bangalore, S.
Author_Institution :
AT&T Labs.-Res., Florham Park, NJ, USA
Abstract :
We introduce a simple and novel method for the weakly supervised problem of Part-Of-Speech tagging with a dictionary. Our method involves training a connectionist network that simultaneously learns a distributed latent representation of the words, while maximizing the tagging accuracy. To compensate for the unavailability of true labels, we resort to training the model using a Curriculum: instead of random order, the model is trained using an ordered sequence of training samples, proceeding from “easier” to “harder” samples. On a standard test corpus, we show that without using any grammatical information, our model is able to outperform the standard EM algorithm in tagging accuracy, and its performance is comparable to other state-of-the-art models. We also show that curriculum learning for this setting significantly improves performance, both in terms of speed of convergence and in terms of generalization.
Keywords :
grammars; learning (artificial intelligence); natural language processing; neural nets; speech processing; connectionist network; curriculum learning; distributed latent representation; grammatical information; ordered sequence; part-of-speech tagging; random order; standard EM algorithm; tagging accuracy; training samples; weakly supervised neural networks; Accuracy; Artificial neural networks; Dictionaries; Hidden Markov models; Tagging; Training; Vectors;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288291