DocumentCode :
3151949
Title :
Weakly supervised neural networks for Part-Of-Speech tagging
Author :
Chopra, Sonik ; Bangalore, S.
Author_Institution :
AT&T Labs.-Res., Florham Park, NJ, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
1965
Lastpage :
1968
Abstract :
We introduce a simple and novel method for the weakly supervised problem of Part-Of-Speech tagging with a dictionary. Our method involves training a connectionist network that simultaneously learns a distributed latent representation of the words, while maximizing the tagging accuracy. To compensate for the unavailability of true labels, we resort to training the model using a Curriculum: instead of random order, the model is trained using an ordered sequence of training samples, proceeding from “easier” to “harder” samples. On a standard test corpus, we show that without using any grammatical information, our model is able to outperform the standard EM algorithm in tagging accuracy, and its performance is comparable to other state-of-the-art models. We also show that curriculum learning for this setting significantly improves performance, both in terms of speed of convergence and in terms of generalization.
Keywords :
grammars; learning (artificial intelligence); natural language processing; neural nets; speech processing; connectionist network; curriculum learning; distributed latent representation; grammatical information; ordered sequence; part-of-speech tagging; random order; standard EM algorithm; tagging accuracy; training samples; weakly supervised neural networks; Accuracy; Artificial neural networks; Dictionaries; Hidden Markov models; Tagging; Training; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288291
Filename :
6288291
Link To Document :
بازگشت