DocumentCode
185331
Title
Part of speech tagging with Naïve Bayes methods
Author
Cretulescu, R. ; David, Alexandre ; Morariu, D. ; Vintan, Lucian
Author_Institution
Comput. Sci. & Electr. Eng. Dept., “Lucian Blaga” Univ. of Sibiu, Sibiu, Romania
fYear
2014
fDate
17-19 Oct. 2014
Firstpage
446
Lastpage
451
Abstract
In this paper we have focused on the problem of automatic prediction of parts of speech in sentences. We present an experimental framework which includes the analysis and the implementation of methods for part of speech (POS) labeling (tagging). We have tested three methods that predict the POS without current word´s context and also three context awareness statistic methods. The main goal of our work was to evaluate the three statistical methods Forward, Backward and Complete Method in order to analyze their applicability in the problem of automatically prediction of the POS. These methods are derived from the classic Naïve Bayes classifier. In our research we have used the WordNet database and a set of benchmarks called the Brown University Standard Corpus of Present - Day American English. The results obtained by the non-context-awareness methods compared to the results obtained by statistical methods are better but not so reliable like the statistical methods.
Keywords
natural language processing; statistical analysis; Brown University Standard Corpus of Present; POS labeling; backward method; complete method; context awareness statistic methods; forward method; naive Bayes methods; part-of-speech tagging; parts-of-speech prediction; Accuracy; Bayes methods; Context; Measurement; Speech; Tagging; Training; NLP; Naïve Bayes; Part of Speech Prediction;
fLanguage
English
Publisher
ieee
Conference_Titel
System Theory, Control and Computing (ICSTCC), 2014 18th International Conference
Conference_Location
Sinaia
Type
conf
DOI
10.1109/ICSTCC.2014.6982457
Filename
6982457
Link To Document