DocumentCode
2467688
Title
Twitter part-of-speech tagging using pre-classification Hidden Markov model
Author
Sun, Shichang ; Liu, Hongbo ; Lin, Hongfei ; Abraham, Ajith
fYear
2012
fDate
14-17 Oct. 2012
Firstpage
1118
Lastpage
1123
Abstract
Hidden Markov models (HMM) have been widely used in natural language processing (NLP), especially in syntactic level applications, which appears naturally as short-range-dependent sequence recognition problems. But the structure of HMM limits the usage of global knowledge including the sentiment analysis of the text, which has become an increasingly popular research topic in NLP now. In this paper, we propose a novel treatment of HMM model to use the result of sentimental subjectivity analysis in syntactic level task, i.e. part-of-speech (POS) tagging. The subjectivity information is introduced as a pre-classification procedure into the interval-type HMM. The subjectivity degree of the testing sentence is used as a combination factor to choose an appropriate value from the interval. Experiments results on public tagging data sets shows that the proposed approach enhanced the performance of POS tagging.
Keywords
hidden Markov models; natural language processing; social networking (online); text analysis; HMM; NLP; POS tagging; Twitter part-of-speech tagging; global knowledge; interval-type HMM; natural language processing; preclassification hidden Markov model; public tagging data sets; sentimental subjectivity analysis; short-range-dependent sequence recognition problems; syntactic level applications; testing sentence subjectivity degree; text sentiment analysis; Analytical models; Data models; Hidden Markov models; Natural language processing; Niobium; Tagging; Training; Hidden Markov models; Naive Bayes model; Part-of-speech tagging; Subjectivity analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics (SMC), 2012 IEEE International Conference on
Conference_Location
Seoul
Print_ISBN
978-1-4673-1713-9
Electronic_ISBN
978-1-4673-1712-2
Type
conf
DOI
10.1109/ICSMC.2012.6377881
Filename
6377881
Link To Document