DocumentCode
3757344
Title
Research on Construction of Tibetan Sentiment Corpus
Author
Tao Huang;Xiaodong Yan
Author_Institution
Sch. of Inf. Eng., Minzu Univ. of China, Beijing, China
fYear
2015
Firstpage
591
Lastpage
593
Abstract
Sentiment classification is one of the research hot spots in natural language processing. Compared with English and Chinese, sentiment analysis research on Tibetan is difficult because a suitable sentiment corpus is lacking. In this paper, we construct a Tibetan sentiment corpus by crawling Tibetan websites and by manual Chinese-Tibetan translation. The resulting corpus basically meets experimental requirements. It contains 10,134 emotion sentences: 2,025 manually translated sentences and 8,109 sentences crawled from the web.
Keywords
"Sentiment analysis","Internet","Training","Data mining","Crawlers","Monitoring"
Publisher
ieee
Conference_Titel
Broadband and Wireless Computing, Communication and Applications (BWCCA), 2015 10th International Conference on
Type
conf
DOI
10.1109/BWCCA.2015.31
Filename
7424896