DocumentCode :
3658884
Title :
Biterm-based multilayer perceptron network for tagging short text
Author :
Hao Hu;Ping Li;Yan Chen
Author_Institution :
Center for Intelligent and Networked Systems, School of Computer Science, Southwest Petroleum University, Chengdu 610500, P.R. China
fYear :
2015
fDate :
7/1/2015 12:00:00 AM
Firstpage :
212
Lastpage :
217
Abstract :
With the emergence of online social media, usergenerated content (UGC) in short text form is becoming the most popular and valuable information available on the web. Mining potential resources from large scale short texts is particularly critical but challenging for many content analysis tasks. Among the text mining, one of useful techniques is tagging. Conventional approach to tag short text only use simple text-level word co-occurence. However, different from long articles, short texts have limited words, which do not provide sufficient information on tags. In this paper, we propose a novel method for tagging short texts by using multilayer perceptron network (MLP), which takes full advantage of the generation of word co-occurrences by biterms whose definition will be given in the text. With the help of biterm, we make the inference effective with rich training information in MLP. Experiments on real-word short text collections show that the proposed methods outperforms the tranditional approach on ZHIHU datasets.
Keywords :
"Training","Tagging","Testing","Accuracy","Conferences","Random access memory","Internet"
Publisher :
ieee
Conference_Titel :
Cybernetics and Intelligent Systems (CIS) and IEEE Conference on Robotics, Automation and Mechatronics (RAM), 2015 IEEE 7th International Conference on
Print_ISBN :
978-1-4673-7337-1
Electronic_ISBN :
2326-8239
Type :
conf
DOI :
10.1109/ICCIS.2015.7274575
Filename :
7274575
Link To Document :
بازگشت