Title : 
Improving Short Text Classification through Better Feature Space Selection
         
        
            Author : 
Meng Wang ; Lanfen Lin ; Feng Wang
         
        
            Author_Institution : 
Coll. of Comput. Sci. & Technol., Zhejiang Univ., Hangzhou, China
         
        
        
        
        
        
            Abstract : 
Nowadays people are overwhelmed by more and more short information from lots of different applications, especially with the rapid development of mobile systems. One way to alleviate this issue is an automatic classification of the short texts before they are delivered to users. Several methods have been proposed to classify the short texts, and they are largely based on expanding the short texts to longer ones with external resources to solve the sparseness problem. Different from these studies, we tackle the sparseness problem by selecting a better feature space in which the feature vectors of the short texts are denser, and our method needs no external resources at all. The experimental results on an open dataset show that this method can significantly improve the short text classification accuracy comparing with the baseline, especially when the dimension of the feature space is low.
         
        
            Keywords : 
classification; text analysis; vectors; feature space selection; feature vector; short text classification; sparseness problem; Accuracy; Encyclopedias; Presses; Search engines; Support vector machine classification; Text categorization; feature space; feature words; short text classification;
         
        
        
        
            Conference_Titel : 
Computational Intelligence and Security (CIS), 2013 9th International Conference on
         
        
            Conference_Location : 
Leshan
         
        
            Print_ISBN : 
978-1-4799-2548-3
         
        
        
            DOI : 
10.1109/CIS.2013.32