DocumentCode
3752266
Title
Document classification with spherical word vectors
Author
Yiqiao Pan;Chao Xing;Dong Wang
Author_Institution
Center for Speech and Language Technology (CSLT) Research Institute of Information Technology, Tsinghua University, Beijing, P.R. China
fYear
2015
Firstpage
270
Lastpage
273
Abstract
Recent research shows that low-dimensional continuous representations of words (word vectors) can be successfully employed to classify documents, and document vectors derived from semantic clustering work better than those derived from simple average pooling. On the other hand, our recent study demonstrated that embedding words on a hypersphere offers better performance on tasks including semantic relatedness and bilingual translation when compared to the original approach that embeds words in an unconstrained plane space. In this paper, spherical word vectors are applied to the document classification task. The experiments show that spherical word vectors can deliver good performance when combined with semantic clustering based on vMF distributions.
Keywords
"Semantics","Training","Mixture models","Clustering methods","Syntactics","Mathematical model","Data models"
Publisher
ieee
Conference_Titel
Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2015 Asia-Pacific
Type
conf
DOI
10.1109/APSIPA.2015.7415518
Filename
7415518
Link To Document