• DocumentCode
    3752266
  • Title

    Document classification with spherical word vectors

  • Author

    Yiqiao Pan;Chao Xing;Dong Wang

  • Author_Institution
    Center for Speech and Language Technology (CSLT) Research Institute of Information Technology, Tsinghua University, Beijing, P.R. China
  • fYear
    2015
  • Firstpage
    270
  • Lastpage
    273
  • Abstract
    Recent research shows that low-dimensional continuous representations of words (word vectors) can be successfully employed to classify documents, and document vectors derived from semantic clustering work better than those derived from simple average pooling. On the other hand, our recent study demonstrated that embedding words on a hypersphere offers better performance on tasks including semantic relatedness and bilingual translation when compared to the original approach that embeds words in an unconstrained plane space. In this paper, spherical word vectors are applied to the document classification task. The experiments show that spherical word vectors can deliver good performance when combined with semantic clustering based on vMF distributions.
  • Keywords
    "Semantics","Training","Mixture models","Clustering methods","Syntactics","Mathematical model","Data models"
  • Publisher
    ieee
  • Conference_Titel
    Signal and Information Processing Association Annual Summit and Conference (APSIPA), 2015 Asia-Pacific
  • Type

    conf

  • DOI
    10.1109/APSIPA.2015.7415518
  • Filename
    7415518