• Title of article

    Learning to classify short text from scientific documents using topic models with various types of knowledge

  • Author/Authors

    Vo، نويسنده , , Duc-Thuan and Ock، نويسنده , , Cheol-Young، نويسنده ,

  • Issue Information
    روزنامه با شماره پیاپی سال 2015
  • Pages
    15
  • From page
    1684
  • To page
    1698
  • Abstract
    Classification of short text is challenging due to data sparseness, which is a typical characteristic of short text. In this paper, we propose methods for enhancing features using topic models, which make short text seem less sparse and more topic-oriented for classification. We exploited topic model analysis based on Latent Dirichlet Allocation for enriched datasets, and then we presented new methods for enhancing features by combining external texts from topic models that make documents more effective for classification. In experiments, we utilized the title contents of scientific articles as short text documents, and then enriched these documents using topic models from various types of universal datasets for classification in order to show that our approach performs efficiently.
  • Keywords
    Topic model , Data sparseness , Latent Dirichlet Allocation , information retrieval , Short text classification
  • Journal title
    Expert Systems with Applications
  • Serial Year
    2015
  • Journal title
    Expert Systems with Applications
  • Record number

    2355556