• DocumentCode
    3696627
  • Title

    Effectiveness of social media text classification by utilizing the online news category

  • Author

    Phat Jotikabukkana;Virach Sornlertlamvanich;Okumura Manabu;Choochart Haruechaiyasak

  • Author_Institution
    School of ICT, Sirindhorn International Institute of Technology, Thammasat University, Pathum Thani 12121, Thailand
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    Social media text can illustrate significant information of our real social situation. It can show the direction of real-time social movement. However, it has its own characteristics such as using short text and informal language, many unstructured information and argot. This kind of text is hard to classify and difficult to analyze to extract the useful information. In this paper, we propose an effective technique to classify the social media text by utilizing the initial keywords from well-formed sources of data, such as online news. Term frequency-inverse document frequency weighting technique (TF-IDF) and Word Article Matrix (WAM) are used as main methods in this research. We use the extracted keywords from the well-formed source as a main factor to do experiment on Twitter messages. We found a set of the social media keywords can represent the essence of social events and can be used to classify the text effectively.
  • Keywords
    "Hafnium","Manganese"
  • Publisher
    ieee
  • Conference_Titel
    Advanced Informatics: Concepts, Theory and Applications (ICAICTA), 2015 2nd International Conference on
  • Print_ISBN
    978-1-4673-8142-0
  • Type

    conf

  • DOI
    10.1109/ICAICTA.2015.7335361
  • Filename
    7335361