• DocumentCode
    2867205
  • Title

    Automatic classification of software related microblogs

  • Author

    Prasetyo, P.K. ; Lo, Daniel ; Achananuparp, P. ; Yuan Tian ; Ee-Peng Lim

  • Author_Institution
    Sch. of Inf. Syst., Singapore Manage. Univ., Singapore, Singapore
  • fYear
    2012
  • fDate
    23-28 Sept. 2012
  • Firstpage
    596
  • Lastpage
    599
  • Abstract
    Millions of people, including those in the software engineering communities have turned to microblogging services, such as Twitter, as a means to quickly disseminate information. A number of past studies by Treude et al., Storey, and Yuan et al. have shown that a wealth of interesting information is stored in these microblogs. However, microblogs also contain a large amount of noisy content that are less relevant to software developers in engineering software systems. In this work, we perform a preliminary study to investigate the feasibility of automatic classification of microblogs into two categories: relevant and irrelevant to engineering software systems. We extract features from the textual content of the microblogs and the titles of any URLs mentioned in the microblogs. These features are then used to learn a discriminative model used in classifying relevant and irrelevant microblogs. We show that our trained model can achieve a promising classification performance.
  • Keywords
    Web sites; information dissemination; pattern classification; software engineering; URL; automatic classification; classification performance; discriminative model; information dissemination; irrelevant microblogs; microblogging services; relevant microblogs; software engineering; software engineering communities; software related microblogs; software systems; textual content; Accuracy; Feature extraction; Media; Software engineering; Software systems; Twitter;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Software Maintenance (ICSM), 2012 28th IEEE International Conference on
  • Conference_Location
    Trento
  • ISSN
    1063-6773
  • Print_ISBN
    978-1-4673-2313-0
  • Type

    conf

  • DOI
    10.1109/ICSM.2012.6405330
  • Filename
    6405330