• DocumentCode
    2745796
  • Title

    Clustering in tweets using a fuzzy neighborhood model

  • Author

    Miyamoto, Sadaaki ; Suzuki, Shohei ; Takumi, Satoshi

  • Author_Institution
    Dept. of Risk Eng., Univ. of Tsukuba, Tsukuba, Japan
  • fYear
    2012
  • fDate
    10-15 June 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Clustering of keywords in tweets is studied. A series of tweets is handled as a sequence of words and an inner product space is introduced to a set of keywords on the basis of positive definite kernels using a fuzzy neighborhood defined on that sequence. Methods of agglomerative hierarchical clustering as well as c-means clustering are applied. Pairwise constraints are moreover introduced to improve interpretability of clusters. Real tweets are analyzed with discussion of the resulting clusters.
  • Keywords
    constraint handling; fuzzy set theory; pattern clustering; social networking (online); word processing; agglomerative hierarchical clustering; c-means clustering; cluster interpretability; fuzzy neighborhood model; keyword clustering; pairwise constraint; product space; tweet; word sequence; Accidents; Clustering algorithms; Kernel; Oceans; Rain; Twitter; Typhoons;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Fuzzy Systems (FUZZ-IEEE), 2012 IEEE International Conference on
  • Conference_Location
    Brisbane, QLD
  • ISSN
    1098-7584
  • Print_ISBN
    978-1-4673-1507-4
  • Electronic_ISBN
    1098-7584
  • Type

    conf

  • DOI
    10.1109/FUZZ-IEEE.2012.6250800
  • Filename
    6250800