• DocumentCode
    673200
  • Title

    Hot topic extraction based on frequency, position, scattering and topical weight for time sliced news documents

  • Author

    Jahnavi, Y. ; Radhika, Y.

  • Author_Institution
    Comput. Sci. & Eng. Dept., GITAM Univ., Hyderabad, India
  • fYear
    2013
  • fDate
    21-22 Sept. 2013
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    Internet-based news documents are a basic medium of information transmission, so detecting hot topics and tracking how events develop is important. However, it is almost impossible to view all of the generated topics because of their sheer volume. It is therefore necessary to rank topics by importance. That importance is determined by how frequently a topic appears, and it varies across time slots. To extract hot topics, most text mining approaches based on the vector space model need to determine the weights of feature terms. Traditional algorithms cannot achieve high accuracy in retrieving hot terms because they do not consider position, scattering, and topicality. This paper presents an innovative and effective hot term extraction method that considers the position, scattering, and topicality of terms along with their frequency.
  • Keywords
    Internet; data mining; document handling; information retrieval; text analysis; Internet based news documents; event development tracking; hot term retrieval; hot topic extraction; information transmission media; innovative hot term extraction; text mining approach; time sliced news documents; topic frequency; topic position; topic scattering; topical weight; vector space model; Algorithm design and analysis; Feature extraction; Scattering; Text mining; Time-frequency analysis; Vectors; TF-PDF; Term Weighting; Text Mining; Vector Space Model (VSM);
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Advanced Computing Technologies (ICACT), 2013 15th International Conference on
  • Conference_Location
    Rajampet
  • Print_ISBN
    978-1-4673-2816-6
  • Type

    conf

  • DOI
    10.1109/ICACT.2013.6710495
  • Filename
    6710495