• DocumentCode
    2866102
  • Title

    Classification of breaking news taken from the online news sites

  • Author

    Kilic, Erdal ; Tavus, Mustafa Resit ; Karhan, Zehra

  • Author_Institution
    Bilgisayar Muhendisligi Bolumu, Ondokuz Mayis Univ., Samsun, Turkey
  • fYear
    2015
  • fDate
    16-19 May 2015
  • Firstpage
    363
  • Lastpage
    366
  • Abstract
    In this study, we aimed to provide access to the breaking news depending on the category to which the user wants. First, accessing to news in certain categories are provided from the news provider by using RSS (Really Simple Syndication). Preprocessing is implemented by cleaning xml tags and punctuation which can cause illusions before the content are obtained on datum. The features which can represent our classes in categories were determined by applying the methods in data mining for content after preprocessing phase. In the last step of process, Classification of category process is done by obtaining breaking news´ content taken as online. In the phase of classification, Categorization were implemented with features which represent each category and by using C4.5i Naive Bayes and SMO (Sequential minimal optimization) functions, respectively. The performance rates in the usage methods and classification rates are shown in comparison.
  • Keywords
    Bayes methods; Internet; Web sites; optimisation; pattern classification; RSS; SMO; breaking news classification; category process classification; naive Bayes function; online news site; really simple syndication; sequential minimal optimization; Blogs; Cleaning; Optimization; Signal processing; Text mining; XML; C4.5; Categorization News; Naive Bayes; SMO; Text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing and Communications Applications Conference (SIU), 2015 23th
  • Conference_Location
    Malatya
  • Type

    conf

  • DOI
    10.1109/SIU.2015.7129834
  • Filename
    7129834