• DocumentCode
    2813786
  • Title

    Adaptive Web document classification with MCRDR

  • Author

    Kim, Yang Sok ; Park, Sung Sik ; Deards, Edward ; Kang, Byeong Ho

  • Author_Institution
    Sch. of Comput., Tasmania Univ., Hobart, Tas., Australia
  • Volume
    1
  • fYear
    2004
  • fDate
    5-7 April 2004
  • Firstpage
    476
  • Abstract
    With the explosive increase in Web based information, the need for an intelligent agent for automatic classification has also been increased resulting in many research discoveries in this area. Machine learning (ML) based document classification is now the prevalent approach. However, classification by ML may not keep the same performance because the knowledge generated from the training set may not be appropriate for certain types of Web information. People are often concerned more about the newly uploaded information such as Web based online news than information already available. This explains why it is not widely used in real applications. However, the manual classification method, by the domain users, cannot be a solution either until the knowledge acquisition bottleneck issue is resolved. Multiple classification ripple down rules, an incremental knowledge acquisition method, is suggested to overcome this problem with fast learning and low cost maintenance.
  • Keywords
    Internet; classification; knowledge acquisition; learning (artificial intelligence); MCRDR; Web document classification; Web information; automatic classification; fast learning; knowledge acquisition; low cost maintenance; machine learning; multiple classification ripple down rules; Classification tree analysis; Costs; Explosives; Intelligent agent; Knowledge acquisition; Machine learning; Manuals; Monitoring; Search engines; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
  • Print_ISBN
    0-7695-2108-8
  • Type

    conf

  • DOI
    10.1109/ITCC.2004.1286502
  • Filename
    1286502