• DocumentCode
    3627022
  • Title

    Document Classification Based on the Topic Evaluation and Its Usage in Data Compression

  • Author

    Jan Martinovic;Jiri Dvorsky

  • Author_Institution
    VSB - Tech. Univ. of Ostrava, Ostrava
  • fYear
    2007
  • Firstpage
    204
  • Lastpage
    207
  • Abstract
    Several actions are usually performed when document is appended to textual database in information retrieval system. The most frequent actions are compression of the document and cluster analysis of the textual database to improve quality of answers to users´ queries. The information retrieved from the clustering can be very helpful in compression. Word-based compression using information about cluster hierarchy is presented in this paper. Some experimental results are provided at the end of the paper.
  • Keywords
    "Data compression","Search engines","Information retrieval","Web search","Intelligent agent","Spatial databases","Clustering algorithms","Clustering methods","Extraterrestrial measurements","Conferences"
  • Publisher
    ieee
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology Workshops, 2007 IEEE/WIC/ACM International Conferences on
  • Print_ISBN
    0-7695-3028-1
  • Type

    conf

  • DOI
    10.1109/WI-IATW.2007.109
  • Filename
    4427572