Title :
Document Classification Based on the Topic Evaluation and Its Usage in Data Compression
Author :
Jan Martinovic;Jiri Dvorsky
Author_Institution :
VSB - Tech. Univ. of Ostrava, Ostrava
Abstract :
Several actions are usually performed when document is appended to textual database in information retrieval system. The most frequent actions are compression of the document and cluster analysis of the textual database to improve quality of answers to users´ queries. The information retrieved from the clustering can be very helpful in compression. Word-based compression using information about cluster hierarchy is presented in this paper. Some experimental results are provided at the end of the paper.
Keywords :
"Data compression","Search engines","Information retrieval","Web search","Intelligent agent","Spatial databases","Clustering algorithms","Clustering methods","Extraterrestrial measurements","Conferences"
Conference_Titel :
Web Intelligence and Intelligent Agent Technology Workshops, 2007 IEEE/WIC/ACM International Conferences on
Print_ISBN :
0-7695-3028-1
DOI :
10.1109/WI-IATW.2007.109